TTCGAAGAGT ATTAGTACAT TCTTTGAGAT TGGAGCTAGT ATGAAAATCC ATAAAACCGT 4500
GAATCCTGTT GCCTATGAAA ATACCTATTA TCTAGAAGGC GAAAAGCACC TCATCGTCGT 4560 
CGATCCTGGT AGTCATTGGG AAGCCATTCG TCAGACAATC GAGAAGATCA ACAAACCGAT 4620

CTGTGCTATT CTCTTGACCC ACGCCCATTA TGACCATATC ATGAGTCTGG ACTTGGTTCG 4680 
CGAGACGTTT GGCAATCCTC CTGTCTATAT CGCAGAGAGC GAAGCCAGCT GGCTCTACAC 4740 
TCCTGTCGAT AATCTCTCCG GTCTCCCTCG CCACGATGAT ATGGCAGATG TGGTCACAAA 4800 

ACCTGCAGAA CACACCTTTG TCTTTCACGA AGAATACCAA CTAGAGGAAT TTCGTTTTAA 4860 

GGTTCTACCG ACCCCAGGGC ACTCTATCGG TGGTGTTTCC CTAGTCTTTC CTGATGCTCA 4920

TCTAGTCTTG ACGGGAGATG CTCTATTCCG CGAAACTATC GGACGGACCG ACCTTCCGAC 4980 

TGGTAGCATG GAGCAACTCC TTCATAGTAT CCAGACCCAA CTCTTCACCC TACCAAACTA 5040 

CGATGTCTAT CCAGGACATG GTCCAGCTAC TACTATCGCT CACGAAAAGG CCTTCAATCC 5100 

CTTTTTCTAG CAAGATGATG ACAATCGAAA TTTAAGTAAA CTATCCAGCA AATCTTTCTA 5160 

TTACAAAAGG CATCCTATCA AGGTTTTCAC ACATGATTGG ATGCCTTTTT TCTGATGACT 5220 

AGATTTTTTG CATTACCAAA TAATCACGCG CTCCTCTGGT GAACGCCACA TTCCGTCTCC 5280 

TTCTTTGACA TCATAGGTTG TAAAGAAATC GTCGAAGTTT GGTACTTGCA CATTGACACG 5340 

GAGTTTGGCT GGTGCGTGCA CATCGACGCT AGCCAAAAGT TTCATAAATT CTGGTCGACC 5400 

•riTCATGCGC CAGATGCGAC CGAAGTTGTA GAAGAACTCT TCTGCTGAGA AGTCTGCTTC 5460

TCTCTTAGCT GCTTCAAGCG CTGCTGCGAT TCCTCCCAAG TCAGCCACGT TTTCTGATAC 5520 

AGTCAATTTA CCGTTAATGG TTGCTCCATA AGAATCCTGT CCATCAAATT GGTCAATGAC 5580 

TTTTTGTGTT TTCTCCTTGA AGGCAGCATA GTCGCTCTCT GTCCACCAAT CCTTGAGGCT 5640 

ACCATTTTCG TCAAAGGAAG CCCCGTTAGT ATCAAAGGCG TGGGAAATTT CATGGGCAAT 5700 

CACTGCCCCA ATACCACCGT AGTTAGCAGA AGATGACTGA TGCAAGTCAT AGAAAGGCGC 5760 

CTGTAAAATG GCCGCTGGAA AGACAATCAG GTTCTTCTGA GGATTGTAGT AGGCATTGAC 5820 

CATATGAGCA GGCATGCCCC ATTCCTTATA ATCTACAGGC TGGTTCCACT TACTCCAACT 5880 

GTGCTTGATT TCCACACGCG CAAAGGCTAG AGCATTCTCA AAAAGACTGG CAGTTTCATT 5940 

CACTACCTTA TCCTTGTAAC GTGCAGGCAA TTCTTCTGGA TAGCCAATAT AAGGTTTGAT 6000 

CACATTGAGC TTCACGATAG CCTGTTTACA GGTTTCTGGA GTGAGCCAGT CATTCTTAAG 6060 

CAGACGCTCC TTATAAACAT CAATCATGGT TGCCACTTTT TTCTCCACAT CCGCCTTGGC 6120 

TTCTGGAGAG AACTTCTCAC GGGCGTACCA AAGACCCAGG GCTTGCTTGA AAGGTTCTTG 6180 
TGCTAGATGA TAAGCTGCTT TGACCTTATC TTTTGCCTCT GGAACTCCAG AAAGGGCACG 6240
GCTGTAGGCA CCAGACAAAA CACGGATATC CTCTGTTAAA TAGCTGGTTG AAAGATTGAC 6300 
AACACTCAAA ATCAAGGTTG CTTTAAGGAG AGACCAGGCT TCCTCACTGT AGAATTGCTC 6360 

TGCTGCTTGC CAGAAACGTT CCTCGTCTAC AATAACCTTG TCTGGTAATT GCCCAATAAC 6420 

TGCTTTGAAG AAGTCATCCA AAGGTAGGGC AGGCGCGAAT TTCTTGAAAT CTTCGTAAGA 6480 

ATATGGATGA TAGAGTTTAG CATATTCTGA ACTTTCTTCA TTAGAGAGCA CCACTGCCGC 6540

AACTCGGCGG TCCAATTCAA GTCTTTTTTC TAGCAAGTCT TCAATTTCTT CATCAGAGAA 6600 

ATCATAAGCC TTGAGGAGAT TTGCGCTGCT TTCTTTCCAA AGAGTCAAGA GCTCTTCGCG 6660 

CTGAGGATGT TCTTCTGCAT AGTAGGTCGT ATCTGGCAAG ATTGTGCTTG GAGCGCTAGC 6720 

CCATAGAACA TTGATTCTAG CATCCATAAA GTCTGGCGAT ACACCAAAAG GAAGGAAGTT 6780 

TGGTTTTCCT GCAAGCTCAA ACTCTGCTAG TTTAGCTGTA AAATCCGCAA AAGTCTCCAA 6840 

TTCTTGGAAT TCTTTAAGGA GTGGTAAGAC AGGTGTGATA CCGTCAGCTT CTCTCTTGTC 6900 

AAAATCACGA ACTAGGCGGT GGTATTTGAC AAAGTTTTCC AAGATAGCAT CCTCAGGCAC 6960 

TTCTTCACCT GCTAACCACT TGTCTGTTGT CGCCAGCATC AGGTCTTCAA TTTCCTGGTC 7020 

TAAATCAACA AAACCTCCTG TTTGAGACTT ATCTGCTGGG ATTTCAGCTG TCTGTTGCCA 7080 

TTCTCCATTG ATAGCATCAT AAAAATCATC TTGATAACGT GTCATCTTGT TCTCGCTTTC 7140 

ATTTGTATTT GCATTTATCT TAACAAAAAT CG 7172 
(2) INFORMATION FOR SEQ ID NO: 121: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4518 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121: 
CGGGAAGTTA TGCGATCTAG ACTTCGTTCC TGTACAGCTA CTTTCTCAGG TGGTCTTGTT 60 

GTTTGTATGA GTTTGTTTAG AGAGGATCTT TCTATGTCTT TCTTTCTTAT TTTTGTTTTA 120 

TATGCTTTTC TGATTTCTTA TCTAATTTAT GGTTATTTCA GACTAAAAAG GAAATACCGA 180 

GTAGATGAAT AGCAAGGTTC TAGGTCTTCA GATTGATTTT TAGCACTCTT GATAAAAGAG 240 

TGCTAATTTT TTGAGTTTTT GTCTTGACAT TCTCTTCTAA GGGTGTATAA TAGAATCATG 300 

AGTTAGCACT TGGATGCATT GAGTGCTAAT TGATCAGACA GAGAGGAGTG ATGAGATGGT 360

TACAGAGCGT CAGCAGGATA TTTTAAATCT GATTATTGAC ATCTTTACCA AAACGCACGA 420 
ACCTGTCGGA TCAAAAGCCT TGCAAGAGTC TATTAACTCT AGCAGTGCAA CCATTCGTAA 480

TGACATGGCG GAACTAGAAA AACAAGGGTT GCTTGAGAAG GCTCATACTT CAAGTGGTCG 540 

GATGCCAAGT GTTGCTGGTT TTCAGTACTA TGTGAAACAC TCACTGGATT TTGACCGGCT 600 

GGCTGAAAAT GAGGTATATG AGATTGTCAA AGCCTTTGAT CAGGAATTCT TCAAATTGGA 660 

GGATATTCTG CAAGAGGCTG CTAACTTACT AACAGACCTG AGTGGCTGTA CGGTAGTGGC 720 

ACTGGATGTT GAGCCGAGCA GGCAACGTTT GACAGCCTTT GATATCGTTG TTTTGGGGCA 780 

ACATACAGCC TTGGCGGTAT TTACCCTAGA CGAGTCGCGA ACGGTTACTA GTCAGTTTCT 840 

GATTCCAAGG AACTTCTTGC AGGAGGATTT GCTGAAACTG AAGAGCATCA TTCAGGAACG 900 

TTTCCTCGGT CACACCGTTT TAGATATTCA CTACAAGATT CGGACGGAGA TTCCGCAGAT 960 

TATCCAGCGT TACTTTACAA CAACGGATAA TGTCATCGAT CTCTTTGAAC ACATCTTTAA 1020 

GGAAATGTTC AACGAAAACA TTGTGATGGC GGGCAAGGTC CATCTCTTGA ATTTTGCCAA 1080 

TCTAGCAGCC TATCAGTTCT TTGACCAACC GCAAAAGGTG GCCTTGGAGA TTCGTGAGGG 1140 

GTTGCGTGAG GATCAGATGC AAAATGTTCG TGTTGCAGAC GGTCAAGAGT CCTGTTTAGC 1200 

TGACCTAGCG GTAATCAGTA GTAAGTTCCT CATTCCTTAT CGGGGAGTTG GAATTCTAGC 1260

CATTATCGGT CCAGTTAATC TGGATTACCA ACAGCTAATC AATCAAGTCA ATGTGGTCAA 1320 

CCGTGTTTTG ACCATGAAGT TGACAGATTT TTACCGCTAC CTCAGCAGTA ATCATTACGA 1380 

AGTACAttAa GATTGAAATC ATTAAAGGAG GCGAACATGG CCCAAGATAT AAAAAATGAA 1440 

GAAGTAGAAG AAGTTCAAGA AGAGGAAGTT GTGAAAACAG CTGAAGAAAC AACTCCTGAA 1500 

AAGTCTGAGT TGGACTTGGC AAATGAACGT GCAGATGAGT TCGAAAACAA ATATCTTCGC 1560 

GCTCATGCAG AAATGCAAAA TATCCAACGC CGTGCCAATG AAGAACGTCA AAACTTGCAA 1620 

CGTTATCGTA GCCAGGACTT GGCAAAAGCA ATCTTACCAT CTCTTGACAA CCTTGAGCGT 1680 

GCACTTGCAG TTGAAGGTTT GACAGATGAT GTGAAGAAGG GCTTGGGGAT GGTGCAAGAA 1740 

AGCTTGATTC ACGCTTTGAA AGAAGAAGGA ATTGAAGAAA TCGCAGCAGA TGGCGAATTT 1800 

GACCATAACT ACCATATGGC CATCCAAACT CTCCCAGCAG ACGATGAACA CCCAGTAGAT 1860 
ACCATCGCTC AAGTCTTTCA AAAAGGCTAC AAACTCCATG ACCGCATCCT ACGCCCAGCA 1920

ATGGTAGTGG TGTATAACTA AGATATAAAG CCCGTAAAAA GCTCGCAGTA AAAATAGGAG 1980 

ATTGACGAAG TGTTCGATGA ACACAAGAAA ATCTATCTTT TTTACTCAGA GCTTAGGGCG 2040 

TGTTCGATTC GGCAATTCTG ACGGTAGCTA AAGCAACTCG TCAGAAAACG GCAATCGCTA 2100 

TGGCGTTTGC CTAGCTTCCT TACTAACTCG TCGTCGAAAT AAAATCGATT TCGACTCCTC 2160 
GTGTCGCAAT TTACATAATA GAAAACTTGT CCGAAACGAC AATAAACTAT GAAGAAAGAT 2220
AAAATATGTT TGGCTTTGTA ATAGTGAGCG AAGCGAACCA AACACGATAC TCTTCGCCGT 2280

GGCGCTATTT GCGCAAATTT TGAGACCTTA GGCTCAAAGT TTAGTCAAAG AGATTGACGA 2340 
AGTCAAGCTC TGACGGCGTC GCCACTGTCG CCACTTAAGA AGAGTATCAA AAAGAAAAAT 2400 
AGAAAATTAA CTAACAAGGA GAAAAACACA TGTCTAAAAT TATCGGTATT GACTTAGGTA 2460

CAACAAACTC AGCAGTTGCA GTTCTTGAAG GAACTGAAAG CAAAATCATC GCAAACCCAG 2520 
AAGGAAACCG CACAACTCCA TCTGTAGTCT CATTCAAAAA CGGAGAAATC ATCGTTGGTG 2580 
ATGCTGCAAA ACGTCAAGCA GTTACAAACC CAGATACAGT TATCTCTATC AAATCTAAGA 2640 
TGGGAACTTC TGAAAAAGTT TCTGCAAATG GAAAAGAATA CACTCCACAA GAAATCTCAG 2700 
CTATGATCCT TCAATACTTG AAAGGCTACG CTGAAGACTA CCTTGGTGAG AAAGTAACCA 2760

AAGCTGTTAT CACAGTTCCG GCTTACTTCA ACGACGCTCA ACGTCAAGCA ACAAAAGACG 2820 

CTGGTAAAAT TGCTGGTCTT GAAGTAGAAC GTATTGTTAA CGAACCAACT GCAGCAGCTC 2880

TTGCTTATGG TTTGGACAAG ACTGACAAAG AAGAAAAAAT CTTGGTATTT GACCTTGGTG 2940

GTGGTACATT CGACGTCTCT ATCCTTGAAT TGGGTGACGG TGTCTTCGAC GTATTGTCAA 3000 

CTGCAGGGGA CAACAAACTT GGTGGTGACG ACTTTGACCA AAAAATCATT GACCACTTGG 3060 

TAGCAGAATT CAAGAAAGAA AACGGTATCG ACTTGTCTAC TGACAAGATG GCAATGCAAC 3120 

GTTTGAAAGA TGCGGCTGAA AAAGCGAAGA AAGACCTTTC TGGTGTAACT TCAACACAAA 3180

TCAGCTTGCC ATTTATCACT GCAGGTGAGG CTGGACCTCT TCACTTGGAA ATGACTTTGA 3240 

CTCGTGCGAA ATTTGACGAT TTGACTCGTG ACCTTGTTGA ACGTACAAAA GTTCCAGTTC 3300

GTCAAGCCCT TTCAGATGCA GGTTTGAGCT TGTCAGAAAT CGACGAAGTT ATCCTTGTTG 3360

GTGGTTCAAC TCGTATCCCT GCCGTTGTTG AAGCTGTTAA AGCTGAAACT GGTAAAGAAC 3420 

CAAACAAATC AGTAAACCCT GATGAAGTAG TTGCTATGGG TGCGGCTATC CAAGGTGGTG 3480 

TGATTACTGG TGATGTCAAG GACGTTGTCC TTCTTGATGT AACGCCATTG TCACTTGGTA 3540 

TCGAAACAAT GGGTGGAGTA TTTACAAAAC TTATCGATCG CAACACTACA ATCCCAACAT 3600 

CTAAATCACA AGTCTTCTCA ACAGCAGCAG ACAACCAACC AGCCGTTGAT ATCCACGTTC 3660 

TTCAAGGTGA ACGCCCAATG GCAGCAGATA ACAAGACTCT TGGACGCTTC CAATTGACTG 3720 

ATATCCCAGC TGCACCTCGT GGAATTCCTC AAATCGAAGT AACATTTGAC ATCGACAAGA 3780 

ACGGTATCGT GTCTGTTAAG GCCAAAGACC TTGGAACTCA AAAAGAACAA ACTATTGTCA 3840 

TCCAATCGAA CTCAGGTTTG ACTGACGAAG AAATCGACCG CATGATGAAA GATGCAGAAG 3900

CAAACGCTGA AGCCGATAAG AAACGTAAAG AAGAAGTAGA CCTTCGTAAT GAAGTAGACC 3960 
AAGCAATCTT TGCGACTGAA AAGACAATCA AGGAAACTGA AGGTAAAGGC TTCGACGCAG 4020

AACGTGACGC TGCCCAAGCT GCCCTTGATG ACCTTAAGAA AGCTCAAGAA GACAACAACT 4080 

TGGACGACAT GAAAACAAAA CTTGAAGCAT TGAACGAAAA AGCTCAAGGA CTTGCTGTTA 4140 

AACTCTACGA ACAAGCCGCA GCAGCGCAAC AAGCTCAAGA AGGAGCAGAA GGCGCACAAG 4200

CAACAGGGAA CGCAGGCGAT GACGTCGTAG ACGGAGAGTT TACGGAAAAG TAAGATGAGT 4260 

GTATTGGATG AAGAGTATCT AAAAAATACA CGAAAAGTTT ATAATGATTT TTGTAATCAA 4320

GCTGATAACT ATAGAACATC AAAAGATTTT ATTGATAATA TTCCAATAGA ATATTTAGCT 4380 

AGATATAGAG AATTATATTA GCTGAACATG ATAGTTGTAT CAAAAATGAT GAAGCGGTAA 4440 

GGAATTTTGT TACCTCAGTA TTGTTGTCTG CATTTGTATC GGCGATGGTA CCGTATCTGA 4500 

CGAACGTTCA GCTTATAT 4518
(2) INFORMATION FOR SEQ ID NO: 122: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8145 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122: 

TGCTATTTTC GATTCCCTTG GGCGTTTTGA TTGCCTTTGC CTTGCAAGTC CATTGGAAGC 60 

CCCTCCATTA TCTGATTAAC ATTTACATCT GGGTTATGCG AGGAACCCCC TTACTCTTGC 120 

AACTGATTTT TATCTATTAT GTGCTCCCAA GTATTGGGAT TCGTTTAGAC CGCCTTCCTG 180 

CAGCTATTAT TGCCTTTGTT CTCAACTATG CAGCTTACTT TGCAGAAATT TTCCGTGGGG 240 

GAATTGACAC TATTCCAAGA GGACAGTATG AGGCCGCCAA GGTCTTGAAG TTTAGCCCTT 300 

TTGACAGAGT GCGCTATATT ATCTTGCCCC AAGTGACCAA GATCGTTCTT CCTAGTGTCT 360 

TTAATGAAGT TATGAGTTTG GTCAAGGATA CTTCTTTGGT CTATGCTCTC GGAATTTCAG 420 

ACCTTATCTT GGCTAGTCGA ACAGCTGCTA ACCGCGATGC TAGTCTAGTT CCTATGTTCT 480 

TGGCAGGAGC CATTTATTTG ATTTTGATTG GGATTGTGAC AATTATTTCC AAAAAAGTTG 540 

AGAAGAAGTA TAGTTATTAT AGATAGGAGG CTGCCATGTT AGAATTACGA AATATCAATA 600 

AAGTCTTTGG AGACAAACAA ATCCTGTCTA ATTTCAGTCT AAGTATTCCT GAAAAGCAAA 660 

TCCTGGCTAT CGTTGGACCT TCTGGTGGAG GTAAGACAAC TCTTTTACGT ATGCTTGCAG 720 

GTCTTGAAAC CATTGATTCA GGGCAAATCT TTTATAATGG ACAACCTTTA GAGCTGGATG 780 
AATTGCAGAA GCGCAATCTA CTGGGATTTG TCTTCCAAGA TTTTCAACTA TTTCCTCATC 840

TATCAGTTCT GGAAAATTTG ACTTTATCGC CTGTGAAGAC CATGGGAATG AAGCAGGAAG 900 

AGGCTGAGAA GAAGGCGAGT GGACTCTTGG AACAGTTAGG ACTAGGAGGA CACGCAGAGG 960 

CCTATCCTTT CTCACTATCT GGTGGGCAAA AGCAGCGGGT GGCTTTGGCG CGTGCTATGA 1020 

TGATTGACCC AGAAATCATT GGCTACGATG AACCAACTTC TGCCCTGGAT CCAGAATTAC 1080 

GTTTGGAAGT GGAGAAGCTA ATCTTGCAAA ATAGGGAACT TGGGATGACC CAGATTGTGG 1140 

TTACCCATGA TTTGCAGTTT GCTGAAAATA TCGCAGATGT ATTATTGAAA GTAGAACCTA 1200 

AATAGGAGGA AAAATGGATG AAAAAATGGA TGCTTGTATT AGTCAGTCTG ATGACTGCTT 1260 

TGTTCTTAGT AGCTTGTGGG AAAAATTCTA GCGAAACTAG TGGAGATAAT TGGTCAAAGT 1320 

ACCAGTCTAA CAAGTCTATT ACTATTGGAT TTGATAGTAC TTTTGTTCCA ATGGGATTTG 1380 

CTCAG7VAAGA TGGTTCTTAT GCAGGATTTG ATATTGATTT AGCTACAGCT GTTTTTGAAA 1440 

AATACGGAAT CACGGTAAAT TGGCAACCGA TTGATTGGGA TTTGAAAGAA GCTGAATTGA 1500 

CAAAAGGAAC GATTGATCTG ATTTGGAATG GCTATTCCGC TACAGACGAA CGCCGTGAAA 1560 

AGGTGGCTTT CAGTAACTCA TATATGAAGA ATGAGCAGG7 ATTGGTTACG AAGAAATCAT 1620 

CTGGTATCAC GACTGCAAAG GATATGACTG GAAAGACATT AGGAGCTCAA GCTGGTTCAT 1680 

CTGGTTATGC GGACTTTGAA GCAAATCCAG AAATTTTGAA GAATATTGTC GCTAATAAGG 1740 

AAGCGAATCA ATACaVir.CC TTTAATGAAG CCT~TGAT"I-Ga ITTGAAAAAC GATCGAATTG 1800 

ATGGTCTATT GATTGACCGT GTCTATGCAA ACTATTATTT AGAAGCAGAA GGTGTTTTAA 1860 

ACGATTATAA TGTCTTTACA GTTGGACTAG AAACAGAAGC TTTTGCGGTT GGAGCCCGTA 1920 

AGGAAGATAC AAACTTGGTT AAGAAGATAA ATGAAGCTTT TTCTAGTCTT TACAAGGACG 1980

GCAAGTTCCA AGAAATCAGC CAAAAATGGT TTGGAGAAGA TGTAGCAACC AAAGAAGTAA 2040 

AAGAAGGACA GTAAGATAAA ATAGTGGCTG AAACTGCGTT TTGATTAGCA AAACGTAGTT 2100 

TTTTTTGTAA TCTAGGAAAA CGATAATAGC GATTGAATAT GGATAATTGA ATATGGAATA 2160 

GCCCACTGTG ATTTCTAAAA CATTGTTAAA AATTGATTTG ACTTCCAAAA TTAAAATGTT 2220 

CTGTAATGAA ATACTGATGT AACTGTTTTA GGAACAATAA AACGCATAAT ATCAAGGTTT 2280 

TTGCACCTTA CATTATGCGT TTTTGTGATT TTAAGACTTG TTAGCTGATT TTTTACAATC 2340 

CTGCGAAATC TTTGATTTCT TGTGCTGACA TTGAAGAGTC GCAACGGACG TTGATTTGTC 2400

CATCTGTAAT ATGAACAAAA CCTGGTACAG TTGGGATTCC ATAGCGTGAG CGGAATGCTT 2460 

GCAAATCATT GAGTTGGCTT GGTTCTTCAC TATTGATGAA GTAAATGTGA GCTTTGGTTT 2520 

CAGCTACGAC ACCTGACAAT GTACCTGCAA ATTTACGGCA GTAAGGGCAA GTTTTGCGAC 2580 
CGATAAAGAA GGTTGCAGTT TCTTTTTTAT CAAGAGCTTC TTGCGCACGC ACAACTGTAG 2640

TGACTTCAAG GTCTTTGATG TTATCTAAAA ATTGTTCCAT GAGATTACCT CGCTTTCATT 2700 

GATAAGTCTA GTATGCCATA AAGTTTCTAA AATTGCTTAG ATTTGATACG AAAAAAGATG 2760

AGGTTGGTTG GTCTCATCTT TTATAGGTCT TTATTTTACA AATGCATTGA TTTCTGCTTC 2820 

GATGTTAGCA ATCTTAGCTT GTGATTCTTC GTTGGTTTCC CCTACAACTG CAATGTAGAA 2880 

CTTGATTTTT GGTTCTGTAC CTGAAGGGCG AACGGCAATC CATGAACCGT CAGCAAGTGT 2940 

GTATTTCAAC ACATCACTTG GAGGAGTTGT CAAGTTTGTA ACAGTACCGT CAGCAACAGT 3000 

AGCAGTTTGT GCCTTGAAGT CTTCTACGAC AGTGATAGCT GTTGCGTTCC ATTCTGTTGG 3060 

AGCATTGTTG CGGAATTTAG CCATAATCGC TTTGATTTGT TCAGCACCAT CGACACCTGA 3120 

AAGAGTAACA GAGATTGTTT TTTCTGCGTA GTAGCCATAT TCTTTATAGA TTTCTTCGAT 3180 

ACCGTCAGCA AGTGTCAAAC CACGAGAACG GTAGTAGGCA GCAAGTTCAG CAACTACAAG 3240

AACGGCTTGG ATGGCATCTT TATCACGTAC AAATGGTTTA ATCAAGTAAC CGAAGCTTTC 3300 

TTCAAATCCC ATCATGTAAG TGTGGTTGTG TTTTTCTTCG AATTCTTGGA TTTTTTCAGC 3360

GATAAATTTG AAACCTGTCA AGACGTTGAA CATAGTTGCG CCGTAGCTTT CAGCAATCTT 3420 

CGTTACCAAG TCAGTTGAAA CGATAGATTT GCAGAGAGCG GCATTTTCAG GAAGAGTTCC 3480 

AGCGTTTTTG TGAGCTTCCA AGATGTATTT AGCCATGATA GCACCGATTT GGTTACCTGA 3540 

AAGGTTGAGG TAGCTACCAT CTTTTTGAAG AACTTCAACA CCAACACGGT CAGCGTCTGG 3600 

GTCAGTTGCG ACAAGAACAT CTGCACCAAC TTGACGACCA AGTTCTTCAG CAAGGGCAAA 3660 

GGCTGCTTGG CTTTCTGGGT TTGGAGATGT TACAGTTGAA AAGTCTGGGT CAGCAGTTGC 3720 

TTGCGCTTCA ACAACTTGAA CAGAGTCAAA TCCTGCTTGG GCAAGAGCAC GACGAGCCAA 3780 

CATTTCACCA GTACCATGAA GTGGTGTGTA GACAATCTTC ATGTCTTTAC CAAATTCTTC 3840 

AATCAAGGCT GGGTTGATGT TTATGTCCTT AACCTCTTTA AGGTATTCTA TGTCAACAGC 3900 

TTCGCCGATA ACTTCAATCA AGCCAGAAGC TTTTTCAGTT TCCACATCAG CAACTTCAAC 3960 

TGCAAATGGG TTTTCGATTG CACGGATATA AGTAGTCAAA GCGTCCGCAT CGTGTGGAGG 4020 

CATTTGTCCA CCGTCTTCAC CGTAAACCTT GTAACCGTTA AATGGAGCAG GGTTGTGGCT 4080

GGCTGTGACC ATGATACCTG CGAAACAGTT GAGATGACGA ACTGCAAATG ATAGTTCTGG 4140 

AGTCGGACGA AGGCTTTCAA ATACGTAAGA TTTGATGCCG TGTTTAGCAA GAACTGCCGC 4200 

AGATTCAAAG GCAAACTCAG GTGAGAAGTG ACGGCTATCG TAGGCAATTG CTACACCGCG 4260 

TTCTTTCTCG TTTCCACCTT TTGACTCAAT CAAACGAGCC AATCCTTCAG TAGCTTGGCG 4320
AACAACGTAG ATGTTGATAC GGTTTGTACC AGCACCAACC AAGCCACGCA TACCTGCAGT 4380

ACCAAATTCA AGATTTGTAT AGAAGGCATC TTCCTTAGTT TTTTCGTCCA TATTTTCCAA 4440 

ATCTTGACGA AGGTAGTCAC GAAGCTCCAC AAAATCAACC CATTTCTGGT AATTTTCTTG 4500

GTAAGACATT CAAATTCTCC TTTATTTTTA AAACATTTAA TCAGTTTAAT TATATCATTT 4560 

TTTTTAGTTT TAGTAAAACC TTATCTGCTT CGAACATCTC TTCAAACCAG GTCAGATTGA 4620 

ATTTTGGGGT TATATGATGT TGAGGCTAGG AAAAATTCAA TTTCAGTAAA AAAAGTAAGT 4680 

CTTCTCATAA CAAAACATTG ATATAGTTAC TTAGTTTTAA ACAAGCATAT TATAATAAAG 4740 

CTATGGCATA TAGTACTGAT TTTAAACAGC GAGCATTAGA TTACATCAAA GAGGGGCACA 4800

GCCATGTCGA GGCAGCCAAG TTTTTTGGTG TTGGCGTCAG AACTCTCTTC ACGTGGGAAA 4860 

AGAAAGACGT GAACAAGAAC ACATAGAGAG GAAAAAGCGA GTCGTCAAAA ACCGAAAGAT 4920 

TCCTTTAGAG GAATTGAAAG CCTTTGTAGA GGCTCATCCA GATGCTTTTT TACGGGAAAT 4980 

TGCGGCACAT TTTGATTGTG CTGTTCCTTC AGTATGGGCA GCTTTAAAGC AGATTAAGGT 5040 

CACTTTAAAA AAAGATGACG AGCTTTAAGG AACAAGACCC AGAAAAGTAG CCTTATTTCT 5100 

TAAGAATTTT AATAGTTTAA AGCACCTAGC ACCTGTTTAT ATTGATGAAA CAGGAATCGA 5160 

CCGCTATCTC TATCGTCCTT ATGCAGGGGC TCCTAGAGGG GAGAAAGTCT ATGAAAAGAT 5220 

TAGCGGACGT CGTTTTGAGC GAACTTCAAT TGTTGCAGGA CAAGTAGACG GAGAGTTTAT 5280 

AGCTCCCATG ATTTACAAGA AAAGCATGAC AAGCGATTTC TTTGTGGAGT GGTTChAnj^r 5340 

GCAACTCCTA CCTGCTTTGA AGACACCTCA TGTTATTGTC ATGGGCAATG CTGGTTTTCA 5400

TCCCAAGAAC ATTTTGGATG AACTCTGCAT CCAAGATAAA CACTTTTTCT TACCTCTACC 5460

ACCTTATTCA CCGGATTTGA ATCCTATTGA GCAAGCTTGG GCTATCTTGA AAAAGAAAGT 5520 

GACGGATGTA TTAAGGGAAG TTCCAACTAT TTTTGAATGT TTGGAATGCT TTTTTAAAAC 5580 

TAGATGACTA TAACGGTTCT AAAGGAACCT ATCGAGTAGT CATTAAAACT AAGGATACTG 5640 

CTGGTTAAGA GAAGACGGTA TACAATCAAA CCATTCACCG TGTAGCCGAA ATCGTTCAGA 5700 

ATGAAGACTT GTATCAGAAT GAAGACTTGT ATAAGAAAGG TTTGAATGTT GAACTTGCGC 5760

ACCAACAAAT TAAGGGATTT TTTGAAGCAG AGTTTAAAAA TCGTATTAAT GGAGTTCTTA 5820 

ATACTAAAAT AAAAAATAGT ACATTAAATC GTGTAAATAA AAAAACTATA CACCAGAGCA 5880 

ACAAAAACTC CATGATCAAT TTGAAGCAGA AGCAACGGAA GATGCTAAAA AACAAGGCGA 5940 

TATTGTGTTG AATGTTGACC AGGATTTCAT GAGCATATCT AAGTCTAATA AAAGTGGTTC 6000 

AGACTGGAAG AAAACTTTCA CAGTGAGGAT AACCAATAGG CTAGCAAATG ACTTGAATAA 6060 

TGTCTTGAAA CAGGTTGATA AAGATACTCC TAATACCCCA ACTTGGCTAA ACTCAGCTGC 6120 
TTCTAAAGCT AAAGATGATG ACAGAGTATA TAAACTACTG AAGACTCTTA TACCAGGAGA 6180

AAATTACCTA TCATGTTAAG GATAATCAGC TAGAAGTAGA AACAGATAAA TACACATATA 6240 

CTGCCGCTAG AAATGGTAGT AAGGAAGTTG GTATTCAAGA GTCAGATATA GCAGCAACTC 6300 

TAAGTGCCGA TGAATATAAT TCTAATCGCC AAACTTTTGA GAGAGAATAC AAATACAAAA 6360

GCAAATGCCC TTAATAATGG TTGGGCTAGA TCTGGTTCTG AAGAGTTCAA AAAGTTCTCC 6420 

CACTTTGTAG GGGTAGACAA AGGGATTGTG CGAACGAATG TACTGACTGG TAAAAAACTA 6480 

TCTGATAAGA TTAGGAAAGA AGTGGGCTCT GGAGATAGCA AACTAGGAAA AGGCGGCTAT 6540 

TTCTCTACTG GGGATGTTCT ATTAGGAAAA GATGTTGTTT CTTATACCGT ACAAGTATTT 6600 

TCAGAGAATA ATGAAAGAGT AGGAGTAAAC ACTCAAAGTC ACCGTGTTCA GTATAATCTC 6660 

CCAATTCTAG CTGACTTTTC AGTCATCCAA GATACTGTGG AACCATCACG AACCGTTGTT 6720 

GAAAAAATCA TTCCAAAACT AAATATTCCC GAAGAAGAGA AAGGGAAAAT AACCGAAGAA 6780 

ATCAAGAAAA AGAAAAAAAC CTCAGAATTG GCAGAACTAA TCTCAGAAAA TGTGAAAGTT 6840 

CGCTATGTTG ATGAACAAGG GCGTTTGCTA TCATTGAAAA ATGATACTGG AATTGGAGAA 6900 

AAAGAAAGTG ACGGAACCTA CATTACCAAT AAAAAACAAC TGATTGGTAC CAGCTATAAT 6960 

GTCACAGATA AAAAACTCAG TAGCATGACT ACTACTGACG GAAAATATTA TACTTTTAAA 7020 

GAAGCAGATA CAAATTCTGC AAGTTTAACT GGGAATATTG TAAGCGAAGG TAGAACAGTG 7080 

ACC-i'TAUTTT ATAGAGAAAG CGAAGCGCCA ACCACTGCTA CAGTAACAGC CAATTACTAT 7140 

AAAGAAGGTA GGCAAGAGAA GTTGGTAGAG TCTGTTATAA AAGCTGATTT AGCGATAGGT 7200 

TCTGAGTATA CCACAGAATC AAAAACTATT GAAGGGAAAA CAACAACTGA GGACAAAGAA 7260 

GACCGAGTTA TCACAAGGAA AACAACATAC ACCTTGGTAG CAACTCCTGA AAATGCGTAC 7320 

CAGAAGACGG TGCAACAGTT GACTATTACT ACCGTGAGAA TGTTGAGGAA ACAGTGGTTC 7380 

CCAAAACAGC AACCTCTACT GAGACGAAGA CTATAACGCG TATCATTCAT TACGTTGATA 7440 

AAGTTACGAA CCAAAATGTA AAAGAAGATG TTCTTCAACC TGTAACCTTA AGCCGTACAA 7500 

AAACTGAGAA CAAGGTCACG GGAGTTGTAA CCTACGGTGA ATGGACAACA GGAAACTGGG 7560 

ACGAGGTTAT ATCTGGTAAG ATTGACAAGT ACAAAGATCC AGATATTCCA ACAGTTGAAT 7620

CACAAGAAGT TACGTCAGAC TCTAGTGATA AAGAAATAAC GGTAAGGTAT GACCGTTTAT 7680 

CAACACCAGA AAAACCAATC CCACAACCAA ATCCAGAGCA TCCAAGTGTT CCGACACCAA 7740

ACCCAGAACT ACCAAATCAA GAGACTCCAA CACCAGATAA ACCAACTCCA GAACCAGGTA 7800 

CTCCAAAAAC TGAAACTCCA GTGAATCCAG ACCCAGAAGT TCCGACTTAT GAGACAGGTA 7860 



WO 98/18931    PCT/US97/19588

AGAGAGAGGA ATTGCCAAAC ACAGGTACAG AAGCTAATGC TACCTTGGCT AGTGCTGGTA 7920 

TCATGACCTT GTTAGCTGGT CTAGGATTAG GATTTTTCAA GAAAAAAGAA GATGAAAAAT 7980 

AATAGATTTT AGAATCTAGG AACCAGGAAA AGCTCACAGA TGTGGGCTTT TTTCCTGGTT 8040 

TTGAGAACGA GGTCTTTCGT AAAGAATAAA AACGCTTACA AGTCTGTTGA ACTGGGAAAC 8100 

TATGAATCCT ATTTTTTTAA AAATATTTCC AGAAATCAGT TGCGG 8145 
(2) INFORMATION FOR SEQ ID NO: 123:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8697 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123: 

CGGTACCGGG AACGATACTT AGTCTAATTT TGCACCTTTT CCATGTATGG TAAAGGTTTT 60 

TCTTTTTTTA AAAAGGAAAA CGAGAAGAGG AGGTTCTTAT GAAAGCAAGC ATTGCCTTGC 120 

AAGTTTTACC CCTAGTACAG GGGATTGATC GGATAGCTGT TATTGATCAG GTCATTGCTT 180 

ATCTGCAwAC TCAAGAAGTG ACGATGGTAG TGACACCATT TGAAACGGTC TTGGAAGGGG 240 

AGTTTGATGA GCTTATGCGC ATTCTAAAAG AAGCGCTGGA AGTGGCAGGG CAGGAGGCAG 300 

ACAATGTCTT TGCCAATGTC AAAATAAATG TAGGAGAGAT TTT^JICTATT GATGAGAAAC 360

TTGAGAAGTA TACTGAGACG ACACATTAGT CTATTGGGCT TTCTCGGAGT ATTGTCAATC 420 

TGGCAGTTAG CAGGTTTTCT TAAACTTCTC CCCAAGTTTA TCCTGCCGAC ACCTCTTGAA 480 

ATTCTCCAGC CCTTTGTTCG TGACAGAGAA TTTCTCTGGC ACCATAGCTG GGCGACCTTG 540 

AGAGTGGCTT TACTGGGGCT GATTTTGGGA GTTTTGATTG CCTGTCTTAT GGCTGTGCTC 600 

ATGGATAGTT TGACTTGGCT CAATGACCTG ATTTACCCTA TGATGGTGGT CATTCAGACC 660 

ATTCCGACCA TTGCCATAGC TCCTATCCTG GTCTTGTGGC TAGGTTATGG GATTTTGCCC 720 

AAGATTGTCT TGATTATCTT AACGACAACC TTTCCCATCA TCGTTAGTAT TTTGGACGGT 780 

TTTAGGCATT GCGACAAGGA TATGCTGACC TTGTTTAGTC TGATGCGGGC CAAGCCTTGG 840

CAAATCCTGT GGCATTTTAA AATCCCAGTT AGCCTGCCTT ACTTTTATGC AGGTCTGAGG 900 

GTCAGTGTCT CCTACGCCTT TATCACAACT GTGGTATCTG AGTGGTTGGG AGGTTTTGAA 960 

GGTCTTGGTG TTTATATGAT TCAGTCTAAA AAACTGTTTC AGTATGATAC CATGTTTGCC 1020

ATTATTATTC TGGTGTCGAT TATCAGTCTT TTGGGTATGA AGCTGGTCGA TATCAGTGAA 1080

AAATATGTGA TTAAATGGAA ACGTTCGTAG AATTAGAATG TTTCTGAAAA AGAAAAGAGG 1140 
AAATCAAAAT GAAGAAAACA TGGAAAGTGT TTTTAACGCT TGTAACAGCT CTTGTAGCTG 1200

TTGTGCTTGT GGCCTGTGGT CAAGGAACTG CTTCTAAAGA CAACAAAGAG GCAGAACTTA 1260 

AGAAGGTTGA CTTTATCCTA GACTGGACAC CAAATACCAA CCACACAGGG CTTTATGTTG 1320 

CCAAGGAAAA AGGTTATTTC AAAGAAGCTG GAGTGGATGT TGATTTGAAA TTGCCACCAG 1380 

AAGAAAGTTC TTCTGACTTG GTTATCAACG GAAAGGCACC ATTTGCAGTG TATTTCCAAG 1440 

ACTACATGGC TAAGAAATTG GAAAAAGGAG CAGGAATCAC TGCCGTTGCA GCTATTGTTG 1500 

AACACAATAC ATCAGGAATC ATCTCTCGTA AATCTGATAA TGTAAGCAGT CCAAAAGACT 1560 

TGGTTGGTAA GAAATATGGG ACATGGAATG ACCCAACTGA ACTTGCTATG TTGAAAACCT 1620 

TGGTAGAATC TCAAGGTGGA GACTTTGAGA AGGTTGAAAA AGTACCAAAT AACGACTCAA 1680 

ACTCAATCAC ACCGATTGCC AATGGCGTCT TTGATACTGC TTGGATTTAC TACGGTTGGG 1740 

ATGGTATCCT TGCTAAATCT CAAGGTGTAG ATGCTAACTT CATGTACTTG AAAGACTATG 1800 

TCAAGGAGTT TGACTACTAT TCACCAGTTA TCATCGCAAA CAACGACTAT CTGAAAGATA 1860 

ACAAAGAAGA AGCTCGCAAA GTCATCCAAG CCATCAAAAA AGGCTACCAA TATGCCATGG 1920 

AACATCCAGA AGAAGCTGCA GATATTCTCA TCAAGAATGC ACCTGAACTC AAGGAAAAAC 1980 

GTGACTTTGT CATCGAATCT CAAAAATACT TGTCAAAAGA ATACGCAAGC GACAAGGAAA 2040 

AATGGGGTCA ATTTGACGCA GCTCGCTGGA ATGCTTTCTA CAAATGGGAT AAAGAAAATG 2100 

GTATCCTTAA AGAAGACTTG ACAGACAAAG GCTTCACCAA CGAATTTGTG AAATAATGAC 2160 

AGAAATTAGA CTAGAGCACG TCAGTTATGC CTATGGTCAG GAGAGGATTT TAGAGGATAT 2220 

CAACCTACAG GTGACTTCAG GCGAAGTGGT TTCCATCCTA GGCCCAAGTG GTGTTGGAAA 2280

GACCACCCTC TTTAATCTAA TCGCTGGGAT TTTAGAAGTT CAGTCAGGGA GAATTGTCCT 2340 

TGATGGTGAA GAAAATCCCA AGGGGCGCGT GAGTTATATG TTGCAAAAGG ATCTGCTCTT 2400 

GGAGCACAAG ACGGTGCTTG GAAATATCAT TCTGCCCCTC TTGATTCAAA AGGTGGATAA 2460

GGCAGAAGCT ATTTCCCGAG CGGATAAAAT TCTTGCGACC TTCCAGCTGA CAGCTGTAAG 2520 

AGACAAGTAT CCTCATGAAC TTAGCGGTGG GATGCGCCAG CGTGTAGCCT TACTCCGGAC 2580 

CTACCTTTTT GGGCACAAGC TCTTTCTCTT AGATGAGGCC TTTAGCGCCT TGGATGAGAT 2640 

GACAAAGATG GAACTCCACG CTTGGTATCT TGAGATTCAC AAGCAGTTGC AGCTAACAAC 2700 

CCTGATCATC ACGCATAGTA TTGAGGAGGC CCTCAATCTC AGCGACCGTA TCTATATCTT 2760 

GAAAAATCGC CCTGGGCAGA TTGTTTCAGA AATTAAACTA GATTGGTCTG AAGATGAGGA 2820 

CAAGGAAGTC CAAAAGATTG CCTACAAACG TCAAATTTTG GCGGAATTAG GCTTAGATAA 2880 
GTAGAAAAAT AGGGAGTTGG TGAAGATTAT CCTTTACCAG CGCCCTTTTT CTTTTAAAAA 2940

TGAGAAAATT TCGGTATAAT AGTCAAACAA GGTCAAGGTT TAAAGAGAGA GGTGGGTTTG 3000 

TTATGAGATT TAAAAATACA TCGGATCATA TTGAGGCCTA CATCAAGGCG ATTTTAGATC 3060

AATCTGGTAT CGTGGAGTTG CAACGGAGTC AGTTGGCAGA TACCTTTCAG GTTGTTCCTA 3120 

GTCAGATTAA CTACGTGATC AAGACACGCT TTACGGAAAG TAGAGGCTAC TTGGTTGAAA 3180 

GTAAGCGTGG TGGCGGAGGC TACATTCGTA TAGGACGGAT TGAGTTTTCT AGTCATCATG 3240 

AAATGCTCCG GGAGCTGCTT TACTCGATTG GTGAGCGAGT CAGTCAAGAA ATTTATGAGG 3300 

ATATTCTCCA GCTTTTGGTT GAGCAGGAAT TGATGACCAA GCAGGAGATG AATTTGCTAG 3360 

AATCAGTAGC TTTGGATCGC GTTTTAGGAG AAGAAGCTCC AGTTGTTCGA GCAAACATGC 3420 

TACGTCAGAT CATACAAGAG GTAGATAGAA AAGGGAAGTA AGATGAACTA TTCAAAAGCA 3480 

TTGAATGAAT GTATCGAAAG TGCCTACATG GTTGCTGGAC ATTTTGGAGC TCGTTATCTA 3540

GAGTCGTGGC ACTTGTTGAT TGCCATGTCT AATCACAGTT ATAGTGTAGC AGGGGCAACT 3600

TTAAATGATT ATCCGTATGA GATGGACCGT TTAGAAGAGG TGGCTTTGGA ACTGACTGAA 3660

ACGGACTATA GCCAGGATGA AACCTTTACG GAATTGCCGT TCTCCCGTCG TTTGCAGGTT 3720 

CTTTTTGATG AAGCAGAGTA TGTAGCGTCA GTGGTCCATG CTAAGGTACT AGGGACAGAG 3780 

CACGTCCTCT ATGCGATTTT GCATGATAGC AATGCCTTGG CGACTCGTAT CTTGGAGAGG 3840 

GCTGGTTTTT CTTATGAAGA CAAGAAAGAT CAGGTC.ViCA TTGCTGCTCT TCGTCGaaAt 3900 

TTAGAAGAAC GGGCAGGCTG GACTCGTGAA GATCTCAAGG CTTTACGCCA ACGCCATCGT 3960

ACAGTAGCTG ACAAGCAAAA TTCTATGGCC AATATGATGG GCATGCCGCA GACTCCTAGT 4020 

GGTGGTCTCG AGGATTATAC GCATGATTTG ACAGAGCAAG CGCGTTCTGG CAAGTTAGAA 4080 

CCAGTCATCG GTCGGGACAA GGAAATCTCA CGTATGATTC AAATCTTGAG CCGGAAGACT 4140 

AAGAACAACC CTGTCTTGGT TGGGGATGCT GGTGTCGGGA AAACAGCTCT GGCGCTTGGT 4200 

CTTGCCCAGC GTATTGCTAG TGGTGACGTG CCTGCGGAAA TGGCTAAGAT GCGCGTGTTA 4260 

GAACTTGATT TGATGAATGT CGTTGCAGGG ACACGCTTCC GTGGTGACTT TGAAGAACGC 4320 

ATGAATAATA TCATCAAGGA TATTGAAGAA GATGGCCAAG TCATCCTCTT TATCGATGAA 4380

CTCCACACCA TCATGGGTTC TGGTAGCGGG ATTGATTCGA CTCTGGATGC GGCCAATATC 4440 

TTGAAACCAG CCTTGGCGCG TGGAACTTTG AGAACGGTTG GTGCCACTAC TCAGGAAGAA 4500 

TATCAAAAAC ATATCGAAAA AGATGCGGCA CTTTCTCGTC GTTTCGCTAA AGTGACGATT 4560 

GAAGAACCAA GTGTGGCAGA TAGTATGACT ATTTTACAAG GTTTGAAGGC GACTTATGAG 4620

AAACATCACC GTGTACAAAT CACAGATGAA GCGGTTGAAA CAGCGGTTAA GATGGCTCAT 4680 
CGTTATTTAA CCAGTCGTCA CTTGCCAGAC TCTGCTATCG ATCTCTTGGA TGAGGCGGCA 4740
GCAACAGTGC AAAATAAGGC AAAGCATGTA AAAGCAGACG ATTCAGATTT GAGTCCAGCT 4800
GACAAGGCCC TGATGGATGG CAAGTGGAAA CAGGCAGCCC AGCTAATCGC AAAAGAAGAG 4860
GAAGTACCTG TCTACAAAGA CTTGGTGACA GAGTCTGATA TTTTGACCAC CTTGAGTCGC 4920
TTGTCAGGAA TCCCAGTTCA AAAACTGACT CAAACGGATG CTAAGAAGTA TTTAAATCTT 4980
GAAGCAGAAC TCCATAAACG GGTTATCGGT CAAGATCAAG CTGTTTCAAG CATTAGCCGT 5040
GCCATTCGCC GCAACCAGTC AGGGATTCGC AGTCATAAGC GTCCGATTGG TTCCTTTATG 5100
TTCCTAGGGC CTACAGGTGT CGGGAAAACT GAATTAGCCA AGGCTCTGGC AGAAGTTCTT 5160
TTTGACGACG AATCAGCCCT TATCCGCTTT GATATGAGTG AGTATATGGA GAAATTTGCA 5220
GCTAGTCGTC TCAACGGAGC TCCTCCAGGC TATGTAGGAT ATGAAGAAGG TGGGGAGTTG 5280
ACAGAGAAGG TTCGCAATAA ACCCTATTCC GTTCTCCTCT TTGATGAGGT AGAGAAGGCC 5340
CACCCAGATA TCTTTAATGT TCTCTTGCAG GTTCTGGATG ACGGTGTCTT GACAGATAGC 5400
AAGGGACGCA AGGTCGATTT TTCAAATACC ATTATCATTA TGACATCGAA TCTAGGTGCG 5460
ACTGCCCTTC GTGATGATAA GACTGTTGGT TTTGGGGCTA AGGATATTCG TTTTGACCAG 5520
GAAAATATGG AAAAACGCAT GTTTGAAGAA CTGAAAAAAG CTTATAGACC GGAATTCATC 5580
AACCGTATTG ATGAGAAGGT GGTCTTCCAT AGCCTATCTA GTGATCATAT GCAGGAAGTG 5640
GTGAAGATTA TGGTCAAGCC TTTAGTGGCA AGTTTGACTG AAAAAGGCAT TGACTTGAAA 5700
TTACAAGCTT CAGCTCTGAA ATTGTTAGCA AATCAAGGAT ATGACCCAGA GATGGGAGCT 5760
CGCCCACTTC GCAGAACCCT GCAAACAGAA GTGGAGGACA AGTTGGCAGA ACTTCTTCTC 5820
AAGGGAGATT TAGTGGCAGG CAGCACACTT AAGATTGGTG TCAAAGCAGG CCAGTTAAAA 5880
TTTGATATTG CATAAAAGAA TAAAAGTATC AGCATCTGAC CATAAGTCAC AGTGGAGTGA 5940
AATTCAATGA AAATCAAAGA GCAAACTAGG CAGCTAGCCG CAGGTTGCTC AAAACACTGG 6000
TTTGAGGTTG CAGATAGAGC TGACGTGGTT TGAAGAGATT TTCGAAGAGT ATGAAACTAA 6060
AACCTATAGC TTCTAAACGA TCCGTGGTTT TCATCATTCA ACACAAAATT CATATGTTTA 6120
TTACCCTCCG TCGTATTTGT CTTAGAGCGT GTGTAGTAGA AAAAGAGCAG TCTTATCTGA 6180
AATTTTTATT CTTTCAAAAG AGACCTGTTT CTTTTTTGCA TGTCAAATCC GTTCTAGCTG 6240
GTATTTGAAA AATCAAACTA ATATTCAATG AAAATCAAAG AACAAACTAG GAAGCTAGCC 6300
GCAGGTTGCT CAAAACACTG TTTTGAGGTT GTAGATAGAG CTGACGTGGT TTGAAGAGAT 6360
TTTCGAAGAG TATAAGCTGC AAGATGAATG ATTTTCTTGT ATTGACGTTG TTGTTGACAA 6420
AAAGTAGCGG ATAAATGAAA TCCATTCCAT TATCATAGAT GATAGGCTGG TAGGAAATTT 6480

TCAAATAGCA TACAGGAAAT AGATGTATGG AGTTCTGGTA GTAGAAAGGG AGAGAGATGA 6540 

ACATTTTAGT TGCAGATGAC GAGGAAATGA TTAGAGAAGG AATTGCAGCA TTTCTGACAG 6600 

AAGAGGGTTA TCATGTCATT ATGGCTAAGG ATGGACAAGA GGTCTTGGAA AAATTTCAAG 6660 

ATCTCCCTAT CCATCTCATG GTACTGGATT TAATGATGCC TAGGAAGAGT GGTTTTGAAG 6720 

TGTTAAAAGA AATCAATCAA AAGCACGATA TTCCTGTCAT CGTCTTGAGT GCTCTGGGAG 6780 

ATGAAACTAC TCAGTCACAG GTATTTGATC TCTATGCTGA TGATCATGTG ACAAAACCTT 6840 

TTTCTTTGGT ACTGCTTGTC AAGCGTATTA AGGCGCTTAT CAGACGTTAC TACGTCATAG 6900 
AGGATCTTTG GCGATATCAG GATGTAACAG TGGATTTTAC CTCTTACAAA GCACATTATA 6960

AAAATGAAGA AATTGATCTC AAACCAAAGG AATTACTGGT ACTAAAGTGT TTGATTCAGC 7020

ATAAAAATCA AGTTTTAAGT AGAGAGCAGA TATTGGAAGA AATTTCAAAA GATGTAGCTG 7080 

ATTTACCTTG TGATAGGGTC GTTGATGTCT ATATTCGTAC TCTTCGCAAA AAATTAGCTT 7140 

TAGATTGTAT CGTGACTGTG AAAAATGTTG GGTATAAGAT TAGCTTATGA TAAAAAATCC 7200 

TAAATTATTA ACCAAGTCTT TTTTAAGAAG TTTTGCAATT CTAGGTGGTG TTGGTCTAGT 7260 

CATTCATATA GCTATTTATT TGACCTTTCC TTTTTATTAT ATTCAACTGG AGGGGGAAAA 7320 

GTTTAATGAG AGCGCAAGAG TGTTTACGGA GTATTTAAAG ACTAAGACAT CTGATGAAAT 7380 

TCCAAGCTTA CTCCAGTCTT ATTCAAAGTC CTTGACCATA TCTGCTCACC TTAA.ViCAGA 7440 

TATTGTAGAT AAGCGGCTCC CTCTTGTGCA TGACTTGGAT ATTAAAGATG GAAAGCTATC 7500 

AAATTATATC GTGATGTTAG ATATGTCTGT TAGTACAGCA GATGGTAAAC AGGTAACCGT 7560 

GCAATTTGTT CACGGGGTGG ATGTCTACAA AGAAGCAAAG AATATTTTGC TTTTGTATCT 7620 

CCCATATACA TTTTTGGTTA CAATTGCTTT TTCCTTTGTT TTTTCTTATT TTTATACTAA 7680 

ACGCTTGCTC AATCCTCTTT TTTACATTTC AGAAGTGACT AGTAAAATGC AAGATTTGGA 7740 

TGACAATATT CGTTTTGATG AAAGTAGGAA AGATGAAGTT GGTGAAGTTG GAAAACAGAT 7800 

TAATGGTATG TATGAGCACT TGTTGAAGGT TATTTATGAG TTGGAAAGTC GTAATGAGCA 7860 

AATTGTAAAA TTGCAAAATC AAAAGGTTTC CTTTGTCCGC GGAGCATCAC ATGAGTTGAA 7920 

AACCCCTTTA GCCAGTCTTA GAATTATCCT AGAGAATATG CAGCATAATA TTGGAGATTA 7980 

CAAAGATCAT CCAAAATATA TTGCAAAGAG TATAAATAAG ATTGACCAGA TGAGCCACTT 8040 

ATTAGAAGAA GTACTGGAGT CTTCTAAATT CCAAGAGTGG ACAGAGTGTC GTGAGACCTT 8100 

GACTGTTAAG CCAGTTTTAG TAGATATTTT ATCACGTTAT CAAGAATTAG CTCATTCAAT 8160 

AGGTGTTACA ATTGAAAATC AATTGACAGA TGCTACCAGG GTCGTCATGA GTCTTAGGGC 8220 
ATTGGATAAG GTTTTGACAA ACCTGATTAG TAATGCAATT AAATATTCAG ATAAAAATGG 8280

GCGTGTAATC ATATCCGAGC AAGATGGCTA TCTCTCTATC AAAAATACAT GTGCGCCTCT 8340 

AAGTGACCAA GAACTAGAAC ATTTATTTGA TATATTCTAT CATTCTCAAA TCGTGACAGA 8400 

TAAGGATGAA AGTTCCGGTT TGGGTCTTTA CATTGTGAAT AATATTTTAG AAAGCTATCA 8460 

AATGGATTAT AGTTTTCTCC CTTATGAACA CGGTATGGAA TTTAAGATTA GCTTGTAGAC 8520 

AGATTAGTTT TTTATTAAAG TTCATATAGG GTTAACATAA GTGTGTTATT CTTTGTGTAG 8580 

ATAAAAGAAA GGATACTAAT ATGGTATTAG CGATTATTTT AGTAACATTC TTTATTCGAT 8640 

TGATTTTTTT AAAGCGTTCG ATAGAGAATG AGAAACGAAT CCTTAGCAAT GGCGGGG 8697 
(2) INFORMATION FOR SEQ ID NO: 124: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4317 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124: 

AACCATACAT ACGGCAAGGC AAAGCTGACG CGGTTTGAAG AGATTTTCGA AGAGTATTAG 60 

TTGCCTTTAA AGGCATCCAC CATCGTTTGA AATTCTTCAT TTGAGAGAGT AATCCCTTTG 120 

CCCAT'i-rrAG TATGGTCTGG ACTCCAAGCA CGAATATCAA ACTTTGCAGG GGCACCATTA 180 

AAGCTCACAC GGTTAATTTC CTTGGTCCAA CCTTTTTCGT TTTCAGAAAG AGTCAACAAG 240 

TGCTCTTCGA TTTCAAATGT AAATTCTGCC ATTTTCTTCT CCTTTTTTAG TTTCATTAGT 300 

TTATTCGTAA AATCTTGTAG ATTTTAGGAA AATTTTATAT AATATTGATA TAAAAGAAGG 360 

GAGGCCAATA TGAGACATAA ATTCCAGCAA GTTCTAAATA AAATACATGA TTTTTTAAAT 420 

GGATATGACC AACCTGACCA GACTGAAACC AACTCCCTTA CAGCCACTAT TGAAGAGGCT 480 

ATCCAGAAAC AAACCGCTGT TCACCTTATC TTGTCTGAGA CAAGCTTTAC AGGTGACATC 540 

ATCAAATATG ATCAGCAAGG CCAGCAAATT ATCGTGAAAA ATTTTTCCAA AAATGTGAGC 600 

CGGATTATCC GTATAAGCGA TATTCAACGC CTGCGATTTG TCCCCTCAAC TGTCCAAACA 660 

GCCCAAAAAA ATAGATTTAA GAAAGAGTGA GATGTAGTTG CTTCATCCCA CTCTTTTTTC 720 

TTAGCGAATT TGTTCAAAAT GTAAATGAAC TGCGATATGA TCTCCATAAC CACTTCTTTC 780 

CAAGTCACGT TGTAAACGAT AGGAAATGTA GTGTTCTGCA ATGGTAATGT AACCTGCGCC 840 

CAATAAACGA TGTTCAACCA TAGATTGAAT CATACTGATA GTCGCACGTT CCACCTTGGC 900 
TTCTTGTAAA TCCAAAACTA CCTTCTTAGT GACTTGAGCA AGATTTTGAC GCAAATCATC 960

TGTCAAAACA TAAACAGTTT GGGCTGCCTT CAAGATGGCT TGGTAAATCT TATCTGGATT 1020 

AAATTCAGCA ATTTCGCCAT TACGTTTGAT TACTTGCATA GGTTTCTCCT TTATTCTTTG 1080 

TTTTCTTTGA TTTCTGCCAG CATTTTTTCT TCTTCTACTG TCAGTTGATA ATGTTCAAGT 1140 

AAATCCGGTC TGCGCTCGTA GGTTTTCTTT AAACTCTCGT ACAATCGCCA CTGACGAATC 1200 

TTTTCATGGT GGCCACTCAT CAATACATCT GGCACGACCA TGCCTCGATA ATCATAGGGA 1260

CGTGTGTACT GAGGATATTC TAAAAGACCT GAAGAAAAAC TATCATCTTG GTGGCTAGAC 1320 

TCCTTGCCAA TCACTTCTGG AATCAGGCGA ACTGTAGCAT CAATCATGGT CATAGCTGCC 1380 

AATTCTCCAC CAGTGAGGAC ATAGTCACCT AGGGAAATCT CATCTGTTAC CAAGGTCTTA 1440 

ATGCGCTCAT CATAACCCTC ATAGTGCCCA CAGATAAAGA TTAGCTCTTC CTCTTGAGCC 1500 

AAATCTTCAG CATAAGCCTG ATCAAACTGC TTTCCAGCAG GATCAAGGAG AATAACGCGC 1560 

GGATTTTTCT TTTCAATAGC ATCAAAGGAA TCGAAAATAG GTTGTGCTCT GAGCAACATG 1620 

CCCTGACCGC CTCCGTAGGG CTCATCATCT ACATGACGGG CCTTTTCAGC ATTTTCTCGA 1680 

AAATTATGAT ACTGGATATC CAAGAGCCCT TTTTCTCGAG CCTTTCCAAC GATTGAGTGC 1740 

TCCAGTGGAG AAAACATCTC TGGAAAGAGG GTTAAAATAT CAATCTTCAT CGTCTAACCC 1800 

TTCTAAGATT TCCACATCGA CCCGTTTACT TGGAATATCA ACATTGAGAA CCACTGGTGG 1860

GATATAAGGT AAAAGCAAAT CACGTTTGCC TTTTCRTTTG ACCACCCAGA CATCAT7AGC 1920 

ACCTGGTTGC AGGATTTCCT TGATGGTTCC AACCAAGCTA TCACCCTCAT AGACTTCCAA 1980 

ACCGATAATC TCGTGATAGT AAAATTCACC ATCGTCTAGG TCATTCAAAT CTTCCTCAGC 2040 

GACCTTGAGA CTGTATCCCT TGTACTTTTC GATAGTATTG ATATGGTACA TATCTTTGAA 2100 

TTTAATAATG TCAAAGTTCT TCTGTTTACG GTGGCTAGCG ATGGTCACTG TTTGGACAAA 2160 

CTGATCTTTT TCATCAAACA AAACCAGCTC AGCTCCTTTT TTAAACCGTT CTTCTGCAAA 2220 

ATCCGTCACA GACAAGACTC GCATCTCCCC CTGTAATCCC TGCGTATTAA CGATTTTCCC 2280 

AACATTAAAG TAGTTCATCT TGTCTCCTGT AATCTCCTTT TTTCCATCTT ATTCTAACAA 2340 

TTCTCGAATA ATAGCCGCAA TTTTTTCCGA TTCTGACCAT TGTAAATAAT GGTGATTCCC 2400 

TCCTAAAATG AGTTTAGTAT TGGAAGTCCA ATATTCTGAT TCTCTGTACT CTTTTTCTCT 2460 

ATAAGGCTGA CAAAAAACAA ATACAGGAAT ATGAGCTTCT ATAGATACAT CCTCAAAATC 2520 

TTCCTCAGTA ATCTCTCCAG ATATCTGAAA TTCTGGATCT TGATTTTCCA ACTCTAAGCC 2580 

TTTTTCTTGC ATTAATTCCC AGATTTTTTT ATTCGTTTCA GGACTAAATG TTGCTTGAGT 2640

TAAGTTCTTA AAATAAAGTT CAGGACCACA CTCGTCAATC AGCCTCATCT GCTCTTCCAT 2700 
TTCTGGATAA GGATTTTCTG AAAAATCAGC AAACATGACT TTTTTAGTTG TCGGTTCAAT 2760

TGCTACTAAA GTCTGACGCT TAATTGGTTT CTCGAGTAAT TTGCAAGCTA AAATTCCACT 2820 

CCAACTATGT GCACAAAGTA TATATTCAGA AATTCCTAAT TCTTCAAGTA CTTCATAAAC 2880 

CGCATCTGCA AGATTATCTA GATTTTTTCC AGCTTGGTCA TGAATCGGAC TCCTACCTGT 2940 

GTTCGGAAAA TCAATTGTCA AATAACCAAT TGTAGGAGGA GGTTTTTCAA GTATAAGTGA 3000 

AAAATTTTCA TAACTTGGTA GCAAACCTGC TCCGTTTAAA CAAACTAGCA CTTTCTTTTG 3060 

CTTTTGATAA GTAACAGAGA GGCTACCAAT TTCTGTAGAT ACTTCAAACC TCTTCATAAA 3120 

GAAATCCACT GATTCTATAT AATGAATTAT TAAAAATCCT TATCCTTTAT TTTATCACGT 3180 

TCCAAGGATT TTCTCAAGTT GGAGGAAGGG GACAATATCT CTACTTTCCC TTCAATAATC 3240 

CTTCCAAATT ATGTTTATGT TGGTAATTAA TGGCTGCGGT TTTGTCTTTC TCAAAGACAG 3300 

TCTTGGTAAG GTCAATATGA TTAATAGCTA CGATTGCGAC GGTGTAGTAA ATGATATCAG 3360 

CCAGTTCTCT GGCAAGTTCC TCGTTCGAAT CCTATCCCTT CTTTTCGACC AGAGCGCCTA 3420 

TTCAAAACCT CGACTACTTC TCCGACTTCC TCCACTAACT TCATAAAGAG ACCTTCATCA 3480 

GTCCGAGACT GCTGTTAATG TTCGATTAAG TAGTCTTGGA ATTGCCTAAA CGTTCAATCT 3540 

TTTATAGTAT ATTGAAACTA GAATAGTACA CCTTTACTTC TAAAACATTG TTAGAAATCG 3600 

ATTTGACTGT CCTGATCGAT TTGTCCTGTT CTTGTTTCAT TTTACTATAT CTTCTATTCC 3660 

ACACAAAAAA GCGAGACATC CGTCCCGCCC TTCTTATTTT TCGTCAATAA CGATTCTTAC 3720 

TTTTTTGTAT TCAGTTGGGA CAGAGTAGAC AATCGTTCTT ATCGCAGAAA TAGTGCGACC 3780 

CTTACGACCG ATTACACGAC CCACATCGCT TTGATCAAGA TTCAAATGAT ATTCCAAAAA 3840 

TTCTGGTGTA TCCTCAATCT TGATAGTTAA GGCATCTGGT TGTGAAATTA AGGGTTTCAC 3900

AATCGCAATA ATGAGATTTT CAATCGTATC CATCTGTCAA CCTACTTTAA ACTTATTTTG 3960 

AAAATTTAGA ATCGTGGAAT TTTTTCAATA CGCCTTCTTT TGAAAGGATG TTACGTACTG 4020 

TGTCTGAAGG TTGAGCTCCA TTAGCCAACC ATGCAAGAAC GCGGTCTTCT TTCAAAGTTA 4080 

CTTGGTTTTC AGCAACAAGT GGGTTGTAAG TTCCAACTGT TTCGATGAAA CGTCCGTCAC 4140

GTGGTGAACG TGAATCTGCT ACGTTGATAC GGTAGAAAGG TTTTTTCTTA GAACCCATAC 4200 

GAGTCAAACG GATTTTAACT GCCATTTTTA AAGTCTCATT TCTTTAATTT TTTATTTCGG 4260 

TGAAATAGCT GAGCTATTTA GCACATGTTC TATTATAGCA GATTTCTGGC ATGTGTC 4317 

(2) INFORMATION FOR SEQ ID NO: 125:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4881 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125:

AATTTATTTG ACTGGAAATT GTAGAGGGTT CTCGAAATTT CTTGAATGGT TAAAATAAGG 60 

ACAAGAGAAA ACATGGATAT CTATATCCTT GTGCCAAAAA AACCACTGCC CTCCCCAGAC 120 

CAACCTGAGG AAAGCAGTGA TTCTTATTTT AGGAGTTAGG AATGAATACA CGAAATCAAT 180 

TTAGCTGATT ATTTTTTGTT TTTCAAGAAT TCATCGTATT GTTTTTGCAT TTCGTTCAAT 240 

ACTTTTTCGT AGGCACCTTC AGATTTCAAT TTTTCCATCA ATTCTGGAAT CGCTTTATCT 300 

GGGTCTACAG TACCAGTGTT GATAGCTGTA TCAAATTGTT GCATTGTGTT AGCAATAGCT 360 

GAGATTTCAG ATTTCACATT GTCAGTATTG AAGATAAATC CAAGCGCTGG AGATTCTTTA 420 

GCTTCTGCCA ATTCTTTCTT AGAATTTTCG ATTTGTTGGT CTGTAACGTT TTCGTTGATG 480 

TAAAGGATCC AGTTGTTACC AGTGTTCCAT CCACCCATGT GAGTGTTTCC TTTGTAGCCA 540 

TCAAGAACGC GAACACGGTT TTCTTTACCT TCAATTTTTT CCCAGTTCTT GCCTTCTGGA 600 

CCGTAAACAA GACCGTTCAA GAGTTCTGGG TTCGTATTCA AGAGGTTCAA GATTTCCATT 660 

GATTTTTCTT TGTTCTTAGA GTTGTTTGAG ATGACAAAGT TAGCAACTTG TGTTGTTTGG 720 

TTTTTCTTGA TGAAGTTAGT AATTGGTTTG ATTTCGATAT CTTTGTTGGC AAUACGTGAA 780 

AGCAAGCTGT TACCGTAGTC AGCTGGTCCT ACTGTTTCTT CACGAACGAA CCAAGTATCT 840 

TGTTGAAGGT CAAAGGAAGT ATCGCTTGTT GCGACGTCTT TTGGAATGTA GCCAGCTTCA 900 

TAGAATTTGT GAAGAGTCTT CAAGTGTTCT TTGAAACGAG GCACTTCGTA ACGGTTTACA 960

ACTTTAGTAG TATCGCCTTC AAGGTCGATA ACGAATGGAA GACCGTTTGC TACTGGGTAG 1020 

TCAAAATTAT CAGATGGGAT GAAAACTTTA CCAATAGCAA ATGGTACTAC GTCTGGAGCT 1080 

TTTTCTTTGA TTTGTTTCAA GACTGGCTCA AGAGTTTCGT AAGAAGTAAC ACCTGAAATA 1140 

TCGATACCAT ATTTAGCAAG GAGAGTTCCG TTGAAGGCAA AGTTTTGAGA TGATGCAACG 1200 

TTGGCTGCAA CTGGAACAGC GTAAATCTTA CCATTTACAG TATTACCCTT GATGTAAGCT 1260 

GGGTCAAGTG CTTTGTAAAG GTCTTTACCT TCTTTTTTGT ACAATTCTGT CAAGTCAGCG 1320 

TAAGCACCTT TTTGAGCATT TACAATATAG TTATCTGCAA AGGCAATATC ATAGTTTTCA 1380 

CCAGATGATG TGATAACTGA CATTTTCTTA CCATAGTCAC CCCAGCCAAG GTATTGGATA 1440 

TCCAATTTGG CACCAACTTT TTCTTCAATG ATTTTGTTGG CATTTGCTAA CAATTCATCC 1500 

AAGTTGTCTG GTTTGTCACC GATTTGGTAC ATTTTGATAA CAGGTTTGTC ACCTGAATCA 1560 
GCAGCTTTTT TGCTGTTACC TGTCAAATTT CCACAAGCAG CAAGACCTGC AGCCAGAGCG 1620

ACTACACTAG CAGATGCAAA AGCATATTTT TTCCAGTTTT TCATGATAAA AACTCCTTTT 1680

TTTATTTTTA AACTTATAAA CAATGTAATG ATCTTATACT CAATAAAAAT CAAAGAGCAA 1740

ACTAGAAAAC TAGCCGCAGG CTGCTCAAAG CACTGCTTTG AGGTTGTAGA TAAGACTGAC 1800

GAAGTCAGTT ACATATATCT ACGGCAAGGC GACGTTGACG CGGTTTGAAT TTGATTTTCG 1860

AAGAGTATTA ACTTCACACA AGGGAAGTTG GGAACTGAGA AATGTTATTT CTCAATAAGC 1920

ACTATTCTTT CACACCACCG ATAGTCAAAC CTTTTACAAA GTAGCGTTGG AAAAATGGAT 1980

ACAAAATCGC GATTGGAAGG GTTGCAACCA CAACCATGGC CATACGACCT GTTTCTTTCG 2040

GTAGAGCAAC TCCCAGTTGA CCAATCAAGC CGACCGCTTT GGCAATGTAG TCCATATTTT 2100

GTTGGATTTG CATGAGCAAA TATTGCAATG GATACAAGTT GTCACTCTTG ATGTAAAGAA 2160

GGGCGTTGAA CCAGTCATTC CAGAAACCAA GAGCTGTTAA GAGCGTGATG GTTGCGATAC 2220

CTGGTAGTGA CAATGGCAAA CAGATTTGGA AGAAAATCCG GGCCTCACTG GCACCATCGA 2280

TACGAGCCGA TTCTAGAATG GCTTCTGGAA TGGTCTTCTT GAAGAAGGAA CGCATCAAGA 2340

TGATGTTAAA TGGTGAGAGA AGCATTGGAA CAATCAAGGC CCAAACAGTG TCACCAAGCT 2400

GAAGTACACG GGTCACCATG ATATAACCTG GTACCAAACC AGCGTTGAAC AACATACTGA 2460

GAAGGACGAA GATGGTAAAG AATCTGCGAT ACTTAAAGGT TGTCCGTGAA ATAGCGTAGG 2520

CATAGGTTGT TGTGATAAAG ACATTTGTCA ATGTCCCAAC TACGGTTACA AAGACAGAGA 2580

TGAAGAGGGC TTGTAGGATT TTATCCTTAA ACTGTGCCAA AAACTCAAAA CCGTCTAAGC 2640

CAAATTGGGA TGGGAAGAAG CTATAGCCGT ATTGGAGGAG GCTTTTCTCG TCTGTCACTG 2700

AAATAATGAT AACGAATACA AAAGGTAGGA TACAAGAGAG GGCAATCAAA CCCGAAATGA 2760

TACTGAAGAA GATATCTGCT TTCTTACTGA AGGAGTGAAT GCCGACATTA TCAATTTTTT 2820

CTTTTTTAAT TTTCTTTTTT GCCATATTCT CCTCCTTTCT AGAACAAAGC TGAGTTTGGA 2880

TCGACTCGTC TTGCAAGCAA GTTTGATAGG ATAACCAGAA TCAAACCAAC AACGGATTGG 2940

TAAAGACCGG CTGCTGCAGC CATACCGATA TCTGCTGTCT GAGTCAAACC ATTAAAGACA 3000

TATACGTCCA AAACGTTGGT TACATTGTAA AGCTGACCAG CATTGTGTGG GATTTGATAG 3060

AAGAGACCGA AGTCTGCGCG GAAGATATTT CCGACTGCAA GGATGGTCAA TACAGTTACA 3120

AGCGGAGTCA ACTGAGGAAT GGTTACGTTG CGAATACGTT GCCACTTGCT AGCTCCGTCC 3180

ACTGTCGCTG CTTCGTAGTA GGTTGGATCA ATTCCCATGA TCGTCGCATA GTACATGACA 3240

CTGCTATATC CAAAGCCTTT CCAAATACCT AGGAAAAGTA GGAGATAGGG CCAGATGCCC 3300

AGGTCAGCGT AGAAATTGAC TTCTTTGAGA CCAAGACTTT CCAATAGATG ATTGAACACC 3360

CCTTTATCAA TATTTAGGAA GGCATCTGTA AAGAAACTGA TGATAACCCA AGACAAGAAG 3420 

TAAGGGAACA ACATAGAAGT TTGAAAAATC TTCACCATTC TCTTAGAACG GAGCTCGCTG 3480 

AGGATAATGG CAATCCCTAC AGATACAACT AAACCTAGAA AGATAAAGCC AAGATTGTAG 3540 

AGGACAGTAT TTCGTGTGAT AATAAAGGCG TCTCTTGAAC TAAATAAGAA TCTAAAATTA 3600 

TCGAGTCCGA CCCATTTACT ATTTATGATA CTATCTATGA AACCATTACT GGTCATGTGG 3660 

TAGTCTTTGA AGGCAACCAC GTTCCCAAAT ACTGGAATGT AAAAGAATAG AATCAACCAG 3720 

AGTGCCCCTG GCAAAACCAT CAAGAGAAAG ATCCAGTTGT CTCTCAATGT TTTTGAAAAC 3780 

TTTTTCATAA TTTCCTCCCT TTTTATTTTG ATATCCATCT AAAAATTCTT TTTTAGACTT 3840 

TTGATAACGA TTACATTATT AGTATACTCC TATTTGCAGG TTAGGTTAAA CTCCTAATTA 3900 

TAGAAAAAAC TCCACAAATT ATGTAGCAGA TTTAAAACTT TATCACCACT ATCAAACAAA 3960 

TGTCCTAAAT CAATTGTTTA TTTTATCTCT ATTAGCCCAG TGATGGCGTC ACTCTGTTAT 4020 

AAGCATCCAA CAACGGGGTA TACTGAAAAA TCTCCAGACT AGGGAACTCA GCGATAGTTC 4080 

CTAATCTGGA GATTTTTAAT ATGTTATTAG GCGTTTGCTT TCAACTTAGC AATAACCTCT 4140 

TTAAGATTAT CAATCAACTC TGCTGCAGTA TGCTCAGAGC CTTTTTCATC TGCCAAGAAC 4200 

AAAACTGCTT TTTGAAGTTC TTTTTGAGAG TTTTCAAGGA CATCCTTATC TACTGTTTCA 4260 

AGGTTTGAGT CTTTAAGAAG TTTACTTAAT TCCTTCGCTA ATTTCTTGAG TTTGaTttGC 4320 

AGACTCATCT TCTCCTGCTG TTTCTTTGCC CGCTGTTTGT CCTCCATCCT TAGTTGCTGA 4380

CTGGCTTTCC TTAATGGACT CTAGGGAAGC AATGGCATCT TTGACTGTTT GCAAGATATC 4440 

ACGTAAACCT TGCTCTGTCA AACTATCATC TGCAAAAGCT TTATTAGCCT CTGCCAAAAC 4500 

CAGACGTGCT GAATCTGTGG TAGGATTCGA TACACCTGTC AATGATCTCA AAAGATTTTC 4560

TAAGGTTTGA GTCTGCTTAC TAATACTAGA CTAAAATCAA AAAGTATTAT ATAACAGTGA 4620

TATGAAATCA ACTAAAGAAG AAATCCAAAC CATCAAAACA CTTTTAAAAG ACTCTCGTAC 4680

AGCTAAATAT CATAAACGCC TTCAAATCGT TCTATTTTGT CTGATGGGCA AATCTTATAA 4740 

AGAGATTATA GAACTTTTAT AGTAGTTTGA AATAAGATGT GAACATCTCT ATCAGGAAAG 4800

TCAAATTAAT TTATAGAAAT ATTTTAGCAG CCAAGGTGTA CTGTTATAGA TTCAATACAC 4860 

TATACTTGGT GGTTTAGCTC G 4881
(2) INFORMATION FOR SEQ ID NO: 126: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13121 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126:

AGGATCCCCG GAAAAGGAGA CTAAAAATGA AGAAAAAATT TCTAGCATTT TTGCTAATTT 60 

TATTCCCAAT TTTCTCATTA GGTATTGCCA AAGCAGAAAC GATTAAGATT GTTTCTGATA 120 

CCGCCTATGC ACCTTTTGAG TTTAAAGATT CAGATCAAAC TTATAAAGGA ATTGATGTTG 180 

ACATTATTAA CAAAGTCGCT GAGATTAAAG GCTGGAACAT TCAGATGTCC TATCCTGGAT 240 

TTGACGCAGC AGTCAATGCG GTTCAAGCTG GGCAAGCCGA CGCTATCATG GCAGGGATGA 300 

CAAAGACTAA AGAACGTGAA AAAGTCTTCA CCATGTCTGA TACTTACTAT GATACAAAAG 360 

TTGTCATTGC TACTACAAAG TCACACAAAA TTAGCAAGTA CGACCAATTA ACTGGCAAAA 420 

CCGTTGGTGT TAAAAACGGA ACTGCCGCTC AACGTTTCCT TGAAACAATC AAAGATAAAT 480 

ACGGCTTTAC TATTAAAACA TTTGACACTG GTGATTTAAT GAACAACAGC TTGAGTGCTG 540 

GTGCCATCGA TGCCATGATG GATGACAAAC CTGTTATCGA ATATGCCATT AACCAAGGTC 600 

AAGACCTCCA TATTGAAATG GATGGTGAAG CTGTAGGAAG TTTTGCTTTC GGTGTGAAAA 660 

AAGGAAGTAA ATACGAGCAC CTGGTTACTG AATTTAACCA AGCCTTGTCT GAAATGAAAA 720 

AAGATGGTAG TCTTGATAAA ATTATCAAGA AATGGACTGC TTCATCATCT TCAGCAGTGC 780 

CAACTACAAC TACTCTCGCA GGATTAAAAG CTATTCCTGT TAAGGCTAAA TATATCATTG 840 

CCAGCGATTC TTCTTTTGCC CCTTTTGTTT TCCAAAATTC AAGCAACCAA TACACTGGTA 900 

TTGATATGGA ATTGATTAAG GCAATCGCTA AAGACCAAGG TTTTGAAATT GAAATCACCA 960

ACCCTGGTTT TGATGCTGCT ATCAGTGCTG TCCAAGCTGG TCAAGCCGAT GGTATCATCG 1020 

CTGGTATGTC TGTCACAGAT GCTCGTAAGG CAACTTTTGA CTTCTCAGAA TCATACTACA 1080 

CTGCTAATAC CATTCTTGGT GTCAAAGAAT CAAGCAATAT TGCTTCTTAT GAAGATCTAA 1140

AAGGAAAGAC AGTCGGTGTT AAAAACGGAA CTGCTTCTCA AACCTTCCTA ACAGAAAATC 1200 

AAAGCAAATA CGGCTACAAA ATCAAAACCT TTGCTGATGG TTCTTCAATG TATGACAGTT 1260

TAAACACTGG TGCCATTGAT GCCGTTATGG ATGATGAACC TGTTCTCAAA TATTCTATCA 1320 

GCCAAGGTCA AAAATTGAAA ACTCCAATCT CTGGAACTCC AATCGGTGAA ACAGCCTTTG 1380 

CCGTTAAAAA AGGAGCAAAT CCAGAACTGA TTGAAATGTT CAACAACGGA CTTGCAAACC 1440 

TTAAAGCAAA CGGTGAATTC CAAAAGATTC TTGACAAATA CCTAGCTAGC GAATCTTCAA 1500 

CTGCTTCAAC AAGTACTGTT GACGAAACAA CGCTCTGGGG CTTGCTTCAA AACAACTACA 1560 
AACAACTCCT TAGCGGTCTT GGTATCACTC TTGCTCTAGC TCTTATCTCA TTTGCTATTG 1620

CCATTGTCAT CGGAATTATC TTCGGTATGT TTAGCGTTAG CCCATACAAA TCTCTTCGCG 1680

TCATCTCTGA GATTTTCGTT GACGTTATTC GTGGTATTCC ATTGATGATT CTTGCAGCCT 1740

TCATCTTCTG GGGAATTCCA AACTTCATCG AGTCTATCAC AGGCCAACAA AGCCCAATTA 1800

ACGACTTTGT AGCTGGAACC ATTGCCCTCT CACTCAATGC GGCTGCTTAT ATCGCTGAAA 1860

TCGTTCGTGG TGGTATTCAG GCCGTTCCAG TTGGCCAAAT GGAAGCCAGC CGAAGCTTGG 1920

GTATCTCTTA TGGAAAAACC ATGCGTAAGA TTATCTTGCC ACAAGCAACT AAATTGATGT 1980

TGCCAAACTT TGTCAACCAA TTCGTTATCG CTCTTAAAGA TACAACTATC GTATCTGCTA 2040

TCGGTTTGGT TGAACTCTTC CAAACTGGTA AGATTATCAT TGCTCGTAAC TACCAAAGTT 2100

TCAAGATGTA TGCAATCCTT GCTATCTTCT ATCTTGTAAT TATCACACTT TTGACTAGAC 2160

TAGCGAAACG CTTAGAAAAG AGGATTCGTT AATGGCAAAA TTAAAAATTG ATGTAAATGA 2220

TTTACACAAG CACTATGGAA AAAATGAAGT CCTAAAAGGA ATTACGACTA AGTTCTATGA 2280

AGGAGATGTT GTTTGTATCA TCGGTCCTTC AGGTTCTGGT AAGTCAACTT TCCTCCGTAG 2340

CCTCAATCTT TTAGAAGAAG TCACTAGCGG TCACATCACT GTGAACGGCT ATGATTTAAC 2400

TGAAAAAACA ACCAATGTTG ACCACGTCCG TGAAAATATC GGCATGGTAT TCCAACACTT 2460

CAACCTCTTC CCTCATATGT CTGTATTGGA CAACATCACC TTTGCTCCTA TTGAGCACAA 2520

GTTGATf^AO'T' AAGGAAGAJIG r"nr: » nr- 7^ ?l GGGnATGGAG TTGCTTGAAA aCjGttggact 2580

AGCAGATAAA GCTAATGCCA ATCCAGATAG CCTATCAGGT GGTCAAAAAC AACGTGTGGC 2640

CATCGCTCGT GGCCTAGCAA TGAATCCAGA CATCATGCTC TTCGATGAAC CAACTTCTGC 2700

CCTTGACCCT GAGATGGTTG GAGACGTACT TAACGTTATG AAGGAATTGG CTGAGCAAGG 2760

CATGACCATG ATTATCGTAA CCCATGAGAT GGGATTTGCT CGTCAGGTTG CCAACCGCGT 2820

TATCTTTACT GCAGATGGCG AGTTCCTTGA AGACGGAACA CCTGACCAAA TCTTTGATAA 2880

CCCACAACAC CCTCGTCTGA AAGAGTTCTT AGATAAGGTC TTAAACGTCT AAACTCAAAC 2940

TGTAAGGATT TCCTTGCAGT TTTTCTACCT CGTATTGGAA TTTTTGATTT TTCGGAAAAT 3000

TATGTTAGAA TTAAGTTTAT GAAATGAGGT TTCCTCATAC CTAGCAAGAC TAGGAATAAA 3060

AATAGAAATT AGGTAGCTAG ATGTCATCTA AGGTTATTGT TACAATTTTC GGTGCGAGTG 3120

GAGACCTGGC TAAACGCAAG CTCTACCCTT CCCTTTTTAG ACTATATCAA TCCGGCAATC 3180

TTTCCAAGCA CTTTGCCGTT ATTGGAACTG CCCGTAGACC TTGGAGTAAG GAATATTTTG 3240

AATCTGTAGT TGTCGAGTCC ATCCTTGATT TGGCAGATAG TACCGAGCAA GCCCAAGAAT 3300

TTGCTAGCCA CTTCTACTAT CAAAGCCATG ATGTCAATGA TTCGGAACAT TATATTGCTT 3360

TGCGTCAATT ACAAGCTGAG CTTAATGAAA AATACCAAGC TGAACACAAT AAGCTCTTCT 3420

TCTTGTCTAT GGCACCTCAG TTCTTTGGAA CCATTGCCAA ACACCTCAAA TCTGAAAACA 3480 

TTGTCGATGG CAAAGGTTTT GAGCGCTTGA TCGTTGAAAA ACCATTTGGT ACAGATTACG 3540 

CAACTGCAAG CAAGTTGAAT GACGAACTCC TAGCAACATT TGACGAAGAA CAAATTTTCC 3600 

GTATCGACCA TTATCTTGGT AAGGAAATGA TCCAAAGCAT CTTTGCAGTT CGCTTTGCAA 3660

ACTTGATTTT TGAAAACGTT TGGAACAAGG ATTTTATCGA CAATGTTCAA ATTACCTTTG 3720 

CGGAGCGCTT GGGTGTAGAA GAACGTGGTG GCTACTATGA CCAATCCGGT GCCCTCCGTG 3780 

ACATGGTCCA AAACCACACT CTACAACTTC TTTCGCTCCT CGCCATGGAC AAACCAGCAA 3840

GCTTCACAAA AGACGAGATT CGTGCTGAAA AGATTAAGGT CTTTAAAAAC CTCTATCATC 3900 

CAACTGATGA AGAACTCAAA GAACACTTTA TCCGTGGGCA ATACCGCTCT GGTAAGATTG 3960 

ATGGCATGAA ATACATCTCT TATCGTAGCG AGCCAAATGT GAATCCAGAA TCAACAACTG 4020 

AAACCTTTAC ATCTGGTGCC TTCTTTGTAG ACAGCGATCG ATTCCGTGGT GTTCCTTTCT 4080 

TTTTCCGTAC AGGTAAACGA CTGACTGAAA AAGGAACTCA TGTCAACATC GTCTTTAAAC 4140 

AAATGGATTC TATCTTTGGA GAACCACTTG CTCCAAATAT TTTGACCATC TATATTCAAC 4200

CAACAGAAGG CTTCTCTCTT AGCCTAAATG GGAAGCAAGT AGGAGAAGAA TTTAACTTGG 4260 

CTCCTAACTC ACTTGATTAC CGTACAGATG CGACTGCAAC TGGTGCTTCT CCAGAACCAT 4320 

ACGAAAAATT GATTTATGAT GTCCTAAATA ACAACTCAAC TAACTTTAGC CACTGGGATG 4380 

AAGTTTGTGC GTCATGGAAG TTGATTGACC GTATTGAAAA GCTCTGGGCT GAAAATGGTG 4440 

CCCCACTTCA TGACTATAAA GCTGGAAGCA TGGGACCTCA AGCCAGCTTT GACCTACTTG 4500 

AAAAATTCGG TGCCAAATGG ACTTGGCAAC CAGATATCAC CTATCGTCAA GATGGTCGCT 4560 

TAGAATAAAA AAATTTCCTG CAAGTTTATG CcTTGCAGGA TTTTTGCTTC TGATTAGATT 4620 

AAACCTTCCA AGAGACCTTT CATAAAGTTT TCTGAGTTAA ACTCTCCAAT ATCATCGATT 4680 

TTTTCACCAA AACCAATCAA TTTTACAGGA ATATTGAGTT CTTCACGAAT GGCTAGAACC 4740 

ACACCTCCTC GAGCAGTTCC ATCAATCTTA GTCAAAACAA TTCCCGTTAA AGGTGTGATT 4800 

TTCGAAAATT CTTTGGCCTG TACTAGGGCA TTTTGACCTG TTGATGCATC AAGTGCCAAG 4860 

AAGGTTTCAT GTGGTGCTTC TGGCACAACA CGTTTGATAA TACGACCAAT CTTTTCCAAC 4920 

TCAGCCATAA GGTTATCCTT ATTTTGCAGA CGACCAGCAG TATCAATCAT GAGAATATCG 4980 

ATACCTTCAG TCACGGCACG TTCCATACCA TCAAAGACCA CGCTGGCTGG ATCAGCTTTT 5040 

TCAGGTCCAG TTACTACTGG AACATCTACT CGTCGGCCCC ATTCAGCTAG CTGAGCTACT 5100 
GCACCCGCAC GGAAGGTATC TGCTGCAACC AGCATGACCT TCTTACCAGC TTGTTTGTAG 5160

CGGTGGGCTA GTTTTCCGAT AGAAGTTGTT TTCCCAACAC CATTCACACC AACAAAGAGC 5220

ATAACTGTCA AGTTATCTTG GAAGTGGATG CTTTCATCGT AGCTACCATC CTTTTCATAA 5280

AGCTCAACCA ATTTCTCAAT GATGACACGA CGAAGTACAT CAGGTTTCTT GGCATTTTCA 5340

AGCTTGGCTT CGTAACGTAG TTCCTCCGTT AAGTTAGAAG CGACTTGGAC ACCAACATCA 5400

CTCATAATCA GCAGTTCTTC CAGTTCCTCG AAAAATTCTT CGTCAACAGA GCGGAAGTTA 5460

GCAAAGAAGG CATTCAAGCG GGCACCGAAA CCTGTGCGAG TTTTCTTAAG ACTGCGGTCA 5520

TATTTTTCCT GAACAGTTTC TTCTGTTTGA GGAGCTTCTG GTTCAAGCAC TTCAGAATTA 5580

TTTTCTTCTA CAGTTCCTTC GTGCTCAAGC TTCTCTTCCT CTGGTAATTC TTCTGAGTTT 5640

GGTAATTCTT CTATTTCTTC TTGAGAAACC CCTACAGCTG GCTCTGAATC CTGACTTTCT 5700

TCAACTGTGT CTTGGATTTC CTCTTCTTGG AACACAGCTT GTTCAACAAT TTCAACCTCT 5760

GCTTCTTCCT GAGAAACTTC CTCAACTTCT GTGAAGGTAG GATCAACATC TTCAGACAAA 5820

TCAAGATTTT CCAGAGCTTC TTTTACAACT TCTTCGATTT TAGGTTCTTC TTTTTTTCCG 5880

AATAGACGGT CAAACAATCC CATATCTTAG TTCTCCTTTA GCACATATTC TTCGATAGCC 5940

CAGGCGACAG CTTCCTCATC GTTGGTCATC GGCGTCACTA CATTTGCGGC TGCCTTTACT 6000

TCAGGAACAG CGTTTTGCAT AGCAACACCA AGACCTGCCC ATTCAATCAT AGAGAGGTCA 6060

TTGGCCTCGT CACCACAAGC CATCACTTCA CTTTGGTCGA TTCCAAGATG GCTiiA'l'TAGT 6120

TTTGCCAAAC CTGTTGCTTT ATGAACATTC TTTGGTGACC ATTCTAGCAA CATTTCACGT 6180

GATTTAAAGA TTTCATATTG GTCAAACAAT TCTGGAGAAA TCTTCTGAAT GGCTGCATCC 6240

AAGGGTTCTT GAGCAAAGGC AGTCACGCAT TTGTTGTAGG TCATTTGACT AGATAAGTCT 6300

TCAAAGTCCA CTGGAACAAA GGTCAAAGCT GGATTGAATT TGGCATAAAG ACTTTCTTGG 6360

TCCGATTGGA TTTGATAAAC TGTTCCTTCT GAGATGGCAT CAAGAGGCAG TGATAATTTC 6420

TCTGTTTCTT CATACAAACG TGCCACATCA TCATATGAAA AGACTGTTTT ATCAAGGATT 6480

TCTCCTGTAT TTTTCTGAAC TAATCCACCA TTAAAAGTAA TGGTATACTC ATCTTCCTGA 6540

CCGTCAGTCC CTAACTCATG GAGAAAGAAA TCCATGGCTT TTAAGGGACG ACCAGTTGTC 6600

AATACGACCT TGATACCACG ATCACGCGCA gCTTGCAAGG TTTCCTTGGT ACGATCCGTC 6660

AGCCTTTTAT CAGTAGTCAG CAAGGTCCCG TCCAAGTCCA ATGCAATCAA TTTTATATCT 6720

GCCATTATAA GCCCTCCATA TAAGCTATAA CCGACCGTTC CTTATGGTGA CCAATCACAG 6780

TCTTTGCTAA TTCTAAAATT TCAGGTCGTG CATTTTCAGG AGCTACAGGA TGTCCCACAA 6840

CCTGCATCAT ATGTAAGTCA TTAAGATTGT CTCCAAAAGC CATGACCTGA TCCATTGTGA 6900

TACCAAGTTT TTTAACTAAT TCAACAATGG CCACTCCCTT ATCGACATAG TCCAGAACAA 6960
TATCAATGGA TTCAAAGCCA GTTGTCATGG CCTTAACACC AGGAACGTTT TCGTTTACCC 7020 
AAGCCTCCCC ATCTTCCAGC GTTTCTTCTG TGAAGTTGGT TGTAAATTTG AAAATGTCAT 7080 
CTGTGATATC TTCCAAACTC GCTACTTTTT GGATATTTTC ATTATAGTGC TGACTCACTT 7140 
TCAAATAGGT CTCATCAACC GTATCTAGAA CATATGAACC CTTCTTACCC GTCAAGAGCA 7200 
GTTTATTGAT ATCTACATAA GGTGAAGTTT TCAGCTTTTC AAAAGTTGCC AGATAAAAGT 7260

CACGAGACAT AGTCGCTTCA TACAAGTCCT GACCTTGATA CTCTACCAAA CTGCCATTTT 7320 
CCGCGATGAA AATAATGTCA TCACGAACAC CAGCAAATAA TTTTTCTAGA GACAGAAATC 7380 
CCCGACCCGA AGCTACCGCA AAGTAAATCC CTTTTTCCTT GTAGGAAACC AAGAGAGACT 7440 

TGAGACGATC CATATCAAAG CGTCCATTCC CATCTAGGAA GGTTCCGTCC ATATCCGTTG 7500 

CTACTAGTTT AATTGTCATC CTTCAATACT TTCTAAATCT TTTAACTTAA CTGAAACAAT 7560 

CTTTGAAACA CCCGATTCTT GCATGGTCAC TCCATAGATG GAATCAGCCG CTGCCATGGT 7620 

TCCCTTACGG TGGGTTACGA CGATGAACTG GCIXSTCCTTG TCAAAGCGGT TGAGGTAATC 7680 

CCCAAAACGT TTAACATTGG CTTCATCCAG CGCAGCTTCC ACCTCATCCA AGATAACAAA 7740

TGGAATAGTC TTGACACGAA TAATGGAGAA GAGCAAGGCA AGAGCCGATA GGGCTTTTTC 7800 

ACCACCACTC ATGAGATTAA GAGACTGGAT TTTCTTGCCT GGTGGTTGGA CAGAAATTTC 7860 

AACCCCAGCT GTCAGCAAGT CTCCTTCAGT CAAAATGAGG TCAGCCTGAC CTCCACCAAA 7920 

CATCTGCTTG AAGGTCACTT TAAAGGACTC ACGAATGACC TCAAAGGTTG ATTTAAAGCG 7980 

TTCCTTGACC TCATCATTCA TCTCTGTAAT GGTCTCAAGG AGCAGGTTTT TCGCAGACAA 8040

AATATCATCA CGTTGGCTAT TTAGGAAATC CAGACGGTTG TGAACTTCTT CGTACTGTTC 8100 

AATAGCGTCT AAATTGACAG GACCCAGTGA GCGTATAGCC TTCTCTAAAT CCTTAACTTC 8160 

TTGCTCTGCC AGATTGAGAT TTTCCAACTC ATGCGCCTTT TCTAAAGCTT CTGTGTAGCT 8220 

GATCTGGTAC TGGTCTGTTA ATTGACTTTG TAGATGGCGC AAGCGCTCGC TAACCTTITC 8280 

TTTCTTGGCT TCAGCACGAG TTTGCTTGCG AATCCACTCT TCATTCTGCT GGCGAGCCTG 8340

ATCCAAATGA CTAGCAATAT CATCCAGTTG ACCCTCAATA TCATCCAACT CAAACTGCTT 8400 

GCGAATCAAA CCTTGTTGGA GATTTGTTTT TTGAGTTTTG GATTCTTCCG CCTGTTGACT 8460 

GAGCAATTCT GTATCAACCT TCTCAAGATT ATCAATCTTT TCTTGAAGAA GGCGCTGGAT 8520 

TTCCTCTTGT TCAAAATCAA GATTGTCCAA TTCCTTGCCT AAGCGTTCAA TATCAGCAAC 8580 

TTCATAACGT TTTTGCCCTT GCAGTTCTGT CTTAAGCAAA CGAGCTTGCG CTAGCTCTTC 8640 
CTGCAAGTTT TGATAGCGTT CTTGGATGGC ATTTTTGTTA GACTTAATCT CTTCAATCTC 8700

AGCTTCCAGA TTTTGCTTGT CACTGGAGAT TGCAGCAAGA CGCTCTTGGC AGTTTTCCTT 8760 

ATCCGCTTGC CAATCTCCCT CGGAAAGACG ATCTATTTCC TCTTCTTGGA GTTTCCAAAG 8820 

AGTTTCCAGT TCTTCAACTT GCTGACTAGT TTGCTGATAA GCGAGGAACA AGCCTTGCTC 8880 

CTGAATACGT GCCTGCTCTC CTTGAGATTT AATAGCTTCT AATGACTCGG TCAATCTGGC 8940 

CATCTCATCT TGCAAGGTCT TCAAAGTCGC CTCTTCTGAA CCCAAGCTTG CTTCTTCTTC 9000 

AGCAATTTCT TTTTGTAATT GCTCCAGTTC TGGCTTGATA AAAATGCTGT TATTCTGGCG 9060 

ATTGGCACCA CCTGCATAAG AACCACCTGT GCGCAACTCT GTCCCATCCA ATGTCACCAT 9120 

ACGAACCTGA TAACGAACTT GGCGAGCTGC TGCACGCGCA TGTTCTACGG TATCAAAGAT 9180 

AGCCGTCGTA GCTAGCAAGT TCTTGAAAAT GGCTTCCAGT CTAGTATCAA AAGTCACCAA 9240 

CTCATCTGCC ATCCCAAGGA AACCTGGGCT TACAGCGATA GCATCTTGGT TCTGACTAGA 9300 

AATCGTACGC GCCTTGATAG TGGTCAAAGG AAGAAAGGTT GCACGACCGG CTCTGTTCCG 9360 

TTTAAGGAAG TCAATAGCCT TGGTTGCCGA CTCTTCATCT TCTACGATGA TATGCTGGCT 9420 

ACTTGCCCCT AAGGCAATCT CTAGGGCAGT TTGATAATAA ACATCAAAGG TCAGATGCTC 9480 

ACTGACTGCA CCAATAATCC CACCTAGGCG ATCTTTTTCT TGGAGAACAC TCTTAACACC 9540 

TGCATAAAAG TTACTATGAT TTCTCAGGAT ATTTTCCAAA CTTTGAGCTC TGGCCTGCTT 9600 

GTTTTTGAGA TTATCCAGAC GGTCAAAGAG TTnnCTTTGT TGAGCTTGAT AGGAAGTTTT 9660

CTGCTCCTCT TGCTCCTTGG CAATAGCTTG GTAGTCAGCC AATAATTTCT GAACCTGCTC 9720 

CTTGGCAGTT TCAAGCTCTT CCTTTTGCTG ACTAGCCTTC TCTTTAGCTA TAGCTAATTG 9780 

CTCTTTCAGC TTTTCTAGTT GATCTGCTTG TTTTTGAGAA AGCTGACGAC TATTTTCCAA 9840

CTCATTCTCA ATACGGGTCA ACTGGTTTGA GACATCCGCT TCTTCTTGTA AAAGAGCTAC 9900 

AAAGCGTTCA CGTAAGAGCT CAATCATCTG ATCAGGATCG TCTGAGAAAG CCAGCAATTC 9960

AGCTTCTAAA CGATTGAGTT TTTGATTATT TTGGACTAGA TTTCCCTCTA ACAGAGCTAA 10020 

AGAGCTTTCT TTATCAGACT TTTCTTTGCT GAGTGAATTT CTCTTATCCT CCAAAGCAGC 10080 

CAAACGGGCT TGTGCCTCCT GTTGATTCAA GGCCACTTGC TCGGACTCCA GTTTCGATAG 10140 

GGCTAATTTT CTTTCTAAAT CACTAATCAG ACTAGTCAAG TCCATCAAAC TGCCTTGGTC 10200 

TTTGGCCATT TCAGCCTGTA AATCTTGGCG TTGCTTTTTA AGAGTTTGAT TTTCTTCTTC 10260 

TAATTTTTCA CGCTTTTGGT AATAACTCAT CAAGAGTTCT TGAACCTGAG TCAACTCTTC 10320 

TTCTGTCGAC TCTAGTTCAG CCTTATTTTC CTTGATTTGA GCAACCAGAA CATCTAAATA 10380 

AATAGCCTTA CGTTGTCCTT CCAAGTCTAA AAACTTACGG GCATTCTCAG CTTGCTTCTC 10440 
AAGAGGCTTG ATTTGATTAT CCAACTCGTA GATAATGTCC TCTAAGCGGT CCAGATTATC 10500

CTGAGTTTGC TGCAGTTTAC TCTCGGTTTC TTTTCTGCGA GTCTTGTATT TTAAAACTCC 10560 

AGCAGCTTCT TCAAAAATAG CTCGTCGTTC CTCAGGCTTG GAATTAAAAA TCTCCTCAAC 10620 

CTTCCCTTGG GAAATAATAG AGAAGGAATC TCGTCCCAAT CCAGTATCCA AGAAGAGGTC 10680 

ATGAATATCA CGCAGACGGA CTTTCTTGCC GTCAATCTTG TATTCGCTAT CTCCACTACG 10740 

ATAGACATGG CGTTCCACCC TGATTTCTTG ACCTGCATCC TTGATAAATC CGTCATGATT 10800 

ATCCAGAGTC ACAACTACAG AAGCATAATT GAGCGGTTTG CGACTTTCGG TTCCAGCAAA 10860

GATGATATCC GGCATCTTGC CCCCACGGAG ACTCTTGACA CTAGACTCCC CCAAAGCCCA 10920 

ACGCAGACTT TCTGTAATAT TGGACTTTCC AGATCCATTG GGTCCAACAA CTGCCGTCAC 10980 

ACCTTGGTCA AAAACGACCT TGGTCTTATC AGCAAAAGAC TTGAACCCCT GAATTTCGAT 11040 

TTCCTTTAAA TACATGAATC CAGCCCCTTC TCAACGGCAT TTTTGGCAGC TTCCTGCTCT 11100 

GCTAATTTCT TAGAACGACC TTGGCCTTGA CCGATGCTCT TACCTTCAAC AAGAACTTCT 11160 

ACATCAAAAA CCTTATCGTG AGCAGGCCCT GTTTCAGAAA TCACCTGATA ACGAATAGCC 11220 

ACATCACCAT TGACCTGAAG CAACTCTTGG AGATGGGTTT TATAGTCTGT AATCATCTCA 11280 

AACTCGCCTG CTTCAACCTT AGGAATCATG ACTTGATAGA TAAATTCCTT GACCTTGGCC 11340 

ACATCCTTAT CCAAAAGAAG GGCACCAAGA AAGGCTTCAA AGGCATCACC AAGAATGGTG 11400 

TCACGATTGC GACCACCTGA TTTTTCTTCC CCTTTACCCA ACTTGATAAA CTGGTCAAAC 11460

TGGCAATCAC GCGCAAAACC AGCTAAACTC TCCTCACGGA CAATCATAGC ACGGAGTTTT 11520 

GATAGGTCAC CTTCAGGCTT TTTAGGATAT TTTTTATATA GATATTCTGA AATCAATAAC 11580 

TGTAGAACAG CGTCTCCTAA AAATTCCAAG CGTTCATTGT GTGAAATTTT TAAGAGGCGG 11640 

TGCTCATTGG CATAACTCGT ATGAGTAAAG GCAGTTTCCA GTAACTTTTT GTCTGCAAAT 11700 

TCGATTGCAA AATGATTCTT TAGTACAGTT TGTAATTCTT TCATACCAAC CTCTTTCTAA 11760 

CTGATAATAG TCCTTTTTAT TATATCAAAA AAAGCCCCCT GAGTCACTCT AAAACGGGAC 11820

TGGAAAGCAT TTGGGAATTC TTTAGACAGA GATTCTCAGT TTTAGCGGCA AATTTGGGTC 11880 

AGGATAAAGA AAAAAGCCCT ATTAAAGGCT TTTTAGGATG TTTACATCCA CCCTGAGGGA 11940 

ATCGAACCCC CATCTCAAGA ACCGGAATCT TACGTGATAT CCATTACACT AAGGGTGGAA 12000 

ACTTGTTTTA TTATAACAGA AATTTGCTCT AATAACAAGT TTTl^GGTCA AAGACCCCGT 12060 

CTTAGTGGGA AGCATCCCCA TTCCAGATGG AGTTTTTCAC GATCACATAA TCAACGTGTT 12120 

TAAGGTCAGC AACCTGACGT CCACCTGCAT AAGAAATAGC ACTTTGAAGG TCTTGTTCCA 12180 
TCTCAGTTAA AGTGTCTTGC AGATGACCTT TAGCAGGAAG CAAGATACGT TTGCCTTCCA 12240

CATTTTTGTA AGCACCTTTT TGATATTGTG AGGCTGAACC ATAATATTCT TTGAACTGTT 12300

CACCATCGAC TTCAATCGTT TTCCCTGGAC TTTCAATGTG TCCTGCAAAG AGGGAACCAA 12360

TCATGATCAT GCTAGCACCG AAGCGGATAG ACTTAGCAAT ATCACCGTGA GTACGAATTC 12420

CTCCATCAGC GATAATCGGT TTACGCGCAG CCTTGGCACA CCAGCGTAGA GCAGCCAACT 12480

GCCAACCACC TGTACCAAAA CCAGTCTTAA CCTTGGTGAT ACAAACCTTA CCAGGACCGA 12540

TTCCGACCTT AGTAGCATCC GCACCAGCAT TTTCCAATTC ACGCACAGCT TCTGGTGTTC 12600

CCACATTTCC AGCAATGACA AAGGTATCTG GCAATTCTTT CTTGATGTGT TGAATCATAG 12660

AAATCACGCT ATCCGCATGA CCATGAGCAA TATCAATAGT GATATACTCA GGAGTATCAG 12720

CCTTGAGCTG GCTAACAAAA TCATACTCAT AATCCTTAAC ACCGACAGAG ATAGAAGCAA 12780

TGAGCCCTTG ATTGTGCATT CGTTTAATAA AAGGAATGCG TCCTGCCTCA TCAAAACGGT 12840

GCATAATGTA GAAGTAACCA CCTTTAGCCA GTTGCTCTGC TACATTTTCA TCCAAAATCG 12900

TCTGCATATT CGCTGGCACA ACAGGTAGTT TAAAGGTGTG ATTTCCTAAA GTGACACTTG 12960

TATCCGCTTC TGCACGGCTT TTAATGACAC ATTTATTTGG AATCAATTGA ATATCTTCGT 13020

AATCAAAAAT TGGAAATTCA TTTAACATAT CGATGTCTCG TTTCTTTTGT AATGACCTAC 13080

CTATGCTCTT GCATCACTAC GCCTTTTCCG ACGTTTCCTG G 13121


(2) INFORMATION FOR SEQ ID NO: 127: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9578 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127: 

CCGAATGCAA TGTTTACGGT TGAACTTGAA AATGGACATC AGATTTTAGC AACAGTTTCT 60 

GGTAAAATTC GTAAAAACTA TATTCGTATT TTAGCGGGAG ATCGTGTTAC TGTCGAAATG 120 

AGTCCATATG ACTTGACACG TGGACGTATC ACTTACCGCT TTAAATAATC GAAAAACTTG 180

GAGGGATAAG AAATGAAAGT AAGACCATCG GTCAAACCAA TTTGCGAATA CTGTAAAGTT 240 

ATTCGTCGTA ATGGTCGTGT TATGGTAATT TGCCCAGCAA ATCCAAAACA CAAACAACGT 300 

CAAGGATAAG ATAGAAAGGA GAAAACATGG CTCGTATTGC TGGAGTTGAT ATTCCAAATG 360 

ACAAACGCGT AGTAATCTCA TTGACTTATG TTTATGGTAT CGGACTTGCA ACATCTAAGA 420 

AAATTTTGGC TGCTGCTGGA ATCTCAGAAG ATGTTCGTGT ACGTGATCTT ACATCAGATC 480 



AAGAAGATGC TATCCGTCGT GAAGTGGATG CAATCAAAGT TGAAGGTGAC CTTCGTCGTG 540

AAGTAAACTT GAACATCAAA CGTTTGATGG AAATCGGTTC ATACCGTGGT ATCCGTCACC 600 

GTCGTGGACT TCCTGTCCGT GGACAAAACA CTAAAAACAA CGCCCGCACT CGTAAAGGTA 660 

AAGCTGTTGC GATTGCTGGT AAGAAAAAAT AATATAGGAG GTAAAAGTCT TGGCTAAACC 720 

AACACGTAAA CGTCGTGTGA AAAAGAATAT CGAATCTGGT ATTGCTCATA TTCACGCTAC 780 

ATTTAATAAC ACTATTGTTA TGATTACTGA TGTGCATGGT AATGCAATTG CTTGGTCATC 840 

AGCTGGTGCT CTTGGTTTCA AAGGTTCTCG TAAATCTACA CCATTCGCTG CTCAAATGGC 900 

TTCTGAAGCT GCTGCTAAAT CTGCACAAGA ACACGGTCTT AAATCAGTTG AAGTTACTGT 960 

AAAAGGTCCA GGTTCTGGTC GTGAGTCAGC TATTCGTGCG CTTGCTGCCG CTGGTCTTGA 1020 

AGTAACAGCA ATTCGTGATG TGACTCCAGT GCCACACAAT GGTGCTCGTC CTCCAAAACG 1080 

TCGCCGTGTA TAATCATCGC ATTACACTGC TTTTCGTTTA AGAGGGAGTA ACTAAATGAT 1140

CGAGTTTGAA AAACCAAATA TAACAAAAAT TGATGAAAAT AAAGATTATG GCAAGTTTGT 1200 

AATCGAACCA CTTGAACGTG GCTACGGTAC AACTCTTGGT AACTCTCTTC GTCGTGTACT 1260

TCTAGCTTCT CTACCAGGAG CAGCTGTGAC ATCTATCAAC ATTGATGGTG TGTTACATGA 1320

GTTTGACACA GTTCCAGGTG TTCGTGAAGA CGTGATGCAA ATCATTCTGA ACATTAAAGG 1380 

AATTGCAGTG AAATCGTACG TTGAAGACGA AAAAATCATC GAAGTGGATG TTGAAGGTCC 1440 

TGCTGAAGTA ACAGCTGGTG ACATTTTGAC AGATAGCGAT ATTGAAATTG TAAATCCAGA 1500 

TCATTATCTC TTTACAATCG GTGAAGGTTC TTCTCTAAAA GCGACTATGA CTGTTAACAG 1560 

TGGTCGTGGA TATGTACCTG CTGATGAAAA TAAAAAGGAT AATGCACCAG TTGGAACACT 1620 

TGCTGTAGAT TCTATTTATA CACCAGTTAC AAAAGTCAAC TATCAAGTGG AACCTGCTCG 1680 

TGTAGGTAGC AATGATGGTT TCGACAAATT AACCCTTGAA ATCTTGACAA ATGGAACAAT 1740 

TATTCCAGAA GATGCTTTAG GGCTTTCAGC ACGTATTTTG ACAGAACATC TTGATTTGTT 1800 

TACAAATCTT ACTGAGATTG CTAAGTCAAC TGAAGTGATG AAAGAAGCTG ATACTGAATC 1860

TGACGACCGT ATTTTAGATC GTACGATTGA GGAACTGGAC TTGTCTGTGC GTTCATACAA 1920 

CTGTTTAAAA CGTGCCGGTA TCAATACTGT GCATGATTTG ACAGAAAAAT CTGAAGCAGA 1980 

GATGATGAAA GTACGAAATC TTGGACGCAA GAGTTTGGAA GAAGTGAAAC TCAAACTCAT 2040 

TGATTTGGGT CTTGGATTAA AAGATAAATA AAGGAGGAAT ACATGGCTTA CCGTAAACTA 2100 

GGACGCACTA GCTCACAACG TAAAGCAATG CTTCGCGATT TGACAACTGA CCTTTTGATC 2160 

AACGAATCAA TCGTGACAAC TGAAGCTCGT GCTAAAGAAA TCCGTAAAAC TGTTGAAAAA 2220 



WO 98/18931



PCT/US97/19588 




ATGATTACTC TAGGTAAACG TGGTGATTTG CATGCACGTC GTCAAGCAGC TGCTTTCGTA 2280 

CGTAATGAAA TCGCATCTGA AAACTATGAT GAAGCAACTG ATAAGTACAC TTCTACTACA 2340 

GCACTTCAAA AATTGTTCTC AGAAATCGCA CCTCGTTATG CTGAACGTAA CGGTGGATAC 2400 

ACTCGTATCC TTAAAACTGA ATCACGTCGT GGTGATGCAG CGCCAATGGC GATCATCGAA 2460 

TTAGTATAAA ATCATCAATT TTGTTGAGTG TTATGATGAT GGAGTCTTGT GCTCTTAGTC 2520 

TAGCTCTGGT CTACCGCTAG GATTTCGGTC CTAGCGGGAA CACTCATCAT AAGTTGGGAT 2580 

AGTAGACGCT TGTTTACGAA ATTGTTTTTT TCTTAAGAAC AACTTCGTAA GCAGGCGTTT 2640 

TTGAGTATTT TCGTTAGAAT TATGCTATAC TATTTGAAAA GAATCCTGTT TAATGTTAAG 2700 

GTTTCTTATT TTAAGAAGAA TTGGAGTTTA CTTATGAAAG CCATTATAAC TGTTGTTGGT 2760 

AAAGATAAAT CTGGAATTGT TGCAGGTGTT TCTGGTAAAA TTGCAGAATT AGGATTGAAT 2820 

ATTGACGATA TCTCTCAAAC TGTCTTGGAT GAATATTTTA CGATGATGGC TGTTGTATCT 2880 

AGTGATGAAA AGCAAGATTT TACCTATCTT CGTAATGAAT TTGAAGCTTT TGGGCAAACT 2940 

TTGAATGTAA AAATCAATAT TCAGAGTGCA GCGATTTTCG AAGCTATGTA TAATATCTAG 3000 

GAGGTCATCA TGGATATTAG ACAAGTTACT GAAACCATCG CCATGATTGA GGAGCAAAAC 3060 

TTCGATATTA GAACCATTAC CATGGGGATT TCTCTTTTGG ACTGTATCGA TCCAGATATC 3120 

AATCGTGCTG CGGAGAAAAT CTATCAAAAA ATTACGACAA AGGCGGCTAA TTTAGTAGCT 3180 

GTTGGTGATG AAATTGCnGC TGAGTTGGGA ATTCCTATCG TTAAtaaGCG TGTATCGGTG 3240 

ACACCTATTT CTCTGATTGG GGCAGCGACA GATGCGACGG ACTACGTGGT TCTGGCAAAA 3300 

GCGCTTGATA AGGCTGCGAA AGAGATTGGT GTGGACTTTA TTGGTGGTTT TTCTGCCTTA 3360 

GTACAAAAAG GTTATCAAAA GGGAGATGAG ATTCTCATCA ATTCCATTCC TCGCGCTTTG 3420 

GCTGAGACGG ATAAGGTCTG CTCGTCAGTC AATATCGGCT CAACCAAGTC TGGTATTAAT 3480 

ATGACGGCTG TGGCAGATAT GGGACGAATT ATCAAGGAAA CAGCAAATCT TTCAGATATG 3540 

GGAGTGGCCA AGTTGGTTGT ATTCGCTAAT GCTGTTGAGG ACAATCCATT TATGGCGGGT 3600 

GCCTTTCATG GTGTTGGGGA AGCAGATGTT ATCATCAATG TCGGAGTTTC TGGTCCTGGT 3660 

GTTGTGAAAC GTGCTTTGGA AAAAGTTCGT GGACAGAGCT TTGATGTAGT AGCCGAAACA 3720 

GTTAAGAAAA CTGCCTTTAA AATCACTCGT ATCGGTCAAT TGGTTGGTCA AATGGCCAGT 3780 

GAGAGACTGG GTGTGGAGTT TGGTATTGTG GACTTGAGTT TGGCACCAAC CCCTGCGGTT 3840 

GGAGACTCTG TGGCACGTGT CCTTGAGGAA ATGGGGCTAG AAACAGTTGG CACGCATGGA 3900 

ACGACGGCTG CCTTGGCCCT CTTGAACGAC CAAGTTAAAA AGGGTGGAGT GATGGCCTGC 3960 

AACCAAGTCG GTGGTTTATC TGGTGCCTTT ATCCCTGTTT CTGAGGATGA AGGAATGATT 4020 
GCTGCAGTGC AAAATGGCTC TCTTAATTTA GAAAAACTAG AAGCTATGAC GGCTATCTGT 4080 

TCTGTTGGAT TGGATATGAT TGCCATCCCA GAAGATACGC CTGCTGAAAC TATTGCGGCT 4140 

ATGATTGCGG ATGAAGCAGC AATCGGTGTT ATCAACATGA AAACAACAGC TGTTCGTATC 4200 

ATTCCCAAAG GAAAAGAAGG CGATATGATT GAGTTTGGTG GTCTATTAGG AACTGCACCC 4260 

GTTATGAAGG TTAATGGGGC TTCGTCTGTC GACTTCATCT CTCGCGGTGG ACAAATCCCA 4320 

GCACCAATTC ATAGTTTTAA AAATTAAGAA AATAGGAGAA ATTTTAAGTT CTATTTAAGA 4380 

TTAGACGTGT ATACTATAAT CATTAAATAA AGACCTCCTA ATATTATTTG AAACAGATAA 4440 

CACTGAATTA GTTTGAATTT GATTTTCATC TAATATCTTT ATTTAATGAA CTCCTAAACT 4500 

TTTTCATAAT AATCTCCTTC AAAAGTCGCC TGTATGGGTG GCTTTTATTT TATCATTCAT 4560 

GATATAATAG AAGCAAACGG AGGACGGAAA ATGGTAAAAG TACGATTGTA TTTGGTACGT 4620 

CATGGCAAGA CCATGTTTAA CACGATTGGT CGCGCGCAAG GTTGGAGCGA TACTCCCTTA 4680 

ACTGCTGAAG GTGAACGAGG GATTCAAGAG TTAGGAATCG GTTTGCGAGA ATCTGATCTA 4740 

CAGTTTGAGC GTGCTTATTC GAGTGATTCT GGTCGTACCA TTCAGACCAT GGGAATTATC 4800 

CTTGAAGAAC rTGGCTTGCA GGGGGAAATC CCTTATCGCA TGGACAAGCG TATCAGAGAA 4860 

TGGTGTTTCG GTAGTTTTGA TGGAGCCTAT GATGGCGATC TTTTCATGGG CATTATTCCT 4920 

CGTATCTTTA ATGTGGACCA CGTTCACCAA TTGTCTTATG CTGAACTGGC TGAGGGCTTG 4980 

GTAGAGGTCG ATACAGCTGG TTGGGCTGAA GGCTGGGAAA AACTCAGTGG CCGAATCAAG 5040 

GAAGGCTTTG AAATGATTGC AAAAGAAATG GAAGATCAAG GTGGAGGTAA CGCCCTTGTT 5100 

GTCAGCCATG GAATGACTAT TGGAACCATT GTTTATCTGA TTAATGGCAT GCATCCGCAT 5160 

GGTCTGGATA ATGGTAGCGT GACAATCCTT GAATATGAGG ACGGCCAGTT TAGGGTTGAA 5220 

GTTGTCGGTG ACCGTAGTTA CCGAGAGCTA GGACGTGAGA AGATGGAAGA AGGCTCTATT 5280 

TAATCAGTCT AGACTTGCTT GCCATGAGCT AGGGATTTGA TAAGAATATC AAGATAAGAA 5340 

AAAACAGCCG AGGGCACTCC TTTCGGCTGT TTTTGATGTG GAAAACTAAA GTGTAATGCT 5400 

ATTGCTTTTA GAGATTTTCA TAAACAAGAG CAAGGAACCT ACTGTTAGAA CAGTCAGGAT 5460 

AGTTGACAAG GTTGCGGCTA CACCGTAATT TCCTCTGAGA ACCTCTGTAT AAATAGCTAC 5520 

AGTCATTGTT CTTGTTTTGA CATTGTAGAG GAGGATAGAA GTAGAGAGTT TTGAAATCAT 5580 

TGTGACTCAA GATAAGATGG CTCCAGAAAT GATACCAGAT AGCATCATTG GAGTTGTAAT 5640 

CTTAGCAAAG GTATTGAGAC GACTACTTCC TAAGCTTTCA GCAGCTTCTT CAATACTTGG 5700 

TGCTATTTGT TGTAAGCTAG CAACAGATGA GCGAATAGTA TAAGGTAATC TTCTGGCAGA 5760 
TAGAGACATA ATCAAGATGA AAGCAGTCCC TGTAATCATA AGAAATCCAC TTCCAAATAG 5820 

ACCAGTATTG AAGGAAGAAA TGAAGGCAAT CCCTAGAACG GTTCCTGGTA CAATATAAGG 5880 

TACCATACTG AGGCTGTCAA TTAAGTTTGT AAACAAATTC CGTTTTCTAA CGGCTAGGTA 5940 

GGAGATAAAT GTCGCAAATA GAACAACTAG AACTAAGGCA ATCAAAGGGA TACGAATGGT 6000 

ATTGAAAATA GCAGATCCCA TACGATGGAA AGCTACCTTG TAACTGTTTG GAGAATAACC 6060 

TTTAACAGAT ACCATACCTG ATGTTTTTAG GAAAGAGGTA TAAATTAAGT AGATTTGAGG 6120 

TAAAACAGAG ATAAAGATAA TTCCGTAGAC TGTTGCATAA ATGGCAGCCA TTTTTCCTTT 6180 

TGTAGTTTTT TTAGGCTCAA TTGGATGGAG CAGATTCATG CTGAAACTGT AGCGGTTTGC 6240 

AATGTGTTTT TGGATAAGGA AAATTGCCAA GGCAATGATA ATCGCCATAA TTGCAAAAGC 6300 

AGAATTTCCT CCAACCTCGC TAATAAATTG GGTATAAATC AGGACAGGGA AAGTCCGATA 6360 

CCCTTCGCCA ATCAACATAG GCGTTCCAAA GTCTGAGAAT GCTCTCATAA ATACAAGCAA 6420 

GGAGCTGCTA GTAAGGTTGG AACTAGGAGA GGTAAAACAA CCGTTACGAT AGGTTTAAAT 6480 

CCGAAGGACC CCATGCTTTC AGCTGCTTCA AGTAGAGAAT TGTCAATACT GTTCATTGTT 6540 

CCAGCAACAT ATAGAAATAC CAGTGGGAAT AGTTGCAGTG TAAAGACAAG TACAATTCCT 6600 

TTGAATCAAT AAATATCGAT AGCTGGAAGA TAAAGGGCAT TTGTCAAAAA TTTAGTGATG 6660 

ACCTCATTTC GTCCTAGCAA GAGAACCCAG GAGTAGGCTC CTACGAAAGG AGCTGACATG 6720 

GAAGCAATGA TAATCAATAT TTGTAGAAAT TTCTTCCnrT ^nn^rfrr-?.'!^^ ^'*^*"Ar;.ir ;;-7m,-. 

AGATAAGCTA ATAGGGTTCC TACAACTAAG GAAGTGATAG TAGCGGTAAT GGAAACCTTG 6840 

AAACTGTTGA CTAGTGTCTC AGAGTAGTAG GCTTTACTAA AGAAAGTGAC AAAATTAGCT 6900 

AGTGAGAATT GTCCTTCATG TATAAGTGCT TGCTTGAGCA CGGTAACGAT AGGATAAACG 6960 

AGAAAGATAG GATAGGTAAG AAAGAGGAAG AAAGAGGAAA CTGTCCAAAT ATTTAGTTTT 7020 

TTACGTTCCA TGGTTGACTC CTTTTATCAG GTTTTGGGAA CCATCTGCAG AAAAGATGTT 7080 

TAATTTTTGC GTATTGATTC GTAGACGAAT ACGATTGCCT TTTTGTAGAT CTTCTTCAAA 7140 

AGTTGATTCT TCACTAACTT GAATTTTTGA GGCAAAACCT GTCTCAATGA AATAATCCGT 7200 

ATTTAGTCCA AGATAGACGC TATCTCTAAT AGTTCCTTCA ATATCTCCAG ATTCATCTTT 7260 

GATAAACTCT TCGGGACGAA TGCTTACATG AATAGCTTGC TCCTCAACCT GATCAAGAGC 7320 

TGGCATTCGA AGGGCATAGC CATCTGAAAA GACGATATAA GCGCCGTCGC TCCGTTTTTC 7380 

AAGATTGGCA GGGATAATAT TTGTGCGTCC GATAAAGGTT GCCACAAACT CATTAGCTGG 7440 

TTTATGATAG AGTTCTTTTG GTCGGCCGAT TTGTTGGATC ACCCCATCTT TCATAACAGC 7500 

AATTTGGTCT GAAATAGCCA TGGCTTCTTC TTGGTCGTGG GTTACATAAA CAGTTGTAAT 7560 
TCCCACTTCG TGTTGGATTT CTCGGATGGC TTGACGCATA TCCAAGCGAA GTTTGGCCTC 7620 

CAGATTACTA AGTGGCTCGT CCATGAGGAG AACACTTGGA TTAACCGCTA AGGCGCATGC 7680 

CAAGGTGACA CGTTGTTGTT GTCCACCACT GAGTTTATCG GGCTTTCGAT CCGCATATTG 7740 

AGCAATTTGC ATGAGTTCAA GATACTTGTT GGTCTGTTGA ATCAATTCTT CTTTTGGAAC 7800 

CTTCTTTTGC ATAAGACCAA AAGCAACGTT GTCTCGGACA GTCAAATGTG GGAAAATAGC 7860 

GTAGTTTTGG AAAACCATCC CGATATTGCG TTTGCTGGGT TCCATATTAT TGATTTTTGT 7920 

ATCATCGAAG TAAAATTCTC CACCTTCGAT ACTGTTGAAA CCTGCAATCA TACGAAGAAG 7980 

GGTCGTTTTC CCACATCCTG AAGCTCCAAG AAGGGTAAAG AGACTTCCTT TTGGAATTGT 8040 

AATGTTCAAA TTCTCAATAA CAGGGACATC GTGGTAGATT TTTTTGGCGT TAATAATTTT 8100 

GATCTCACTC ATAGTGAACC TCTTTTACTG TTTAGATTGG ATATCTGTAA AGACTTCGTT 8160 

GTATTTCTTA ACGATATCTG ATTTATTCTT GATGACATAA TCATAATCTT CAGTGAGTGT 8220 

TTTGATTTTG TCAATTGGTT TCATGTTTTC GCTTGTTTTA GCATTTTTAC GAACAGGACG 8280 

GTTAGTAGTG GTTGTACCAA GTGTATCTTG TACTTCTTGA GAGATAATAA AATCGATAAA 8340 

TTTCTTGGCA TTTTCCATAT TTTTAGATTT TTTAACGATA GCAGCACTAG CAGGTAGGAA 8400 

GACGGTTCCT TCTTTTGGAT AGACTACCTT AATGTTAGCT CCGTCATTTA AGAGTTTAAC 8460 

TGCTGGATCT TCATAAGAGA GACCAACAGC CATTTCTCCA TCAGCGACTA CTTTATAGAC 8520 

ACTAGATGAA CTTGAACCGA TTTTACCATC AATAAGTGTG AAAAGATCTT TTACATAAGA 8580 

CCAAGCCTTA TCATCTTTGT AACCACCTTG AGCTTGTAGC ATATTTGTTA ATTGAGCAAA 8640 

GGCGCTAGAA GAGTTTGCTG GGTCAGCAGT TGCGATTTTT CCTTTTAGTT CAGGTTTGAA 8700 

AAGATCGTTA TATCCTTCGA TGTTCATGCC TTTAGTTAAA TCAGGGTTGA CGATTAAAAC 8760 

ACTACCATCT AGTGTATAAG GAGTAGAGTA GCCAGTTGTG TTTTGATATT CTTTGATAAC 8820 

ATTATCATTT TCTTTTGAAG TATAGTTTTC AAAGAGTTCT CCGTGGGTAG TATATTGTGT 8880 

ATAAGAACCA CCAAAGATAA CATCAGCTAC AGGAACTTCT TTTTCTGACT CTAGTTTTTT 8940 

GAAAAGTTCT CCAGTACCAG CTTGAATCAG TTCTACTTTG ATACCATATT TTTCTTCAAA 9000 

GGCAGGAATA GTTGCTCCAA TTAAGCCCTC TGAGTTTGGT GAATAAACGA CTAGCGAACC 9060 

GCCGTCTCCT TTATCAGATG AACTGTCATC GGCAGATTCA TTAGAAGAAC AAGCAGCATA 9120 

ATACATCCAT TTCTTTTTCA TGATGGATAC CTCCGTTGTG TTATTTAAGT TTATTTTAAA 9180 

ACAATGTAAG CGTTTTTAAA ACATACAATT CTATTCTATA GTGTATTGAA TCTATAACAG 9240 

TACACTTTGA CTGCTAAAAT ATTTCTATAA ATTAATTTGA CTTTCCTGAT AGAGATGTTC 9300 
ACATCTTATT TCAATTCACT ATATTAGAGT AAAATTCTCT ACAAAAAGAA GAATAGCCTA 9360 

TTTTACTATT CTTCTGAGTG ATTTCAATTC CTTTGGGGAA ATATGGAGAT ACTTTTTAAA 9420 

TCCTGACAAA TGGTTGTTTC TTTTTCTAAA TCGGTGATAC TGTATCGGAG AATGCGCGTG 9480 

AGGTCACAAA GGCTGCGATA GAGCTTCTAT GGAGAATTTC TTTTTGGAGA GATTTTTTAA 9540 

AGGAATGAGA CATCCGCTAC CTCCTTGGAA GGTTTTTG 9573 
(2) INFORMATION FOR SEQ ID NO: 128: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13440 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128: 

CGGGCTGTTG TGACGATTCT TATTTCTATC TGTGTTATCT TTTTGGGAAC TATTTTGGGT 60 

GTTGTCTTGG CTTTTGGGCA ACGTTCAAAG TTTAAACCGC TTGTTTGGTT GGCCAACTTG 120 

TACGTTTGGA TTTTCCGTGG GACACCGATG ATGGTTCAAA TTATGATTGC CTTTGCTCTT 180 

ATGCATATCA ATGCTCCGAC TATTCAGATT GGAATTTTAG GTGTTGATTT TTCGCGTCTG 240 

ATTCCAGGGA TTTTGATTAT CTCTATGAAT AGTGGTGCTT ATGTTTCGGA GACTGTTCGT 300 

GCCGGAATCA ATGCGGTTCC AAAAGGTCAG CTAG.'^J^.CCCG CTTATTCGCT AGGGattCGT 360 

CCTAAAAATG CGATGCGTTA TGTGATTTTG CCACAAGCAG TCAAAAATAT CTTGCCAGCA 420 

TTGGGGAACG AATTTATCAC CATTATCAAG GACAGCTCCC TCTTATCAGC TATTGGGGTC 480 

ATGGAGTTGT GGAATGGGGC TACAAdAGTT TCTACAACAA CCTATCTACC TTTAACACCA 540 

cttttatttg cagcatttta CTACTTGATT ATGACCTCTA TTCTGACAGT AGCCTTGAAA 600 

GCTTTTGAAA AACATATGGG ACAAGGAGAT AAGAAATAAT GACAGAAACC TTGATAAAAA 660 

TTGAAAATTT ACATAAATCC TTTGGAAAGA ATGAAGTATT GAAGGGCATC AACCTCGAGA 720 

TTAAAAGAGG AGAAGTTGTC GTTATCATCG GTCCTTCAGG GAGCGGGAAA TCTACCTTGC 780 

TTCGCTCTAT GAATTTGTTG GAAGAAGCAA CCAAGGGGAA GGTTATCTTT GAGGGAGTCG 840 

ATATTACGGA CAAGAAGAAT GACCTGTTTG CCATGCGTGA GAAGATGGGC ATGGTTTTTC 900 

AACAATTCAA TCTCTTTCCT AATATGACTG TGATGGAAAA TATCACCTTG TCCCCTATCA 960 

AGACCAAAGG TGACAGTAAG GCCGTTGCAG AGAAAAGAGC TCAGGAACTT TTGGAAAAAG 1020 

TTGGTTTGCC AGATAAGGCA GACGCTTATC CACAGAGTTT GTCAGGTGGC CAGCAACAGC 1080 

GGATTGCCAT CGCGCGTGGG TTGGCTATGG AACCAGATGT TTTGCTCTTT GACGAGCCAA 1140 
CTTCAGCCCT AGATCCTGAG ATGGTTGGAG AAGTTCTGGC TGTTATGCAA GATCTAGCCA 1200 

AGTCAGGAAT GACCATGGTT ATCGTAACAC ATGAGATGGG ATTTGCCCGT GAGGTGGCAG 1260 

ATCGTGTCAT CTTTATGGCA GACGGTGTGG TTGTTGAAGA CGGAACACCT GAGCAGATTT 1320 

TTGAACAAAC CCAAGGACAA AGGACTAAAG ACTTCTTGAG TAAGGTTTTA TAAGTTAGCT 1380 

TTGTTTAGCT ATTTGTAGCC AGCTTTAAAC GTTAAAGAGA AGATTAGTGA AAAGCTCAAC 1440 

CAGAGCTTTT TCTTATAGTT TAAAGCTATA GGATTGCCTA GGAAAGAAGT GTTAGAGCTA 1500 

CATTGTATTT TTTGGTATAA TTAAAGATAT TTGTAAGAAA AGAGAAGTGA TATGACACAG 1560 

ATTATTGATG GGAAAGCTTT AGCGGCCAAA TTGCAGGGGC AGTTGGCTGA AAAGACTGCA 1620 

AAATTAAAGG AAGAAACAGG TCTAGTGCCT GGTTTGGTAG TGATTTTGGT TGGGGACAAT 1680 

CCAGCCAGCC AAGTCTACGT TCGCAACAAG GAGAGGTCAG CCCTTGCGGC TGGTTTCCGT 1740 

AGCGAAGTAG TACGGGTTCC AGAGACCATT ACTCAAGAGG AATTGTTAGA CCTGATTGCT 1800 

AAATACAATC AGGATCCAGC TTGGCATGGG ATTTTGGTTC AGTTGCCATT ACCAAAACAC 1860 

ATTGATGAAG AGGCGGTTCT ATTGGCTATT GACCCAGAAA AGGATGTGGA TGGTTTCCAT 1920 

CCTCTAAACA TGGGGCGTCT TTGGTCTGGT CATCCAGTCA TGATTCCTTC GACACCGGCA 1980 

GGAATTATGG AAATGTTCCA TGAATATGGG ATTGACTTGG AAGGTAAAAA TGCAGTCGTC 2040 

ATCGGTCGAT CCAATATTGT CGGAAAACCT ATGGCCCAGC TTCTTTTGGC AAAGAATGCA 2100 

ACAGTAACCT [missing] Ac-UTACTCAT AATCTTTCCA AGGTGGCTGC AAAAGCAGAT 2160 

ATTCTGGTTG TTGCAATCGG TCGTGCCAAG TTTGTGACTG CTGACTTTGT CAAACCAGGT 2220 

GCGGTAGTCA TTGACGTTGG GATGAACCGC GATGAAAATG GTAAGCTCTG TGGGGATGTT 2280 

GATTATGAGG CGGTTGCCCC ACTTGCTAGC CACATTACGC CAGTCCCTGG AGGTGTCGGT 2340 

CCTATGACCA TTACTATGCT GATGGAGCAA ACCTATCAGG CAGCACTTAG GACATTGGAT 2400 

AGAAAATAAG ATAAAAATTT TCTGAGGAAA GTGTATTTTC TATAGCTATA TCTAAAATGA 2460 

TAGAAATGAA TATTAAATTT TAGAAATAAG TTTATAAAAG GAGGTTTGCG CCTCCTTTTT 2520 

GTTGTATAAT GGAGTGAGGT GATTAGATCA TTTTAAAAAT TTATAATGGG GAATATAGTT 2580 

TACAATGGGA TGGAATATAC TACTTAGCAC TAATTGATTA TCCAAATATT CAAGAGTGGG 2640 

AATTAGAAAA AATTGCTAAA TTTATAGCTT ACGAAAAACT TCATAAACGT CAAACAAGTA 2700 

TTGAGTGTGC TGATTCTTGT TTAAA7VAAAG AAATTTTAGA TTACATCTGT CAGCATCCCT 2760 

TTCTGCCACC ATTTACTCCT ACAGATAAAA GAGTAGCCTC GACTTATGAC CTACATAAGA 2820 

GGTTAGTGAC TTCAGACTAC TGTAGTCATA CTACGACTAT AGATGCAGCG ATTTCTATTT 2880 
TTAAAACTGG TCGTCTTTTA TCTGCTGTGA AAGCCTTTGG GCGAGATGCT GAGGAGTTGG 2940 

TTTTGGATAG TCGAAATGCT GCATCTGATC CX5ATAGATTA TTTTGACTAT GTCATGTTAG 3000 

GGTGGTCAAA TACAAGTTCT GGTTATCGAT TGGCGATGGA GCGTTTATTA GGTCGAGCTC 3060 

CTTCAGAGAA AGAATTACAA GACAAGTTTA TTCCTGGAGT AAGTTTTCAT TTTATCTATA 3120 

CAGATTTGAT TAAAGTTCCT GGTTATATTT TTGATGGTTA CCATGCTGTA AAAATTAAGG 3180 

ACATGCTTAA TTTATTAAGT GAGTTGTATA TTTGCATTAT TCCAACTCAT AATAAGAGCC 3240 

AATTTGAAAA TATTATTCCA ACCAAAATAC AAGATAGGGT GTATTATCTT GACTATGCTG 3300 

GAGAAGACTT AGAAGAGTGG ACTAAGAAAG TCTATCAAGT TGTTTTAAAA CAATCAGATA 3360 

AAGGATAGTT GAGGAAAAAA CGATGAAAGT GATTGATCAA ACCTTACTAG AAAAAGTCAT 3420 

TATTGAACGT TCTTGTACAA GTCATAAAGG AGACTACGGT CGTCTGCTGT TGCTTGGTGG 3480 

GACTTATCCT TATGGTGGTG CCATCATCAT GGCTGCTTTA GCAGCTGTAA AAAGCGGTGC 3540 

AGGATTGGTA ACCGTTGGAA CGGACAGGGA AAATATCCCT GCTCTACACA GCCATTTGCC 3600 

TGAGGCTATG GCCTTTTCTC TGCAAGATCA GTAATTGTTA CAAGAGCAAT TGGAGAAGGC 3660 

AGAAGTTGTC TTGCTGGGGC CTGGTTTACG AGACGATACG TTTGGAGAAA ATCTTGTAAA 3720 

ACAGGTCTTT GCTAGCTTAA AAAAGAATCA GATTTTGATT GTAGATGGAG GGGCCTTAAC 3780 

CATCCTTGCT AGGACAAGTT TGTTGTTTCC ATCTAACCAG CTTATCTTAA CTCCCCACCA 3840 

AAAAGAATGG GAAAAACTGT CTGGTATTGC TATTGAAAAG CAAAACGAAG GTACAACATC 3900 

TAGTGCCCTG ACTTCTTTCC CTCAAGGAAC AATTTTGGTA GAGAAAGGTC CAGCTACTCG 3960 

TATTTGGCAA GTTGGCCAGT CTGATTATTA CCAGTTAAAG GTTGGCGGTC CCTATCAGGC 4020 

GACTGGTGGT ATGGGTGATA CACTGGCTGG AATGATTGCA GGATTTGCAG GCCAATTTCG 4080 

ACAGGCCAGT CTCTACGAAC GTGTGGCAGT AGCAACCCAT CTTCATTCAG CCATAGCCCA 4140 

AGAACTATCT CAAGAAAATT ATGTGGTCTT GCCGACGGAA ATTAGTAATT GTCTTCCTAA 4200 

AGTAATGAAA AGATATGTCT AAAATAGTTA GACAAAAAAT GTTGATAArr TGTATCATTA 4260 

TTCTTAATTC ACAAAAAACG AACGTTTAGT ATTCTTCTTG CTAAGAAACT AAATTTGTTC 4320 

GTTTTTTTAC TCTTGTAAAT CTATTTTTGT TAGAGTTGAT TTGGTTTACA TCCGTACTTA 4380 

AATTGATTTG TTAGAGCTCT ACTTTTATTA AAAAAATTCA ATTTCAAGGA TAAATAAGCA 4440 

GTATTCTAAA GGTACTTTTA GATGAAATAA AAGCCTTTAC ATGGTATAAT AGAGGTAGCT 4500 

CTTTAATGGA GGTGTTTGAG TGGAAAATCT GAAGAAAATG GCAGGTATCA CGGCTGCTGA 4560 

ATTTATCAAG GATGGGATGG TTGTAGGGCT AGGAACAGGT TCTACTGCCT ATTATTTTGT 4620 

CGAAGAAATC GGTCGTCGAA TCAAGGAAGA AGGCTTGCAG ATTACAGCTG TGACGACTTC 4680 
TAGTGTGACC AGTAAACAGG CTGAAGGGCT CAATATCCCG CTCAAGTCTA TTGACCAAGT 4740 

AGACTTTGTC GATGTGACAG TCGACGGGGC GGATGAAGTG GATAGTCAGT TTAATGGAAT 4800 

CAAAGGCGGT GGTGGTGCCC TTCTCATGGA AAAGGTGGTC GCAACACCAT CAAAAGAATA 4860 

CATTTGGGTG GTGGATGAAA GCAAGCTGGT CGAAAAACTA GGTGCTTTTA AATTGCCAGT 4920 

AGAAGTGGTT CAGTATGGTG CAGAGCAGGT CTTTCGTCAT TTTGAACGAG CTGGCTACAA 4980 

ACCAAGTTTC CGTGAAAAAG ACGGCCAACG TTTTGTGACC GATATGCAGA ATTTTATCAT 5040 

TGACCTCGCC TTGGATGTCA TTGAAAATCC AATTGCTTTT GGACAAGAAT TGGACCATGT 5100 

CGTTGGTGTT GTGGAGCATG GTTTATTCAA CCAAATGGTG GATAAGGTAA TCGTTGCTGG 5160 

ACGAGATGGA GTTCAGATTT CAACTTCAAA AAAAGGAAAA TAGAAGGGGG CATAAGATGT 5220 

CTAAATTTAA TCGTATTCAT TTGGTGGTAC TGGATTCTGT AGGAATCGGT GCAGCACCAG 5280 

ATGCTAATAA CTTTGTCAAT GCAGGGGTTC CAGATGGAGC TTCTGACACA CTGGGACACA 5340 

TTTCAAAAAC AGTTGGTTTG AATGTCCCAA ACATGGCTAA AATAGGTCTT GGAAATATTC 5400 

CTCGTGAAAC TCCTCTTAAG ACTGTAGCAG CTGAAAGCAA TCCAACTGGA TATGCAACAA 5460 

AATTAGAGGA AGTATCTCTT GGTAAGGATA CTATGACTGG ACACTGGGAA ATCATGGGAC 5520 

TCAACATTAC TGAGCCTTTC GATACTTTCT GGAACGGATT CCCAGAAGAA ATCCTGACAA 5580 

AAATCGAAGA ATTCTCAGGA CGCAAGGTTA TTCGTGAAGC CAACAAACCT TAfTCAGGAA 5640 

CGGCTGTTAT CTATGATTTT GGACCACGTC AGATGGAAAC TGGAGAGTTG ATTATCTATA 5700 

CTTCAGCTGA CCCTGTTTTG CAGATTGCTG CCCACGAAGA CATTATTCCT TTGGATGAAT 5760 

TGTACCGTAT CTGTGAATAC GCTCGTTCGA TTACCCTTGA GCGTCCTGCC CTTCTTGGTC 5820 

GCATCATTGC TCGCCCTTAT GTAGGTGAAC CAGGTAACTT CACTCGTACG GCAAACCGTC 5880 

GTGACTTGGC TGTATCTCCA TTTTTCCCAA CTGTTTTGGA TAAATTGAAT GAGGCTGGTA 5940 

TCGATACTTA TGCTGTGGGT AAAATCAACG ATATCTTTAA CGGTGCTGGT ATCAACCATG 6000 

ACATGGGTCA CAACAAGTCA AATAGTCATG GAATTGATAC ACTATTGAAG ACTATGGGAC 6060 

TTGCTGAGTT TGAAAAAGGA TTCTCATTCA CAAACCTAGT TGACTTTGAT GCCCTTTACG 6120 

GCCATCGTCG TAATGCTCAC GGTTACCGTG ATTGCTTGCA TGAGTTTGAT GAACGCTTAC 6180 

CTGAAATTAT CGCAGCTATG AGAGAGAATG ACCTTCTCTT GATTACTGCG GACCATGGAA 6240 

ATGACCCAAC GTATGCAGGA ACGGATCACA CTCGGGAATA TATTCCATTG TTGGCCTATA 6300 

GCCCTGCCTT TAAAGGAAAT GGTCTCATTC CAGTAGGACA TTTTGCAGAT ATTTCAGCGA 6360 

CTGTTGCCGA TAACTTTGGT GTGGAAACTG CTATGATTGG GGAAAGTTTC TTAGATAAAT 6420 
TGGTATAAGA TGACGCGCTA TGCTTTGCTG GTGAGAGGTA TCAATGTTGG TGGTAAGAAT 6480 

AAGGTCGTCA TGGCGGAGCT TCGTCAAGAA TTGACAAACT TGGGACTGGA AAAGGTTGAG 6540 

AGCTACATCA ATAGTGGCAA TATTTTCTTT ACTTCGATAG ATTCCAAAGC CCAATTGGTT 6600 

GAAAAGCTAG AGACTTTCTT TGCAGTCCAT TATCCATTTA TTCAGAGCTT TTCTTTACTG 6660 

AGTCTAGAGG ACTTTGAGGC GGAACTTGAA AATCTACCAG CTTGGTGGAG CAGAGACTTG 6720 

GCACGAAAAG ATTTTCTCTT TTACACTGAG GGTTTGGATG TGGACCAAGT CATCGCGACA 6780 

GTTGAAAGTT TAGAGCTGAA AGATGAAGTG CTTTATTTTG GAAAACTTGG GATTTTCTGG 6840 

GGGAAATTTT CTGAAGAATC CTATTCTAAG ACTGCCTATC ATAAGTACTT GCTGAAGGTG 6900 

CCTTTCTACC GCCACATTAC TATTCGTAAT GCTAAAACCT TTGACAAAAT TGGTCA;UVTG 6960 

CTAAAAAAAT AATAAAGGAG ACACACAATG ACATTTTTAA ACAAAATCCA TGAAACTGCT 7020 

ACTTTCCTGA AAGAAAAGGG AATTGCAGCC CCTGAGTTCG GTCTAATCCT TGGATCAGGA 7080 

CTTGGAGAAT TGGCAGAAGA AATCGAAAAT CCAGTTGTAG TAGACTATGC TGAGATTCCA 7140 

AACTGGGGCC GTTCAACAGT AGTCGGTCAT GCTGGTAAAT TGGTATATGG TGAACTGGCA 7200 

GGTCGCAAGG TCTTGGCTCT TCAAGGGCGT TTCCATTTCT ATGAAGGGAA TCCTCTGGAA 7260 

GTGGTGACTT TCCCAGTTCG TGTGATGAAA GTTCTTGGAT GTGAAGGTGT TATTGTAACC 7320 

AATGCAGCTG GCGGTATCGG ATTTGGTCCT GGTACCTTGA TGGCTATCTC AGACCATATC 7380 

AACATGACGG GGCAAAATCC ATTGATGGGT GAAAACTTGG ATGACTTTGG CCCACGTTTC 7440 

CCAGATATGT CTAGGGCCTA CACACCAGAA TACCX5TGCCA CTGCCCATGA AGTGGCTAAA 7500 

AAACTTAATA TCAAGCTTGA TGAAGGTGTC TATATCGGAG TTACTGGTCC GACTTATGAA 7560 

ACACCAGCAG AAATTCGTTC CTATAAGACA CTGGGAGCAG ATGCAGTTGG TATGTCTACG 7620 

GTTCCTGAAG TTATCGTGGC AGCCCACTCT GGCTTGAAAG TTCTGGGAAT TTCATGTATC 7680 

ACTAACTTTG CGGCCGGTTT CCAAGAAGAA CTCAATCACG AAGAAGTTGT AGAAGTGACT 7740 

GAACGTGTTA AAGGTGATTT CAAAGGCTTG CTTAAAGCGA TTCTTGCTGA ATTGTAAGAA 7800 

AAAAGATTTA AAAGGGGGAG TGCCTCTGTT TTTTCAGGAT TGACTGCCTA TCCGGATTAA 7860 

AGAAGAAACA GAGGAATACT ATGAGCTTCT TCCTGCTCTT ATAACTGAAA GAAGCGGAAG 7920 

AATAGGTATG TCTGATCTGA TAGCCAGCAT TGTGAAAGAC AAGATTCTAG GATACTAGCA 7980 

TTAGCTTCCT AGCCAAGCAG ACTAGTATGA TAAGGAGAGA TGAGAATGAA TTGACTTTCT 8040 

GAATTTCTCA GTCTTATCAT ATATAGCACA ATGAGATTTC GCTTGAGTCT GCTTGTAAAT 8100 

AAACGAAAAG AAAGATAAGA AATAATGAAA ATTGGTCAAC GAATTATGCG CTTTGGCATA 8160 

AAAAATTAAG TATCGGAGTT GTATCTGTTG TAGTCGGCTT TGATTTCTAG CTCCAGCTGG 8220 
AATTTCAGCC AATGAAGTAA AGCAAGATGT AACATCTGAA GTGGTAATAG GTGTGCTAGA 8280 

TTCTAAGGAG GAATTGAAAG AGTCAGAAAA TGATGCTCCA AAACTAGAAA CTCCTCTTAG 8340 

AGAGGAGCCA AGACTAGCTC CTCAAACGCT TCCGGAAGCA AGTGAAGTTC TTGAAAACAA 8400 

AAGGGAAGAG TCAAAAGTAG AGATAACATA ACCAGCTCAA GCGGATGATA TCCGCAAGGT 8460 

TGTTGGGGAA TTAGCCAAGG ATATAAGTAT TACTAAGTTG TATATGACAG GTCATTCTCT 8520 

TGGATGTTAC CTAGCTCAGA TTGCAGCGGT TGAAGCTTAC CAAAAATATC CTGATTTTTA 8580 

TAACCATGTA TTGAGGAAAG TGACAACTTT CAGTGCTCCT AAAGTGATTA CTTCCAGAAC 8640 

TGTTTGGAAT GCTAAGAATG GTTTCTGGGA TGTTGGTTTG GAAAGTCGTA AATTAGCTGT 8700 

TAGTGGAAAA ATTAAGCATT ATGTGGTTGA TAATGACAAT GTTGTGACTC CCTTGATTCA 8760 

TAATAATCGT GATATTGTTA CATTTACAGG TAATTCACGC TTTAAACACC GTTCTCGTGG 8820 

CTATTTTGAA AGTCCAATGA ATGATATTCC TAACTTTAAT ATTGGTAAAC AAGCTACCTT 8880 

GGATAAACAT GGTTATCGTG ATCCGAAATT GGATAAAGTG CGATTCTTTA AGAAACAGGC 8940 

TCTGCCTCGA TCTTCTAGTC AACCAAGCGC TGAACCAATG GAAAATATTG CCTCAGGAAA 9000 

ACAGGTTACT CAAAGTTCGA CAGCTTTCGG AGGAGATGCT AGAAGAGCTG TGGATGGCAA 9060 

AGTCGATGGT AACTATGGTC ACAATTCTGT CACTCATACA AACTTCCAAT CTAAGCCTTG 9120 

GTGGCAAGTA GATTTGGCTA AAGAAGAAAC CATTCGCCAA ATCAATATTT ACAACCGAAC 9180 

AGACACTGCC CAGGATAGAT TGGCAAACTT TGATGTCATT CTTTTAGACA GTTCTGGTAA 9240 

AGAAATTGAG TGAAAACGTA TAACATCTCC TAAAGATGTG TCAGCACT^ TTACGATTAA 9300 

CCATAAAAAA GCGCGCTATG TTCGGATTGA GCTAGAAGGC TATAATGCCC TCAGTCTTGC 9360 

AGAAGTTGAA GTTTTCTGCT TTATAGCTAC GAATGCTGAA ACGGCGACAC AAGTTTCTAA 9420 

GCCAGTTCAA CCAATCAGTC AGACTCCTGT GAAGGATAAA ACATTGACAA TTCAACACAG 9480 

TGGAGCTTAC ATTGCCCGCT ACTCCATAAC TTGGGAAGAA GTTCCAGTAG ATAAAGATGG 9540 

AAACCAAGTT GTTCGTAGTC ATTCTTGGGA AGGAAGCGGT CGCAACCAGA CTGCAGGTTT 9600 

TGTCCTCAAC CTCCCAATCA AAGAAAATAT GAGAAATCTG CGAGTTAAGA TTGAGAAAAA 9660 

GACGGGCCTA CTATGGAATA GATGGCAAAC AATCTATGAA AACAGACCAA TTTTAGCTCA 9720 

ACCCCACCGT AAAATTACCC ATTGGGGTAC GACATTGAAT TCCAAGGTGA GTGACGATGA 9780 

TGTCTTGTAA TCTGATGGTA GAATGACAGT TAGTTTGTCT AGTTTATAAG AAAGTACTAC 9840 

CTGAGCTTGA ATAGGACTCA GGTAGCTCTC TATGAAAGAA CAAAATTAAT ACTCAATGAA 9900 

AATCAAAGAG CAAACTAAGA AACTAGCCGC AGGTTGCTCA AAGCACTGCT TTGAGGTTGT 9960 
AGATAAGACT GACGAAGTCA GTCACATATA TAATCCAAGG CGACGTTGAC GTGGTTTGAA 10020 

GAGATTTTCG AAGAGTATAA ACAGAAAGGT AGAGCGCGTG TTCTAATTTG AACACGAGTA 10080 

GAAAACTTTT CTAAAAACAA AAACGAAAGG ATGGGTAAAC TGTATTCGCT GAACTGAATA 10140 

CGGGCGACTC TCCTCTAAAT CAAAATTAAG AAAGGAATTG ACCCCACCCT AAAAGTAGTG 10200 

GGAAAAAGAT AGTTGATCTA GCGAGCATCG CTCACTGCGC CCAACTCCTA TTTTCCCTTC 10260 

GCTTTTTGAT GGGTTTGGTA TCTTTCTCAA TATAAAATAT AAAATAAAGA AAGGTAGAGC 10320 

GTGTGTTTTG ATTTGAACAC GAGCGGAAAA CTCGGAAAAT AGATAATCTG ACTGAAAAAT 10380 

CAGGATTTCT CGTCAGGTTC CTAATTTTCA GTCGTTTTCT TCTCGCTCTT TGTATCATAA 10440 

ATTATGTCTA TCCATATTGC TGCTCAGCAG GGTGAAATTG CTGATAAAAT TCTTCTTCCT 10500 

GGGGATCCTC TTCGTGCTAA GTTTATTGCG GAGAATTTCC TTGATGATGC TGTTTGTTTT 10560 

AACGAAGTGC GTAACATGTT TGGTTACACT GGTACTTACA AGGGTCACTG TGTATCTGTC 10620 

ATGGGAACTG GGATGGGAAT GCCATCTATT TCGATTTATG CGCGTGAGTT AATCGTAGAC 10680 

TACGGTGTGA AGAAATTGAT TCGTGTGGGA ACTGCAGGTT CTTTGAATGA AGAGGTTCAT 10740 

GTTCGTGAAT TAGTTTTGGC GCAGGCGGCT GCAACCAACT CAAACATCGT TCGTAATGAC 10800 

TGGCCACAGT ACGATTTTCC ACAAATTGCT AGCTTTGATT TGCTTGATAA AGCCTACCAT 10860 

ATCGCCAAAA AACTTGGTAT GACTACTCAC GTTGGGAACG TTTTGTCATC TGATGTCTTT 10920 

TACTCAAATT ACTTTGAAAA GAATATCGAG CTTGGTAAAT GGGGAGTCAA GGCTGTGGAA 10980 

ATGGAAGCAG CAGCTCTTTA CTATCTTGCT GCCCAATACC ATGTTGATGC GCTAGCTATC 11040 

ATGACCATCT CTGATAGCTT GGTCAATCCA GACGAAGACA CAACTGCAGA AGAACGTCAA 11100 

AATACCTTCA CTGATATGAT GAAGGTTGGT TTGGAAACCT TGATTGCAGA ATAATTATAG 11160 

CCAAAAAGGG GCTCTTTGTC AACTGTAGTG GGTTGAAAAA AAGCTAAGCT TGAGAAAGGA 11220 

CAAATTTCGT CCTTTCTTTT TTGATATTCA GGGCGATAAA AATCCGTTTT TTGAAGTTTT 11280 

CAAAGTTCCG AAAACCAAAG GCATTGCGCT TGATAAGTTT GATGAGATTA TTGGTCGCTT 11340 

CCAGTTTGGC ATTAGAATAG TGTAGTTGAA GGGCGTTGAC GATTTTCTCT TTGTTCTTTA 11400 

GAAAGGTTTT AAAGACAGTC TGAAAAAGAG GATGAACCTG CTTCAGATTG TCCTCAATGA 11460 

GTCCGAAAAA TTTCTCAGGG TCTTTGTTCT GAAAGTGAAA AAGTAAGAGT TGATAGATCT 11520 

GATAGTGGTG TTTCAAGTCT TCTGAATAGC TTAAAATCTT GTCAAGAATT TCTTTATTTG 11580 

TTAAGTGCAT GCGAAAAGTA GGGCGATAAA AACGTTTATC GCTsArTTTA CGACTATCCT 11640 

GTTGGATGAG TTTCCAGTAA CGCTTGATAG CCTTGTATTC ATGAGATTTT CGTTCAAACT 11700 

GATTCATAAT TTGAACACGA AAACGACTCA TGGCACGGCT GAGATGTTGG ATAATATGGA 11760 
AACGATCTAG AACGATTTTA GCACACGGAA AAAGCTGTTT AGCCAAGTCA TAGTAAGGAC 11820 

TAAACATATC CATCGTAATG ATTTTCACTT GACAACGAAC GGCTCTATCG TAGCGAAGAA 11880 

AGTGATTTCG GATGACAGCT TGTGTTCTGC CTTCAAGAAC AGTGATAATA TTAAGATTAT 11940 

CAAAATCTTG CGCAATGAAA CTCATCTTTC CCTTAGTGAA GGCATACTCA TCCCAAGACA 12000 

TAATCTTTGG AAGCCGAGAA AAATCATGCT CAAAGTGAAA GTCATTGAGC TTGCGAATGA 12060 

CAGTTGAAGT TGAAATGGCC AGCTGATGGG CAATATCAGT CATAGAAATT TTTTCAATTA 12120 

ACTTTTGAGC AATTTTTTGG TTGATGATAC GAGGGATTTG GTGATTTTTC TTTACCAGGG 12180 

GAGTCTCAGC AACCATCATT TTTGAAsAGT GATAGCACTT GAAACGGCGT TTTCTAAGGA 12240 

GAATTCTAGA AGGCATACCA GTTGTTTCGA GGTAAGGGAT CTTAGACGGT TTTTGAAAGT 12300 

CATrTTTCTT CATTAGACTT CCACAATCAG GGCAAGATGG AGCCTCATAA TCCAGCTTAG 12360 

CGATAATTTC TTTGTGGGTA TCCATATTGA TGATATCTAG AATCTTGATG TTTGGGTCTT 12420 

TAATATCGAG CAGTTTTGTG ATAAAATGTA ATTGTTCCAT ATGATTCrTT CTAATGAGTT 12480 

GTTTTGTCGC TTTTCATTAT AGGTCATATG GGACTTTTTT TCTACACAAA AATAGGCTCC 12540 

ATAATATCTA TAGTGGATTT ACCCACTACA AATATTATAG AGCCCAAAAA GGAAGCCCTT 12600 

TATGAATTGT AGGACTTCCT TTTCTTATCC AGAAATTGAT CTAGCTCTCT CTGATTTCGA 12660 

AGA-*TACTGA CTTTATOTGA ATATTCTTGG CAAAGTTTTT GGTAATTTTC TTTTTGAGTT 12720 

TTGCGGACGC CCATCCCAAA GAATCCATCT GATAAACTCC CACTCAAAGC GTTCAGGGCA 12780 

ATCTACCGCC ATACTTTCTC TGACTTTTCC ACGGTATTTA AGATAACGCT TAAAGGCTCT 12840 

AAAGAGACAG GTCAATGGCG AAAAATTGAG AAAGATGATT TGGTCAGCTT CTTGCATTCG 12900 

TTCTTGGTAG TAGCACCAAG AATAATTACC ATCGATGACC CAAGCTTTAT GCTTGGTGAG 12960 

AAAGTTTTTT ATCTCGGTTA ACATCCATTC GCAGTCACTG TCTTGCCAAC CAGGTTGAAA 13020 

TTGGAGTGTG TCCATGTGCA GTTTTGGAAT GGAGTAGTAG TTAGATAACT TTTCTGCTAT 13080 

AGTTGACTTA CCAGAACCAG AATATCCGAT 7VATTGCGATT TTCATTTTCT ACCTTTTCCT 13140 

ATTTGGAGAC AAAAAAACAG CCTCTATGGA CTGTTTCTTA TTTAACAAGT TTAGCTGAAA 13200 

GACGAGCTTT ATCGCGGCTT GCTTTGTTTT TGTGAATCAA ACCTTTAGTT TCTGCTTTAT 13260 

CGATAGCTGA GCTAGCAGCA CGGAAAAGTT CTTCAGATGG GTTTGCTTCG AAAGCTTTTA 13320 

TAGCAGTACG CATAGCTGAT TTTTGAGCTG AGTTCTTTTC GATTCGTCTA ACGTTCAATT 13380 

CAGCGCGTTT GATAGCTGAT TTAATGTTTG CCAATGGTCT TACCTCCATA TTTACTAACT 13440 
(2) INFORMATION FOR SEQ ID NO: 129: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8512 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129: 
CCTTTTTTCA AAAACTAGAT ACTAGTCTAT CAAAAGTAGG AAAGGGTTTC AAGAAAATTG 60 

ATTGGAAATT TTTTGAAAAT CATAGAACTA TTAGCTAATC CCTAGTATTG AAAAGACTGG 120 

ATAGCTTCTT TCAGGTCATC TTGTAAACTA TTTCTCTGGT CAAGTTGGAC ATAGACTTCC 180 

ACCAGACAGG ATCTAAAGTT GGAAAATTTG TAAAAATCCT CCCTTTCTTC TATCGGAAAA 240 

TCAACAGTTT TTATCCAAGA AGCTACTTGT TCTTGCTCCA ACTTCCCTTG TAAAATAGGT 300 

TCATAGATCA CTCTTGCTAA ACGCCAATCC TCATCATCTG TAAAGCGAAT CGACATTCTT 360 

TTAAATAGTT GGCCAAGTAT ATCAAATACT TCATGAACTC TGTTTTTAGG AAAGTCTGGA 420 

TGACAAACCA CCTCTGTCAG TAAATCGGCT CCATGTGCAA AAGCGTGAAC CCAACCATAC 480 

TGACTTGAGA AACCCCTTGT ATCCTTTTCT TTTGAAAGAT AGTGCAAGCC TTGATTTAAA 540 

AGGACATTAC GAATTTCTGG AGAAGGATTT CCCAAATGAT CAAACAACCA CTGGATTTCT 600 

-TTTGGTTT TTCTTCTCCT ?-TTTTCTT?. CT?J^J^.TCTTG ATACATC-GTC 660 

AATACCTCTA CATTTCTAGC AACTGTTCAA AAAGGCAGTC TTAAATGACT CAATATTGAA 720 

TTCTCAATTA AATACAATCT GATATAAAAT GACGTAAATA ACTATCAATA CCAGTTCTAC 780 

AGTAAGTTCA AATTTAACAT CACGACCTTC AACGACATTT TTGAAAATAG CTACAACTAA 840 

GACAAATAGA ATGACGCTTA ACAAGCCCAT AAACATCATT CTAAAAAATT TTTCTATTCC 900 

CCTACTCTCC CAACTCAGCA CTATAGGAGA TAATCTGGTC AACTGTGTCA GACAAGAATT 960 

GGATGGTATC ACGGAGTGGT TTGTCTGTTG AAATATCAGC ACCGATAATC ATGGCTGACT 1020 

CAAGTGGTGT CTTGCTACCA CCTGATTTGA GGAGATTGAG CCAGTCTTCA GCTCCAGTTT 1080 

CAGAATGTTT TAGATGAAGG TAACCAGCAG TCGAGATAAC TAGTCCTGCT GAGTAAGTGT 1140 

AACTATACAA GCCCATATAG TAGTGAGCTT GGCGCATCCA AGTCAGAGTT GCATCATCGT 1200 

CAATTTCAAT AGCATCTCCC CAGAAATCCG TCAAAACTTC CTTCATAATG CTGTTGAGCT 1260 

TGCTTGCTCC AAAGGTCTCC CCTTCTTCAA TCAATGTATA CACCTTACGC TGGAAGGCGG 1320 

CTTCCAAGAG GTGGGTGATA AAGTTATGGA AGTAGGTGTC TGTCAAGCGA TGAGCCAGAG 1380 

CGAAGCGTTT TTGACGTGGG TCATTAGACT GGTTCTCCAA GTAATCACTG AGTAGCAATT 1440 

CATTGAAGGT TGACGGTGCT TCAACATAGT AGGTCGACAT ATGGGCATTG AAGTAACTTT 1500 

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GATGATTGTC 


TGAAAAGATG 


AATTGACCAG 


AATGCCCGAT 


TTCATGAATC 


AAGGTATAGA 


1560 


CATCGCTCAA 


ACGGCCTGTC 


CAGCTCATGA 


GTACATAAGG 


GTGTACGCGA 


TATGGGTCCG 


1620 


CCGCATAACC 


ACCGGAATCC 


TTGCCACTGT 


TAGCAGCAAA 


GTCCACCCAG 


CGCTCTTCTT 


1680 


GGTAACGAGC 


AACTTCCTGA 


CAATATTCTT 


GCCCCAAAGG 


TTCTACCGAC 


TTCATGACCA 


1740 


AATCATAGGC 


ATCGTCAATA 


GTCACTTCAG 


GATTCAGGGC 


GCTGTCCAAG 


TCCAATTTCC 


1800 


AGTCTGCAAA 


GGTCATCTTT 


TCAAGACCAT 


TTACCTTGGC 


AACATGCTTG 


AGGTATCTCT 


1860 


GAGCGACTGG 


TGCAAAGTCC 


TTCATGATGA 


GGTCAATCTG 


GCGGTCAAAC 


ATGACACGGT 


1920 


CCACTTCTTG 


TTCAGCTAGA 


AGATAGTCAA 


AGACAGAGTC 


GTATCCCTTC 


ATATCAGCCA 


1980 


AGAGTTTTTC 


AGACTTGACC 


TGAGCCAGAT 


AGGCTGCTGC 


AGCCGTATTT 


TGGTGCTTAC 


2040 


GAAGTCCCTC 


TGAGAAGGAA 


CGGAAGGATT 


TCTCACGAAC 


CTCAGCATCC 


TCATGGTTTT 


2100 


GGTAGAAATT 


CTCATAGGTC 


ACAAAGCTGT 


TTTTGTAGGT 


CTTGCCATGG 


GCTTCAAAGT 


2160 


CAGCCATTTC 


AAAATCCCCA 


GCTCGCATCT 


TAGTATAAAT 


GTCCTGCGGA 


CTGTAGAAAA 


2220 


CTTCACCGAG 


ATTTGTCAAG 


GCCTTCTCCA 


CATCTGCCCC 


TAAGTAGTGG 


GCTTTTTTGA 


2280 


TTTTAGCCTG 


ACGAATGGCA 


GCTGTTAAAT 


GTGGCAATTT 


ACCCAAACGG 


TCCAAGACTT 


2340 


CCTCATCTGC 


TGCCACCAAG 


GCATCGTCAA 


AGAAGGTCAA 


GGCTACGCTG 


GCATCTGTTT 


2400 


CTiAATTCCAT 


CCCAGCTTGG 


G C AAT ATTGG 


CAAATTCGTC 


AI"TGCtatAG 


TCCUTCGTCT 


2460 


GAGGCATAAA 


ACCATAGTTG 


CCAATATGGC 


TCATCTGAAT 


GTAGATCTGT 


TCCAATTCCG 


2520 


CAAAGGCCTT 


CTCGAAATCC 


TCAAAAGTGT 


GAAGATTGCC 


CTTGTAATCA 


CGGCTAAACT 


2580 


GGTTGATGTC 


TTCGCGAGCT 


TTCTCGATTG 


CACGCAAGAA 


ATCCTCACGG 


TCTTGGTATA 


2640 


GGGCTGTTAA 


GTCCCAGAGT 


TCCTTCTCTG 


GAAATTCTGA 


ACGGTGTTTT 


TGTTCCATTT 


2700 


TCTTCCTCTT 


ATTTCTCTAA 


TTCTACTAAA 


ACACTAAGGG 


CTGATAAAGC 


GTAAAGCGGT 


2760 


GCTGTTTCTG 


CTCGCAAAAT 


ACGAGGACCT 


AGGCCTGCCA 


AAACGGCTCC 


TTTAGCTTCA 


2820 


AAACTTTCGA 


TTTCTGCAGG 


TGAGAGACCG 


CCTTCTGGAC 


CAAAGATAAA 


GAGCAGTTTG 


2880 


GCTCCTGTTT 


CAAGACCAGT 


GACTGCTTGC 


AGAAGCGCAG 


CGGCTTCTCC 


TTCTTTAGCT 


2940 


GATTCTTCAT 


AGGCTACTAT 


GATAGAGTCA 


AACTGGTCCA 


GCTGAGCTAG 


AAAATCTGCT 


3000 


TTTTTCTCGA 


AAAGTTTAAT 


ACTTGGTACA 


ATATTACGCT 


TGCTTTGCTC 


GGCTGCTCCA 


3060 


AGGGCAATTT 


TTTCTAGTTT 


TTCAACTTTT 


TTACCCAATT 


TCTTGCCATC 


CCACTTGGCA 


3120 


ACTGACCAGT 


CTGCAGGAAA 


. GGCCCAGATT 


TGGCTAGCCC 


CCAGTTCGGT 


TACTTTTTGA 


3180 


GCGATGAACT 


' CCAGCTTGTC 


TCCCTTGGGA 


, AATCCAGATG 


CGATGGTCAC 


TTGGACTGGT 


3240 
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AGTTCCACAT ' 


TGTCATTTAA 


TTCTTGGACC 


AACTCAAACT 


GACGATTTTC 


CATATCCAGC 


3300 


ACGCGCGCCA 


AGCGCTTGAT 


GCCATCATCA 


AAGACTAAGG 


TAACCTCATC 


CTCTTCTTTC 


3360 


AAGCGCAT/A 


CCTGAAACAT 


ATGCTTACTG 


GTTTCCTTGT 


CCTCGATAGT 


GACAGGAGAG 


3420 


ATAGCACTGC 


CTTTTACAAA 


ATACTGCTGC 


ATGCTAGCCT 


CCAATCACAC 


CAGAGATATC 


3480 


CTTGGTTTTC 


TTAAAGACAC 


AGGTATTCCA 


TTCCCCTTGA 


ACCATGTGAG 


TTTCGAGGAA 


3540 


AAATCCAGCT 


GACTCAGCCG 


ACTGGCGCAC 


CATGTCCAAC 


TTGTCCTTGA 


TAATGCCACT 


3600 


CATGATCAGG 


TAGCCTTCAT 


CCTTTACCAA 


GCGATAAGCA 


TCGTCTATTA 


GATGAATGAG 


3660 


GATATCCGCC 


AAGATATTAG 


CCACAATCAC 


ATCTGCCTCA 


ATTTCCACAC 


CCTTAAGCAA 


3720 


ATCTCCAGCC 


GCTACATGGA 


TATTTTCCAT 


GCCAGGGTTG 


AGCTCAATAT 


TTTCCTGAGC 


3780 


CACACGAACC 


GCCACATCAT 


CCAGGTCATA 


GGCGAAAATT 


TCTTTAGCCC 


CCAGAAGCGA 


3840 


GCTGGCAATA 


GAGAGAACCC 


CTGAACCAGT 


CCCCACATCT 


AGCACCGTTT 


CGCCACCACG 


3900 


AAGAACCTGT 


TCCAAGGCAA 


AAAGGCTCAT 


CTTGGTAGTT 


GGGTGGGTTC 


CAGTACCAAA 


3960 


AGCCATGCCA 


GGATCCAGCT 


TGATAATCAT 


TTCCCCCGCA 


GTCGCCTCAT 


AGTCTGTCCA 


4020 


AGAGGGAACG 


ATGGTCAAAT 


CATGAGTGAT 


ACGAGCAGGT 


TCATAGTATT 


TCTTCCAGTT 


4080 


GTCTGCCCAG 


TCTTCCTCAG 


CCAAGGCAGT 


CGTACCTATT 


TTTAACTCTC 


CCAAATCCAT 


4140 


AAAATCTGTC 


AATTCTGCTA 


GACGAGCCTG 


CAAATCCGCC 


TCAACCACTG 


TCACATCCAC 


4200 


CGTGTCAGGG 


TAGTAGGCTG 


TCACTACGAT 


TTCTTCTTGC 


TGCTCCACCT 


CTGGGAAAAT 


4260 


CTCTCCAAAG 


CGGTCCACAT 


TTCCCACATA 


GTCCATACTG 


TCTTCGATTG 


CGACTCCTTG 


4320 


CGCTCCCAGC 


TCAATCAAGA 


GATTGGAAAC 


CAACTCCTCT 


CCCTCACGCT 


TCACTGTAAC 


4380 


TTTTAACTCT 


TGCCATGTTT 


CCATTATTAA 


TACCAAGCCC 


GTAAAACACA 


AAACCAAAAT 


4440 


AGGAAATTCT 


CTGAAGACGC 


TTGTGTCTAA 


GAGAAGTTTA 


TCTTTTTGGC 


ACAGTGTTTA 


4500 


GGGCGGGTTC 


AGTTTAGAAA 
4560


ATCTTTTAAT 


CTCTCTTGCA 


ACTGAGGCAC 


AACTTGACTG 


GAACTAAGAA 


ATTCCTCAAC 


4620 


ATTCATCAGC 


TGATAGCCCT 


GTCCTTCATC 


TCCGAAGATG 


ATATTGTCAA 


ATTGTTCTTG 


4680 


TCTTAGCTGA 


CCAACCATAA 


AGACCGATTT 


CTTGCCTTTA 


AAAATTACGC 


TAGGATAAAT 


4740 


CTTGCTCCAA 


AGCAGACAGT 


CTTCATCTAA 


ATGAATTCCC 


AGTTCCTCAT 


AAACTTCACG 


4800 


CCGAGCGCAT 


TCAAAAGGGC 


TTTCGTCCCC 


TTCACGGCCA 


CCACCTGGCA 


GTTCCCACAT 


4860 


ATTGGCCCAG 


GGAATACTTG


CCTTATCATC 


GCGTAAGATA 


GTCAAAAGCT 


TATCCCCACA 


4920 


AAACAAAGCA ATCTTGCAAC 


CTGTGAAATC 


AGAAATTTCT 


AGTTCCATCT 


TCAGTTCCTT 


4980 


CTAACATTTC 


CTTTTCCAGC


TCGGCTAACC


AGTTTTCATA


ATATCTTTTC 


TCATCCCTCA 


5040 
ACATTCGACT ACTATCCATT TTCTGTCTAG CAATCTTGAG AGCCTTACGA GTTCGATCTA 5100

CATCTTTCTT CACCTTTAAT TGATACCAGG CTTGTATCAC TTGAAGATTG GACAGTTTGA 5160 

GAGACAGAAA CGATTTGACC TGTCGAATAC TAGCATATTG CTCCGCTTGC TCAAAATCTC 5220 

CTTCCAACAA GGCGATATGA AGCAGGGATA GTTGGGCAAC TGTCTGCATC ATCGGAGTAG 5280 

TTGTCCTCTC AAGTAATGCT TGAAACTGCT GTTTAGCTAC TTCTTCCTTC CCTTCCAAAA 5340 

TGGAAACTTC ACCTTGCATA CCTAATACAC CATCCGCAAA ACTCCCTCGT GCATCCTCAG 5400 

GAACTGCTTG AACAAAGTCT TTCAAATCAT ATTCTTGAGG AGCTAGCAAG GTCTGGGCAG 5460 

AATGTCTCAA TACCAGGTAG GCGTATTTGG TATTTTCAGG GTGTTGTAGT AATTCCCAAA 5520 

TTTTTGCTCC ATCGGTGATG TCGACTGGCA AAATGTTATT TAGGAAGAAA GATAAATTAA 5580 

GAAAAATCCA AGTCCCTGCA AAATACCAGC TTCTTGTCAA AAATCCAAAC AATATCGCCA 5640 

ATAATATCAA GCCGAGATGA ACCATCAAGC CTCCTGAAAG CATCAGGATG ATTCTTTGAT 5700 

CGCTTTCATC CTCTTTTAAA CCAATGTATT GAGCACCAAC ATTTTTCAGA ATGGCTGTTC 5760 

TACTAAGATG AAACCTGCCT GACTTTTTGG TCAAAATAAA ATGTCCTAAT CCAAAAGCCA 5820 

CCAGCCGATA GCCTGTCAAG TAGCCACAAA AAGCATGACC CAGCTCATGA AGAATAAAGA 5880 

TTAAATACAT GCTTAGAAGA GCGAAGGCAT AACCAAAAGT AAAGGCTAAA ACTGCGGAAT 5940 

ACCCCAACTC TGCAAATGCG ATTGTTCCAC AAGrAAAAGC TAGa^TAATA AAGACAACAC 6000

CTAGCACATA AACCAAATAA GTCCCAATTT TCTTCATAAC ACCTCCAACC AACTCCTAGT 6060 

ATCTTGGATA AGGATAAAAT TCTCCCTTTT CCAAGCCAAT TTTTCCTTCT TCAAAGACTT 6120 

CTTGGTTCCA TTCCATGACA AATTCCTCTG CTTCTGGGTC TTCCAAAAAG TCCATGAGGA 6180 

CATCTAGCCC AACCTCAGCA GTATCTTTAA GGAAAAGCGC AAAATAAGCT AAAAATTCAC 6240 

GGGAAAATCC TTTTTTAGGC AGGTAAGGAA TAACAGTCAA ATAGTCTTCC TCATTGACTG 6300 

TTGACTTGGC AGGATTGTAG AAAAGGACCG CTTCCTCAAA AAGAATGTCA TCTGATGAAA 6360 

CCTCTCCGTC TTCATCCACC ATCTCCACAC CGCAGCATTT TGCGCTTCCA ATAGAAAACT 6420 

CACTTCTACC GCATGGTTGC GTTTGTCCCA GCTAATCTCA AAGTCAAAGG GAAAGTTCTT 6480 

GTCCAACTCT TCCTCTAAAA TATCTAAAAA TCCGTATGTT GCCATTTTGT CCTCTTTCTA 6540 

TGCGACTCTT TAATCGCCCC GATTGCTCGG AAATATGCTA AAATAGATAC TACCATCTTA 6600

CCACAAAATT ATTTTATGTC CTAATTATAC CATATTACCT CATTTAAACC CTTGGTATCA 6660 

GTGATTTTCT TAAAAGTCTG ATTTCTTCAT TTCTCATAAA AATCAATATA AAAAGCCCTC 6720 

GAAAGGGCTA ATAAATCTAT AAAATCAATA GGCGAGTAAC TAGCACAAGT GGACGTGCTT 6780 
TTTTATTGAC


TATTACCACG 


ATACCACGCT 


TAATCTTAGG 


CTTGAACTTT 


CTTATCTGCA 


6840 


ATAGCGTCTG 


TCAAAGTCTG 


AGAAAAGTTA 


AGCCCCATTT 


CTCGTCCCAA 


CTTATCTGCC 


6900 


CATTTTGGTA 


TGGTCAAAGT 


CTTTTTAATG 


GGTTCCTGAC 


TTCCTAGGTA 


TTCTGATACA 


6960 


TCAACAGATA 


CCATAGAAAT 


AAAAGATTTA 


TCAAGGTCAT 


AGGTTGACAC 


GAAATCTTCA 


7020 


TCATCTTTAA 


AAGGATCATT 


ATCAATTAAA 


GACAAGCTAT 


TGATATCTGA 


TGGCTGAGGT 


7080 


AACTCTCCAT 


CACTCTCTAT 


CAAATCTGCA 


ACAGTTATCC 


CTAGCCACTC 


CGACCCCATA 


7140 


GCCAAAGCCT 


CAGAAATCCC 


CTCTCCTTGT 


GTAGCTGAGT 


ATTCAAAATC 


TGGGAAATGG 


7200 


ACAAAATAAG 


TCGCTTCTGT 


TCCGTCTGTG 


TCGTCATAAT 


AAAATAAAGC 


TGGATACGTA 


7260 


ACTAACATTT CACTACCTCC ATATCAAAAA GCAGGGACTG AATTTTACAA CCCAGCTTGC 


7320 


TTTCTTATCC 


CTCTTTCAGT 


GTACTTATTC 


AGCTCACCAT 


GAAGGATTGT 


GATAGGTCTT 


7380 


TCCCCTTGCT 


TTTCCATTTT 


AATATGGGAG 


CCTTTACCGC 


CTCTAGTCTT 


TATCCAACCA 


7440 






^ i ^ 1 1 1 1 


TGTGTCATAG 


GCATAGCGCT 


TTTACCTCCT 


7500 


GACAACACCA 


TTATAACACG 


TGTTACACGT 


ATTGTAAAGG 


AGTGATACTT 


ATTATTCTAT 


7560 


TATACATAAA 


AGCCCCTAGA 


TGTGGTTCTA 


AGGGAAGCCA 


ATTTATTCAT 


ACCTATTTTT 


7620 


CTAATGAGTA 


GTAAAAACTG 


CTTCTTTATC 


GAGCAATTCA 


TCATCTGTAT 


AGTCAATTGT 


7680 


AAAAGTATCT 


CGATCTAAGA 


CAGATTGAGG 


CGGAGTTGAA 


TGAATCATAG 


GAACACTGCG 


7740 


TACTCTATAT 


TTTTTATCTC 


CAATTTTTAC 


AAACTGATAC 


TCTTCGAAAA 


TCAAATTCAA 


7800 


ACCACGTCAA 


CGTCGCCTTA 


CCGTACTCAA 


GTACAGcCTG 


CGGCTAGTTT 


CCTAGTTTGC 


7860 


TCTTTGATTT 


TCATTGAGTA 


TGATTAACTC 


TCAAGTCTTC 


GAAATCAGGA TTTTCAACAG 


7920 


TTATTACAAG 


GAGGCGATTT 


ACTACTTCAA 


AAACATCAAT 


TATTCTATTT 


TTCATATTTT 


7980 


TTCAACCCAT 


TATTAGAATG 


AACTTCTTGG 


TAAGCAAAAT 


CAAGTTTAGA 


TTTAATGTTT 


8040 


TCGTACAAAT 


CTAAAATCTC 


TTTTGGAGTA 


TCTTCCCGGA 


AGAAAAGTTT 


TCTTTTCCCT 


8100 


GAAATAACTT 


GATCACTAAG 


AATCCAATGA 


CGAATTTGTT 


TTGTAAAAAT 


CAAAATTTCC 


8160 


TGACTTGGTA 


GTTCCATCAT 


TTCCATTGCT 


TATCACCTCT 


CTTTTCATTA 


TAGTTCATAC 


8220 


AATGACATTC 


AGCAATATTA 


TTTCTCAAGT 


CAGCACTTCC 


ACTTCTTTAG 


GCTCAACTAT 


8280 


CCTATTTTGA 


GCTTTAAGGA 


AAATCAAATC 


TCTCATGCTG 


ATACCTCTCC 


TCATTAAATT 


8340 


AAATAGTAAA 


AAAGATTCTA 


TCTCACTCCC 


TGATTATTAC 


AAAACCATTG 


AAATATCACA 


8400 


ACTAATAGGC 


TAGAATGGAC 


ATAGTAAGAT 


ATAGTAGATG 


AGTCATTCTA 


CTCAAATCCA 


8460 


CGTTAGAAAG 


GACTGCTATG 


CCAGACAATC 


TCGCCGTTCG 


CATGCGCCCn 


GG 


8512 



(2) INFORMATION FOR SEQ ID NO: 130: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2869 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130: 



CTCGTTTCAA 


GGTTGAGTCT 


CTTGCAAATC 


TTGTTCGCGT 


TCTTCCTTTT 


GCCAAGGCAT 


60 


CTCTCCCATG 


GTTGGTGCcA 


GCCATTGTTG 


GAATCTTGCT 


CTCATTGGTT 


CTACCAAACA 


120 


AGCAAGAAAG 


CGATGTTTTT 


GAAATGGAAT 


AATCACTTAA 


ATCACTTTTG 


TAGCCAAGTC 


180 


TACAGGAGTG 


ATTkTCTTTT 


TTTATCCGAT 


GATAAATGTG 


TTATAATAGG 


TAGCGAAAGA 


240 


GGTGAAGAAA 


TGAATCAAAC 


AGTAGAATAT 


ATCAAAGAAC 


TGACAGCCAT 


TGCGtCGCCA 


300 


ACAGGCTTTA 


CTCGTGAGAT 


TGCGGACTAT 


TTAGTCAAGA 


CTCTAGAAGG 


TTTTGGTTAC 


360 


CAGCCGGTTC 


GCACATCCAA 


GGGCGGTGTC 


AATGTAACTA 


TTAAAGGTCA 


AAATGATGAG 


420 


CAACATCGCT 


ATGTGACTGC 


CCATGTAGAT 


ACGCTTGGTG 


CTATTGTCCG 


TGCTGTCAAA 


480 


CCAGACGGCC 


GTCTCAAAAT 


GGACCGTATC 


GGTGGCTTTC 


CTTGGAACAT 


GATTGAAGGA 


540 


GAAAACTGTA 


CCATTCATGT 


GGCTAGCACA 


GGTGAAAAAG 


TATCAGGAAC 


CATCCTCATC 


600 




r'mfpQ CC ATGT 


CTATAAGGAT 


GCAGGAACTG 


CAGAaCOCAC 


GCAAGACAAT 


660 


ATGGAAGTGC 


GTTTGGACGC 


CTUUVGTAACT 


AGTGAAAAAG 


AAACTCGTGC 


TCTTGGCATT 


720 


GAGGTCGGTG 


ATTTTATCAG 


TTTTGACCCA 


CGAACTGTCG 


TGACAGAGAC 


AGGTTTTATC 


780 


AAGTCTCGCC 


ATTTGGATGA 


CAAGGTCAGT 


GCGGCGATTT 


TGCTCAATCT 


CCTTCGCATT 


840 


TATAAGGAAG 


AGAAGATTGA 


ATTGCCCGTA 


ACAACTCATT 


TTGCTTTTTC 


AGTCTTTGAA 


900 


GAAGTGGGAC 


ACGGTGCAAA 


CTCTAACATT 


CCTGCTCAGG 


TAGTAGAATA 


TCTGGCTGTG 


960 


GATATGGGAG 


CCATGGGAGA TGACCAGCAA ACAGACGAAT ATACAGTGTC TATCTGTGTC 


1020 


AAGGATGCTT 


CTGGACCTTA 


TCACTATGAC 


TTCCGTCAAC 


ATTTGGTGGC 


TTTGGCGAAA 


1080 


GAGCAAGATA 


TTCCATTTAA 


GCTGGATATC 


TATCCATTTT 


ATGGTTCGGA 


CGCTTCAGCG 


1140 


GCTATGTCTG 


CAGGGGCAGA 


AGTCAAACAC 


GCCCTTCTCG 


GTGCTGGTAT 


AGAGTCTAGC 


1200 


CATTCCTATG 


AGCGTACCCA 


TATTGACTCG 


GTGATCGCAA 


CAGAACGAAT 


GGTCGATGCT 


1260 


TATCTTAAGA 


GCACGTTGGT 


GGACTAATAT 


GTGCCTTATT 


TGTCAGAGAA 


TTGACCTCAT 


1320 


CAAGAAGGAA 


GAAAATCCTT 


ACTTTGTCAA 


AGAGTTGGAA 


ACAGGCTATC 


TTGTGGTTGG 


1380 


AGACCACCAG 


TATTTTGT^G 


GCTATAGTCT 


CTTTCTAGCC 


AAGGAGCATG 


TCAGCGAATT 


1440 
GCACCATTTG AAAAAGGAGA CAAGACTCCG TTTTCTAGAA GAAATGAGTT TAGTCCAAGA 1500

GGCAGTTGCC AAGGCCTTTG CTGCTGAGAA AATGAATATC GAACTGCTAG GAAATGGCGA 1560 

TGCTCATCTT CATTGGCATC TGTTTCCACG ACGGACAGGT GATATGAATG GTCATGGTCT 1620 

CAAGGGTCGT GGACCAGTCT GGTGGGTTCC CTTTGAAGAA ATGACAGCAG AAACCTGCCA 1680 

AGCAAAACCG GATGAGATTA AAAGATTAGT CAAACGTTTA TCGTCAGAAG TAGATAAACT 1740 

ATTAGAAATA AAGGAGTAGA AATGAAGAAA AGATACCTAG TCTTGACAGC TTTGCTAGCC 1800 

TTGAGTCTAG CAGCTTGTTC ACAAGAAAAA ACAAAAAATG AAGATGGAGA AACTAAGACA 1860 

GAACAGACAG CCAAAGCTGA TGGAACAGTC GGTAGTAAGT CTCAAGGAGC TGCCCAGAAG 1920 

AAAGCAGAAG TGGTCAATAA AGGTGATTAC TACAGCATTC AAGGGAAATA CGATGAAATC 1980 

ATCGTAGCCA ACAAACACTA TCCATTGTCT TVAAGACTATA ATCCAGGGGA AAATCCAACA 2040 

GCCAAGGCAG AGTTGGTCAA ACTCATCAT^ GCGATGCAAG AGGCAGGTTT CCCTATTAGT 2100 

GATCATTACA GTGGTTTTAG AAGTTATGAA ACTCAGACCA AGCTCTATCA AGATTATGTC 2160 

AACCAAGATG GAAAGGCAGC AGCTGACCGT TACTCTGCCC GTCCTGGCTA TAGCGAACAC 2220 

CAGACAGGCT TGGCCTTTGA TGTGATTGGG ACTGATGGTG ATTTGGTGAC AGAAGAAAAA 2280 

GCAGCCCAAT GGCTCTTGGA TCATGCAGCT GATTATGGCT TTGTTGTCCG TTATCTCAAA 2340 

GGCAAGGAAA AGGAAACAGG CTATATGGCT GAAGAATGGC ACCTGCGTTA TGTAGGAAAA 2400 

GAAGCTAAAG AAATTGCTGC AAGTGGTCTC AGTTTGGAAG AATACTATGG CTTTGAAGGC 2460 

GGAGACTACG TCGATTAATA CTCTTCGAAA ATCTCTTCAA ACCACGTCAG CGTCGCCTTA 2520 

CCTACTGACT GCGTCGGTTC TATTCACAAC CTCAAAACAG TGTTTTGAGT cGATTCGTCA 2580 

GTTTTATCTG CAACCTCAAA GCTGTACTTT GAGCAstGCG GCTAGCTTCC TAGTTTGCTC 2640 

TTTGATTTTC ATTGAGTACA AAAAGTAAAC TTTTCTCTTG CAATTCCAGA TAAATAGTGT 2700 

ATAATGGATG GGTATGTGAA AAACATACTT GTGGGAGGTA AAAATCTCTA ATTACCGCCA 2760 

AAACCACAAA GGAGGATTTA AAAATGGCTA AAAAAGTCGA AAAACTTGTA AAATTGCAAA 2820 

TCCCTGCTGG TAAAGCTACA CCAGCTCCAC CGGTTGGACC TGCTCTTGG 2869 
(2) INFORMATION FOR SEQ ID NO: 131: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6186 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131: 
CTGAATCCCT


TATAGGAGTC 


CAGTAACTTT 


TTAGCCTCTA 


CTTTGCCTTC 


ATAGGCAGCT 


60 


TCAACATCAT 


TAAAAAAAGA 


ArGCACTGAA 


GCAAGTTCTT 


CAGTGCTCCA 


CGACAAATCT 


120 


AGTGGGTAAC 


TATACTGTTT 


GTTCATTAAC 


TAATACCAGC 


TCTCATTCTT 


GCTTCTTTTA 


180 


GTTCTTGCTT 


ACGATAACTA 


CGAGGGAGAA 


AAGCACGAAT 


CTCATCTTCA 


TTAAAACCGA 


240 


TTTGCATACG 


CTTGGCATCA 


ATAATAATTG 


GACGACGCAA 


AAGACTAGGA 


TACTGCTCAA 


300 


TCAAATGAAG 


CAATTCCGAT 


ACCGAAATAC 


TCTCTACATC 


AATATTCAAT 


TTTTGAAAAA 


360 


TTTTTGAACG 


AGTTGAAATG 


ATGTCATCAG 


TACCATTTTC 


GGTCAAGGAA 


AGGATGTGTT 


420 


GCAATTCTTT 


TCTTGTTAAA 


GGACTGGTCA 


TAATATTGTG 


TTCCACAAAG 


GGAACTTATG 


480 


TTTTTCTAAC 


CAGGCCTTAG 


CCTTACGACA 


TGATGTACAG 


CTCGGTGATA 


GAAATAGTGT 


540 


AATCATGCTT 


TTCTCTTCTT 


ATCTATACTT 


TGCTACTTCT 


ATTATACAAA 


AAAATAAAGC 


600 


GCTTGACTAG 


GGATTTTTAG 


AAAAAAAGCC 


TATTTTTTCA 


AGAAAAATAG 


GCTTTTTGCG 


660


AACGATTGAC 


ACAATTGGAT 


TTGGTTAATT 


CACTCTTAAC 


GATGGTTTTA 


AACGATATAT 


720 


ATTTTTATAT 


ATGTAAATTA 


AAAACATCTT 


TCCTTTCACT 


TCCTACGACT 


TTTCAGATAC 


780 


AGATAGCCAA 


AGAAGTTTTC 


ATAGAGGGCA 


AAAAAGAGGA 


GGAAGGCATG 


AAGAAAGAAG 


840 


GTCTCTGGCA 


AAATCATAAT 


AACAGGATCC 


TTGGCTGGAT 


CAAAAAGCCA 


GGTATCATCT 


900 


CCCACAAAGA 


GAATTTGATG 


GAAAAGAGTA 


AAf^AATTGGT 


r* a a a a CCSAT 


CAAAACTCCC


960


CCAAGTCCAA 


TCATCACAGG 


TAAGACTACT 


AGAGCCAGGA 


GACTTTTTCG 


ATAAAGAGAC 


1020 


AAAAAGTCCT 


TTTTCACAAT 


CCTATTGACA 


AAGACATAGA 


AACTTGGCAG 


TGTCACTAGA 


1080 


GCTACTAGCT 


GAACCAAATG 


AAAGAGATTC 


TTGACCACTG 


CGAAATGGTG 


CAGACCAGCT 


1140 


GCTGACGAAC 


GAAAATCAGG 


CATCTGTAAG 


ACCTGACTAA 


AAGGATTGGT 


CAGATAATTC 


1200 


ATCAAGATAT 


GAAAATTGTA 


TTGAATGGTT 


TCTGGTTTTA 


GATAGACTCG 


ATTCGTTAAG 


1260 


TTTAGCCACT 


GAATCTCCAT 


AGGATAGAAA 


ATCCAAGCCA 


GATAAATGGT 


CAGAAGGATG 


1320 


GAGAGGGAGA 


GGAGAAAGAG 


CATAGAGCCC 


CAAAAGATCA 


ATTTAGTTTT 


CATCAAAATC 


1380 


CCACTCCGCA 


AGGCTAGAAA 


CCACATGTGT 


CGGTGCGATT 


GGCAGGCCAG 


CTACTTCTTC 


1440 


TGCCTTAGTA 


AAACCTGTCG 


TCACCAAGAG CGTTGGAATG 


CCATTGTCAA 


TCCCAGCCCG 


1500 


AATATCAGTC 


AAATAATTGT 


CCCCAACCAT 


GATTAACTCT 


TCACGTTCCA 


AACCTAAGTG 


1560 


CTCAACCGCC 


TTGTCCATAA 


TGATGGCATT 


TGGTTTTCCG 


ATATAAACCG 


GCTTCACTCG 


1620 


TGTCGCTACT 


TCAAGCAGCG 


TAATCAGTGA 


GCCAGCACCT 


GGCAAAAGAC 


CGCGTTCCGT 


1680 


CGGGATGTTG 


AGGTCAGGAT 


TGGTTCCGAT 


AAAATGGGCA 


CCCTTTTGAA 


TAGCAAGAGT 


1740 
TGCTGTGGCA


AATTTTTCAT 


AGTCGACTTG 


CCAATCCAGA 


CCAACTACCA 


CGTAGGCAGG 


1800 


TTTTTCCTTG 


TCTTCCACAT 


AACCAGCCGC 


CTTGATGGCT 


TCCTTGAGTC 


CTGCTTCTCC 


1860 


GACGACATAG 


ACGGTCTTTT 


CAAGCCCCAA 


ATCATTCATA 


TAGTCGATGG 


TTGCCAAAGT 


1920 


CGCTGTGTAG 


ACAGTCGATA 


GGGGCGTATC 


GATATTAAAA 


TTCTGAGCCA 


ACATCTCCTT 


1980 


AACACTCTCT 


GGAGTGCGGG 


TTGTATTGTT 


GGTTACAAAG 


AGATAGGGAA 


TGTCCCGCTT 


2040 


TTGCAATTCA 


TGAACAAAAG 


TCTCTCCAGC 


AGGGATTCGG 


TCTTTCCCCT 


TATAAATGGT 


2100 


TCCGTCTAAA 


TCAATTAAAT 


AGCCTTTATA 


TTTCATCTAT 


TTCTCCCTAA 


GCCTTTTTTA 


2160 


TTTCTTGCCA 


AGTAATGATT 


GCTTGGGCAT 


TGATAACCCC 


ATCACTTGTA 


ATTTCATGCT 


2220 


TGCTTTCCAG 


TCCAGTCCGT 


TCAACAGCCG 


ATGTAATCAC 


CCCACCTGGT 


CGAACTTCCT 


2280 


TGACATACTT 


GAGGTTGATT 


TTCTTGGGAA 


TATAGTGGGT 


CAAAAAATCC 


GCTCCCATGA 


2340 


CCTCAAAAAT 


CCAGTCCAAG 


TATTTACTGT 


TATTGACATG 


ACCATTCATA 


TCCAAGTCGT 


2400 


AAAAACGAAC 


ATGGTAATCC 


TTGCTGATCG 


GTTCTTCCAA 


GGACTCATAC 


TTCGGTCCAC 


2460 


GGATAAGTTT 


TTTATCAAAA 


TCAGACTGGT 


AAGGAGCCAC 


AATCTCAGGT 


TCAACAACAT 


2520 


GGACTTTTCG 


ACTGTCGCGG 


TCCATGAGAA 


CAAAGGTCGC 


CATCATGTGG 


ATGAGCTCCT 


2580 


GCTCCGCTTC 


ATTATAAATA 


GTAAAGCGAC 


GGTAGCAAAA 


AAGTCGATTG 


TAGCTCAAGG 


2640 


CTTCCGTTTC 


GATGGTAATT 


TCTTCCGCAA 


AACGAGGCAA 


ACGAACCACC 


TCAATATCAT 


2700 


ATTCTACGAT 


AATCCAGACC 


AGATTATATT 


CTTCCAAAAT 


GGCCTTATCA 


CTAACTCCCA 


2760 


GTTCAATCGA 


CTGCATCCCT 


GAAACTTGCA 


GTGACAGCAA 


AATCACATCT 


GGAAGTTTGA 


2820 


TATGACCGTT 


CATATCAGCC 


ATATCAAAAG 


GAATTTTCAT 


TTTCATTTGA 


TAAGTTAAGC 


2880 


CCATGATCCT 


ACTCCAAAAT 


AAATCGTTCT 


GCTACAGTAT 


CTCCCAAAAA 


GAGACCTCTC 


2940 


TTTGTCATGC 


GAACGTGGTC 


ACCCTCAATC 


TGCATGAGGC 


CTTGTTGAAC 


CAAATCTCTG 


3000 


ACAATTTCTC 


CATAAAGTCC


AGCAAAAGAC 




ill i CAA/i 


1 VjVm L.n 1 o 


3060 


GAAACCCCGG 


ATTTCTTGCG 


GAGTCCCAAG 


AACATTTCTT


CTTCCATTTG 


CTCCTTTTGA 


3120 


CTCAGGTGAT 


CTTCTGTAAT 


ACAAGCATTG 


CCTTCCTCAA 


CCGCACTGAG 


ATAATGACGA 


3180 


ATGGGACCAT 


GATTTTTATA 


GCGTACTCCA 


TTGACATAAC 


CAGATGCCCC 


TGCACCAATA 


3240 


CCATAGTATT 


CAGCATTGTC 


CCAGTACATG 


AGATTATGAC 


GACTTTCAAA 


ACCGGGTTTG 


3300 


GAGAAATTAG 


AAATCTCATA 


ATGCTCAAAA 


CCCGCTCGCT 


CCAGCTCTGC 


AATGATGTAC 


3360 


TCAAACATCT 


CCGCTTCTAG 


TTCCTCCTTA 


GGCAGAGGCA 


ATTTCCCACG 


TCGCATCCGG 


3420 


TTCATAAAGA 


CCGTATGGTT 


TTCTAAAATC 


AAACTATACA 


AACTCATGTG 


GGGAATATCC 


3480 


AATCCAATGG 


CTTTAGCCAC 


ATTTTCCTTT 


ACTTGCTCCA 


TGGTCTGACC 


AGGCAGAGCA 


3540 
TAAATCAAAT


CAATGGAGAT 


ATTGTCAAAA 


CCAGCCAGTT 


TCAGGCGATC 


GATATTTTCA 


3600 


TAAATATCCT 


TCTCCAAATG 


ACTGCGCCCA 


ATCTTTTTCA 


ACATCTTATC 


ATCAAAGGTC 


3660 


TGGACACCTA 


GCGAAACACG 


ATTGACAGCC 


GAATTTTTCA 


AAACAGCTAT 


CTTATCCGCA 


3720 


TCCAAATCGC 


CTGGATTGGC 


TTCAATGGTC 


AACTCTTCCA 


AGACAGACAA 


ATCCAAGTTT 


3780 


TTAGTCAAGC 


CATTCAGTAA 


CACCTCCAGT 


TGCGGAGCCG 


ACAGGGCTGT 


CGgTGTTCCA 


3840 


CCACCGATAT 


AAAGGGTTGA 


CAACTTTTCA 


ATATCATAAG 


AACGAT^CTC 


TTCCAGCAGA 


3900 


TGCTCTAAAT 


AGCTGTCGAC 


TGGCTGATTT 


TTGATGAAGA 


CCTTTGAAAA 


ATCACAATAA


3960 


TAACAAATCT 


GGGTACAAAA 


TGGGATGTGC 


ACATAGGCTG 


ACGTTGGTTT 


TTTCTGCATA 


4020 


GTAATTATTA 


TACCACAAAG 


ACTAGATTCC 


AGATAAAAAT 


CACCATCCCC 


AGATACATAG 


4080 


TCCGTCCGGA 


GATGGTGATG 


GTTTATTCTT 


CTGTTATATC 


AATCACAATC 


TCTTCTGAGT 


4140 


CATCAAGAGC 


TTCGGCTTTT 


TCTTGCCATT 


GCTCCTTGAG 


ATTATTTAAT 


TGATTTTTTG 


4200 


ATGCTTCTGT 


CGCTTGAAAA 


GCATAGGATT 


TAGTTTGAGC 


AAGTATACTG 


TCCACAGTGA 


4260 


TTTCACCTGA 


CTCAACCTGT 


TCTTTTGTTT 


TCAGAACAAA 


ATCTGTAGCC 


TGCTCCTTAA 


4320 


CTTCTGTCAG 


TTTTTCACAG 


ACTTGCTCCT 


TGGCATACTC 


CGGATCTTCT 


CTCAAATCAT 


4380 


CTAGAAAATC 


TTGAGCCTGA 


CTGCAAACTT 


GTTTGCCCTT 


ATCACTTGTT 


AAAAACAAGG 


4440 


CAAGAGCTGC 


ACCTGAAACG 


GTTCCTAAAA 


GGATTGAGGA 




ATAAGGATTC 


AC 

-3 ^ W V 


TCCTTTTTTA 


TTTTTTGAAA 


AATTTACTTG 


CAAGACGAAG 


AGCTGACAGA 


CTTGCACCAG 


4560 


TCTTGAGTGT 


TTTTGAACCA 


GCTGATGAAG 


CTTTCTTGCT 


CAAGACACGC 


GCATGGTCAT 


4620 


TGAGGTCTGA 


AACAGATAGA 


GATAAATCTG 


CAACAGCACT 


GAAGAGTGGA 


TCAATCGTAG 


4680 


CCACCTTGAC 


ATTGATATCA 


TCTGCCAAGA 


CATTGACCTT 


AGCCAACAAC 


TCATTGGTGT 


4740 


GATGCAAGGT 


CACATCCACA 


TCTGAAGTCA AGGTTTTAAT 


CGTCTTTTCT 


GTTTCATCGA 


4800 


TGACACGACC 


AAGCTTTTGT 


ACAGTAATGA 


TCAGATAGAC 


CAAAAAGACA 


ATCAAAGCTA 


4860 


GGGCAACAAG 


AATATATGCA 


ACTTCTAACA 


TTTAGTTTTC 


CTCCTCTGTA 


ATATAGTAAG 


4920 


GGGCCTTCTT 


TCGATTTTGA 


TAAATAACGA 


TCATTATACC 


GAGACCGATA 


AGGACAACTG 


4980 


ACAGCCATTG 


GGACACTCGA 


AAGCCGAAGA 


ACATGAGACT 


ATCTGTTCGC 


ATACCTTCGA 


5040 


TAACCATACG 


ACCGAAACCA 


TACCAAATCA 


AGTAAAAGGC 


CGTGATATGA 


CCTCGTCTGA 


5100 


GACTCTTCCA 


TTTCCGTCTA 


AAAATCAGAA 


TCAAGGCAAA 


GCCAAGCAGA 


TTCCATAGAG 


5160 


ACTCATAAAG 


GAAAGTCGGT 


TGACGGTAGC 


TCCCCTCAAT 


ATACATCTGG 


TCACGGATAA 


5220 


AGCCAGGTAG 


ATAATCCAGA 


TTATCCACTG 


TTGCACCATA 


AGCTTCTTGG 


TTAAAGAAAT 


5280 
TACCCCAACG CCCCAAACTT TGAGCAATCA TAACGCTAGG CGCCGCAATA TCTAGAAAAT 5340

CCCAAGTATT GATGAGTTTA CGGTCAGCAA AGATATAGAG CACAAGAGCC CCAGTTATCA 5400 

AACCACCGTA AATGGCCAAA CCACCATTCC AAATGGCAAA AATCTCTCCT AAATTCTGAC 5460

TATAGTAATC AAATCGGAAA ATAACATAGT AGAGACGAGC TCCTAAAATA GCCAAGGGAA 5520 

AGGCTACTAA GATAAAATCT AAAATATCGT CTGGTATGAT CTTCTTTCTA GGTGCTTCTT 5580 

TCATGGTCAA ATAAACCGCA AGAATCAAGC CTGTCACAAT ACATAAGGCA TACCAACGAA 5640

TGGCTAGGGG TCCTAGTTGA ATAGCAATTG GATCAAGCAT TTTGCACCTC ATTTCGAGCG 5700 

ATTAGACTTG TCAGTCGTTC GTCGAACAAA CGGGTCGCAT CAAAGCCCAT TTCCTTGGCA 5760

CGATAATTCA TGGCAGCTGC CTCAATCACA ACAGAGATAT TACGACCTGT TTTAACTGGA 5820 

ATACGAATAC GAGGAATGtA CGCCAGAAAC TTCAAGTTCC TCTGCATTAT TTCCAAGACG 5880 

ATCAAAGGTC TTATGCGTAT CGTAATTTTC CAAATAGACA GCAAGCTGAA CCTGTGAAGA 5940 

ATCCTTGACA GCACTCGCAC CGTAGAGACT CATAACATCG ATAATACCAA CCCCACGAAT 6000 

TTCAATCAAG TGTTTCAAAA TTTCAGCTGG TTCACCCCAG AGAGTAATCT CATCCTTGGC 6060 

AAAGATATCG ACACGGTCAT CGGCTACCAA ACGGTGACCA CGTTTGACAA GCTCAAGACC 6120 

TGTCTCGCTC TTACCAATTC CACTATCTCC CTGAATCAAG ACGCCCATCC CATAAATATC 6180 

CATCAA 6186 



(2) INFORMATION FOR SEQ ID NO: 132: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9541 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132: 

GAAAATCACA ACCCTTTTTG CAAAATTTTT GAGATTATTT TCACAAACTT GATTTTTCAA 60 

AGTATACTCA ATAAAAATTA AAAAAATCCA CTACGTCAAG GCGAGGCTAA TGTGGTTTGA 120 

AGAAATTTTC GAAGAGCGTG AATGAGTATC ATCTATAGTA AAATAAAAAA ACTGAACAAT 180 

TTGGTTGGGG ACAGCCAAAC CAATTTCTCA CAATGTTTCA GAAACAAGGG TGTGCTATTC 240 

CAATTTCAGC CTACTATAAC TGTCATAGAT TGCTGAAACA AAGTCTAGGT AAAAGTCTTC 300 

ATAATAAAAA GACCTCCTAT CAAGTGTTCA AAAACTTTGA TAGGAGGTCT TGTTTTGTGA 360

AAATATTTAT CAAATTTTCT ATACAAGTGA GCTGTTAGCC AGGTTCTTTC TATTCTTTCA 420 

ATTTCAATGA ATGGATTTTT TACTAATACT CATAACTGGG AATTTGTCTG TGTAAAAATA 480 
GCGAGATAGA TGGTATTTAT AAAACACTCA AGACAGCTAG ACTAATATCA TTTAAAACAT 540

TATCTTCTTT TGAGCGACTG TTGGTTACCA ACATAGCTAA ATTTCCTGCA TTTTCAAATT 600 

GATAGGGTTC TGATTTAGCA TTCACAACCA CCAAGAGGTG TTCTTTGCCG TGAACTTCAT 660 

AGATAAGGTA GCCGCTATGT TCAATCGCAG AATGCACAAA GACATGATGG TAAATTTCAT 720 

CATAGCTAGA GTAAGAAAAG GCACCAGTTT TTGTCTTCAA TCGGATGACT TGACGGATAA 780 

ACTCAATACT GTCTTGACGC TCATTAATCA AGTTCCAGTT CACTTGGTTC ACACTGTCAG 840 

GAGCATTATA GCTATTCATC GCACGCTCTC TATCATCATG GGTCAACTCA CCATTTTCAC 900 

CAGTCGCAAC CAGTTTGGTA CGACCAAATT CTTGACCGAT TTCCATAAAG GCCATCCCCT 960 

GCATGAGCAG ATTCATGGCT GTGGCAGTTT CGACCTTGCG CATGATTTGC TCTGAACTTT 1020 

GGTCTGGATG AAGGGTTGCC AATAAATCGT GAAGATTGTA ATTGTCATGG GCTTCTACAT 1080 

AGTTAAGCAC CTGATTTGGA TGTGTATAGC TTCCTAATTC ACGACTTCCT AGGATTGCTT 1140 

TAGCTAGAAT TGGCTCTGTC GCAGCACCAC TGACAAAACC TGACTTGATA GCACCATAAA 1200 

CTTCTCCCCC TTTGACAGCA TCGCGCTGAT TGTCATTAAA GAAACCAATA TTTGGCATCT 1260

GGTAGGCATT GTCCTTCTTG GCCTTATCAT AAGGGGCAAG ACCTGTTCCC ATATCCCATC 1320 

CTTCTCCATA GAGGATAATG TTGGAGTCGA TTTCATCCAA GCTTTGACGA ATCATCTGCA 1380 

TGGTCTTGAC ATCATGAATC CCCATCAAGT a^AAACGG^'^J^. GCCGTCAATA TTATATTCCr 1440 

GCACCCAGTA TAGAAGAGAA TCAATCATAT ACTTGCGAAA CATTTCGTGT TCACTGGCTG 1500 

TTTCATTTCC AACACCCGTT CCATTCTGGA AGGTACCATC TGGATTCATA CGATAATAGT 1560 

AATCAGGGAC TGTTGTTTGG AATGGTGCAT CAACAACTGA GAAGGTATGG TTATAGACTA 1620 

CATCCATAAT GACTCCAATA CCCGCATCGT GATAAGCTTG AACCATCACC TTCAAATCAC 1680 

GAATGACCTG AGCTGGATCA TCTGGATTAG TTGAAAAACT AGTTTCTGGC GCGTTATAGT 1740

TTTGTGGATC ATAACCCCAG TTGTAGGTTA CATTTCCATC CTCATCGTAT TCTTTATCAC 1800 

GGTCTGCAAT TGGTTGCAAT TGAACATAAT TGTAGCCCAG CTTCTTGATG TAATCAAAAG 1860 

CAGTTGACTG GCCGTATTGG TTAACTGTTC CAGCCTGAGC AGCACCCAAG AAAGTTCCTC 1920 

GAAGATGTTC ATCTACACCC GATGTAGGTG ATTTAGTCAA ATCACGAATG TGCATTTCAC 1980 

AGATAACTGC CTTACATGGA TTTTCCAAGC GCCAAGTAGC CTCCGAACCG TGCTTAACCT 2040 

CGAAGTTTTC AACTTGCTTT TCTACATGGC TCAGAATAGC TGAACGTTTG CCATCAGGGC 2100 

TGGTCGCGAT TGTATAAGGA TCACGTGTCA GTGTTTGGTG ATGAGGGAAT TGGACTTGAT 2160 

ACTGATAAGT CTTACCTACC AAATCTTCTT CAACATCCAA ACTCCAGACA CCGATTGTAT 2220 
TGTCCTTATG


A TTAT AAG At» 




1 1» 1 1 1 1 t~rVl 


CTCAAAAGTC 


TTCCAAACGG 


2280 


GTGCATCATT 


AGCAGCTGAT 






mm (-t TVJ'p/^/^OT' 
i I ^ l\j 1 ^Vj^ 1 


GTAGGTGACC 


2340 


AGAGAGAAAA 


ATGAGCCTGA 




L. ALi u o An v< ^ 


\u>/Vr\ 1 1 ^ J> 1 


TGGTAACCCC 


2400 


AATGATGATC 


AAAACTAGCA




CCTTATCAAA




TTTTfiATTTT 


2460 


TATAGAAAGG 


ACTGGCAATA 


GCAGGATTTT 



CAGAGTAATA 


AAl CL. 1 AXUA 


mr'r; rr'TTrr' a 

1 ^•uL'V- 1 1 cv./\ 


2520 


AAATCCAGAC 


CTCTGTTAAT 


AGGGGATAGT 


GATTAAAACG 


G A 1 AlaAA i A 1 


1 U 1 1 i AL. 1 Mvj 


2580 


TTTGACCTGT 


ATGAACCACA 


AAATTCAAGC 


TTTCTATAAC 


IV m^m/^ a a r^mm 
A i L» J. G AAt- 1 i 


r* rT" TV^ T T a a 


2640 


AGCTAAATAA 


AGCTCCAAAA 


TAATCTTCTT 


TGTAGGTTAG 


a IV a m/^ a a mm 
l^AAAlUAAX 1 


^/~*'pm^^ a mf^r^T* 
CGi ivjAlCL-i 




GACTTTTTAC 


AAAGGAGCAA 


GTGTCATATT 


CTCCATTCTT 


ACGATGGTAA 


m/^a am/^^^<^(^a 
TGAATGCGLA 


2760


TAGGGTAGTT 


ATACATTTTT 


TATTTTTCCT 


TTTTACTTTG 


TTTCTATTTC 


ACTAATAAAT 




TTTTGTCAAT 


CTCGTCTCAA 


TTAACAGACA 


TAGTCATATT 


CTCTAAACTC 


TGTTTTTAAA 


2880 


CGATCCATTA 


CAAACTTTCT 


AGCCATGCCT 


CATCTCTGAC 


CTGGATACCA 


AGTTCTTGTG 


2940 


CTTTTTGCAG 


TTTACTTCCA 


GCGTCTGCAC 


CTACCACGAC 


GAGGTCGGTC 


TTTTTAGAAA 


3000


TACTACCTGT 


CACTTTGGCA 


CCCAGACTTT 


CGAGTTTACT 


TTTAGCTTCT 


GAGCGCTTGA 




GTCGTTCCAA 


TTTTCCTGTC 


AATACCACGG 


TCAAACCTGA 


CAAGGCCGCA 


m*^/*»^/^ma r*'T> a 
Tt-t-tsl- 1 Al_ 1 A 


3120


CCGTCTGTCC 


TTTATAGTCC 


AGATTGACCC 


CAGTTTCTTT 


CAATTCTCTG 


a/^/^a/*" a ammm 
AGL.AGAA1 1 i 


3180


CAGAGCCTTC 


TGTCGCAAAA 


TAAGTCTGAA 


GACTTTTGGC 


AATCACGCCA 


/■•/^m a a /^mmm 
t-t-lAGALl 1 1 


3240


CAATACTAGC 


CACTTCCTCT 


GAATCTGCCT 


GAGACAGATT 


mm/^ a amm*^ a a 
1 1 C AA X 1 uAA 


mf^na a aTa'PT 

IVjvinAAlAl 1 


3300 


GAAGTAAAAG 


CTGACTAACC 


TTGCTTCCGA 


CA l\aAHsAA 1 




aaTaanAfiPT 

rvr\ 1 /VrVVjiAVov. i 


3360 


TCTCGGCAGA 


ATTTTCCTTT 


GATGCTTGGA 


m a r*o/^fpr^ jvm s 
1 AGt^t- A 1 A 


CALi 111 A 


u^uvjrAV. 1 1 X i 


3420 


CCTTAACTCC


CTCTAAAAGG 


AGuAAA I CV_ 1 




APfiATAAATA 


TCCGCCACAT 


3480 


CCTTGACTAA 


ATTAGCAGCA 


AAAAGCTTCT 


CAACAATAGA 


TGGACCAAGG 


CCTGTAATAT 


3540 


TCATAGCATC 


ACGAGAAGCA 


AAGTGAATCA 


AGCCTTCCAT 


GATTTGAGCA 


GGGCAACGCG 


3600 


GATTGATACA 


ACGTAGGGCC 


ACTTCATCTT 


CAAAGTGCAA 


CAAGTCAGAG 


TTACAACTTG 


3660 


GACAGTTTGT 


AGGGATATCT 


AGTTTTTCTT 


CAGAAACCCG 


TTTGGACTCT 


ACCACACGTA 


3720 


AAACGGCAGG 


GATGATGTCA 


CCAGCCTTAT 


ATACAATGAC 


CGTATCGTCT 


TTTCGGATAT 


3780 


CTTTTTCAGC 


AATATAATCT 


ACATTGTGCA 


GGGTCGCACG 


GCTAACAGTC 


GTACCGGCAA 


3840 


GTTGTACTGG 


TGTTAGATTA 


GCAGTTGGAG 


TTACAACACC 


GGTACGGCCA 


ACTGTCCAGT 


3900 


CAACTGATAA 


GAGTTGAGCT 


TCTTTTTCTT 


CGGCAGGGAA 


CTTGTAGGCT 


ACTGCCCACT 


3960 


TTGGAGCCTT 


AACTGTAAAA 


CCAAGTTCTT 


CTTGACTTGC 


TAGGTCGTTG 


ACCTTGATTA 


4020 
CCACTCCATC


AATATCGTAA GGCAGATTTT 


CCCGTTCCTG 


TCCTACTTCT 


TGGATAAAAT 


4080 


TCCAGATTTC 


ATCTATGTTT 


TCAGCCAAGA 


TTCGCTTAGG 


ATTGACCACA 


AAACCTAGTT 


4140 


GTTCTAGGTA 


CTTCAAACCC 


TTTTCTTGGC 


TATCACGAGT 


TGAAGGGCTG 


GCTTCTTGAT 


4200 


AGAGAAACGT 


TGCAAGATTA 


CGCTTGGCAA 


CTACTGCTGT 


ATCCAACTGA 


CGCAGAGTTC 


4260 


CTGCTGCCGC 


ATTACGAGGA 


TTAGCAAATT 


CAGGCTCTCC 


ATTTTCTTGG 


CGCGCTTGGT 


4320 


TAACTTGGTC 


AAAGGAAGCG 


CGTGGCATGT 


AACATTCCCC 


ACGAACTGTG 


ATATCTAGTT 


4380 


CTTCTGGCAA 


AGTCAAAGGG 


ATGTCCTTAA 


CACGCTTGAG 


GTTTTCTGTG 


ATATTTTCAC 


4440 


CAATTGAACC 


ATCTCCACGT 


GTTACCCCAG 


CAACCAAAAT 


CCCCTTTTCA 


TAAGTCAGCG 


4500 


AGATAGATAA 


GCCATCGATT 


TTCAGCTCAC 


T^TATAGGT 


CGGATGAGCC 


ACTTCCTTAC 


4560 


GAACACGCGC 


ATCAAAAGCA 


TCTAGCTCCT 


CACATGAAAA 


AGCATCCTGC 


AAACTATAAA 


4620 


GAGGATACTG 


ATGACTGTAT 


TTTTCAAAAC 


CATCTAAAAC 


CTTGCCACCA 


ACACGATGAG 


4680 


TCGGACTGTC 


TGCTAGCACT 


TGCTCTGGAT 


AAGCAGTTTC 


TAACTCGACC 


AACTCACGGT 


4740 


AAAGGCGGTC 


ATACTCACTG 


TCTGAAACCG 


AGGGATTATC 


GCTGGTATAG 


TACTCAGTCG 


4800 


CATAGCGATT 


GAGCAAAGCG 


ACTAACTCAT 


TCATTCTTTT 


ATTCATAAGA 


CCATTTTACC 


4860 


ATAAAACAAG 


CCCTCCTCAC 


AAACGAGAAG 


GGCGGAAAAA 


ACACTTAGTT 


TGAAATTATT 


4920 


TTTGAAACTC 


AAGCAACCTT 


ATATf^AATTT 


"T^raji^BJlTGA 


GTTCGAACAT


ATCCGAGAGC 


4980 


TAAGAAATAT 


AAGGCTACAA 


CTCCAAGTCC 


AATAATCAAG 


AAAGAATAAA 


GATGGACACT 


5040 


TGGCAAGACT 


GTCATAAATC 


CTTTTGCAAT 


AGGCATAAAT 


AGAATAGCTA 


AGGTAAAAAT 


5100 


TGTACTCAGT 


ACTCTTCCAA 


GAAATTCGCT 


CTCAACCTTG 


GTTTGTACTT 


GAGTAAAAAA 


5160 


GTGAATATTA 


AAAATCGTCA 


TAAACAATTC 


ACAAACTAAA 


TTTCCAGAAA 


AGGAAAGAAA 


5220 


AGTTGGAAGT 


GGTAATCCCA 


TCATAAAAAC 


TCCGACACCT 


GTCAAAGCCA 


GTAAAATCAA 


5280 


AAGATTATAA 


ATATTAGCTT 


TAATTTTACT 


AGCTAGAAGA 


GCCCCAATGA 


TGGAACCAAT 


5340 


AGCCCCCATA 


GTTAAAATAC 


TTGCATAGGC 


TCCTTCTGAC 


CCGTAAAGCT 


GATTCGAAAA 


5400 


GGGAAGTAGA 


AATTCAAAAG 


CTGCAAAAAA 


GAAATTAACG 


CTGGAAGCTA 


CCAGCAAAAG 


5460 


GAAGAAAATT 


TCTTGCTGAT 


GCCAGATATA 


GTGTAACCCA 


TCCTTGATAT 


CTACAAAAAT 


5520 


ATCTCTCCCA 


GTAAAAGCCT 


TTTTCTCTTG 


AACTTTTGCT 


TCCTCTTTTG 


GAAGGAAAGC 


5580 


CACTAGAACA AAAGCAATGA 


AAAAAGTCAG 


CGAGTCTAGC 


AGTAGCGTCA 


TATGGAGACT 


5640 


TGCAAACTGT 


AAAACAAGGA 


AGGAAAGAAC 


AGGAGAGCTA 


ACACCTACAA 


CCTGCAAAAC 


5700 


CAGCTCTAAG 


CGAGAATTAT 


AGATCACAAT 


CTCATCTTTC 


TCCACCACTT 


CAGTTATGAT 


5760 
AGCTTTATTG GCTGTGCGAG AAAAGGCAAA AGCAATAGCC TGCACAATGT TAGCAACAAT 5820

CAAAGCGCCA ATCATCCAGC TATCATTCCT TATGAAAGAA ATAGCCAGAC AAAGAATCCC 5880 

ACAAACAAGA TCTGCCGTCA TTAAAATCTT ACGACGAGAA AAACGGTCTG AAATAACTCC 5940 

GCCAAAGGGA TTGACGAGAA TAGATGTGAC GAGCTCAGAA ATCTGATACA TTCCTAAAAC 6000 

TGTCTGTCCT ATAGTCCCCA TAGAAGCCAA CCAGACACTA TTTCCATAAT CATAGAGCAT 6060 

ATTTCCCATT TTATTGATAG CCCCACGGCT AATCTIACTGC ACTGCATAGC GATTCATATT 6120 

AAAGCTCCTC TCAAATTTTG AAACTATTGT ATCAAAACCG AAAGGAGCTT TTTATTTTTT 6180 

CCCTTATTTG GGAAAATTAA CTTTTGACAA ATTTTTCGTA GTGTTCCTGA TAATAGGCTA 6240 

CTTGCTCTGG AAGACCTAAC ACATCAAAAA TATGCATGGC CTCTTGCATC TGCTTACAGC 6300 

CTTCTTTACA CTGTCCTTTT TGATATAAGG CAAAACCTTT TAAATAATGG AAAACATTAC 6360 

GCTCATAAAG CTTAATACCT TTGTCAATAA TCTTCTCTGT ATAAGCCTCA AAATAGTTGG 6420 

CATTATAAAA AGAAGAATGC TCTAAACAAT GCTGGTAACA ATTGAGGGCC AAAATCAACA 6480 

CTAATCTCTT ATGGCGACTA ATCTCTTGGT AAAATTCCTC CCTCTCCATA ACTTCTCTAC 6540 

CAATCCGAGT GACATAGTCT ACATCGTAGA AACTATAGAG GTTACCGAAA AGAATCAACT 6600 

CATACATGGT CCATTCTTCT GTTTTGAAGA GATAATCTGC TACCTTACCC AAATCATCCT 6660 

GCTTCATATC ATAACTCGCA TCTCTTTGAC AAATCAGACC TTGTAGCAAA ATCCAGTTCA 6720 

GCTCAAAATA AAGGGGAGTC GTCGAACTCT TAGACTTTTC AAGTTGTTCT CTTTGAAGCT 6780 

TTTGAAAACC TGCAATATCX3 TTTGAATAGT AAAGTGGGAT AATCTGTGCC ATCATAGACA 6840

CATGTTCATG ATTATGAAAA TTCCTTGCCT TATCCATGAA ATTTTCGATT GTTACATGAA 6900 

TGTTATCCAA AATCTCAAAG AAACGGGAGA CTGCCAGGTC AGACTCCCCA AGCTCAAAGC 6960 

GAGATAACTG AGAGGTAGAG CAGGATTCGC CTGCTGCTTC CTTTAAAGAA TAATTTCCAC 7020 

TTGTTCGAAA TTCACGAAAT ACTTTTCCAA GATGTTCCAT CTTTACACCT GCTCTGATAA 7080 

TTCTTCCCAC TCAAGCATAG CTTCTTCCTG ACGATGGCTG ATTTTGTCCA GCTCAGCCTG 7140 

TAATTCCATG AGTTTGTCGG CATCGTTTGT TTCCAACATT TGTTCAGAAA TGGCTTGGCT 7200 

TTGACTTTCT AGCTCTTCAA TTTCAGCTTC TAGACTTTCG ATTTGTCGCA TGAGTTTGCG 7260 

AACTTCTTTT TGACTTTCTT TCTGGGCCTG ATAGTCATTG ACTGGACTTG CTTCCTTTGC 7320 

TTGATTGCTA GTTGAAGCTT CCTCAGTCTG ACTCATTTCT GCTGTTGCTT TCTTCTCAAC 7380 

ATAGTAGTCG TAATCTCCAA GGTAGAGAGT TGAACCATTC TCAGACAATT CCAAAACATG 7440 

AGTTGCCACA CGATTGATAA AGTAACGATC ATGACTGACA AACAGCAAGG TTCCATCAAA 7500 

GTCAATCAAG GCATTTTCTA GCACTTCCTT ACTATCAATA TCCAAGTGGT TGGTCGGCTC 7560 
ATCCAGAATC AAAAAGTTAT TGTTTTCCAT AGACAATTTA GCTAAAAGCA AACGAGCTTT 7620

TTCGCCACCA GATAGCATGC CGACTGATTT TTTAACATCA TCTCCTGAGA AAAGGAAGGC 7680 

TCCAAGACGG TTGCGGATTT CAACTTCTGG TGTCAGTTTG AAATCATTCC AGAGTTCATC 7740 

CAGCACCGTA TTACTTGGTG TCAGCTTGCT TTGGGTTTGG TCATAGTAAC CAACCTCAAC 7800 

ATTAGCGCCA AAGCGCTTTT CTCCCTTGAT AAAAGGAATC TGGTCCACAA TAGACTTGAT 7860 

AAAGGTTGAC TTGCCGATAC CATTTGGACC AACGATAGCG ACAGCATTCA TCTTACGAAG 7920 

ATCTAGGTTA ATCGGTTGTG ACAAGACTTC CCCGTCATAG CCAACAGCTG CATTTTCAAC 7980 

AGTCAAAACA ACATTGCCCG ACGTTTTTTC AGACTGGAAG GTCATGTTGG CTGATTTCTT 8040 

GCCAGCTTCA GGCTTGTCCA AACGTTCCAT TTTTTCCAGT TGTTTACGGC GAGATTGAGC 8100 

ACGTTTAGTC GTTGAAGCAC GAACTAGATT GCGATTGACA AAGTCTTCCA GAGCAGCGAT 8160

TTCCTTCTGT TGCTTTTCAT AGTTTTTTGC CTCAGTAACT AGCTTTTGCT CCTTCAATTC 8220 

GACAAAACGA GAGTAATTCC CCACATAGCG ATCCAAGGAA TGCTTGGTCA AATCTAGCGT 8280 

AATTGTCGCA ACCTTGTCCA AGAAATAACG GTCGTGGCTG ACGATAATGA GGGCACCGCT 8340 

ATAGTTTACC AAGTAATTCT CTAGCCAGGC GATGGTTTCA ATATCCAAGT GGTTAGTTGG 8400 

CTCGTCCAAG ACCAAGAGAT TGGGCTTTTC AAGGAGCATT TTGGCAAGTG CCAAACGAGT 8460 

ATTTTGACCA CCAGAAAGCT CAGCAJVTTTT CATCTGCCAC *.T,\CACTCGT CAAACTTGAA 8520

TCCATTCAAA ATCGCTCGAA TATCAGCTTC ATAGGTAAAG CCACCTGCTT GGCGAAAATT 8580 

CTCAGATAAG CGGTCATAAT CTGACATCAG TTTATCCAAA TCCTCACCAG ACTTTTCACC 8640 

CATCTCCAGC TCCATCTGAC GCAGTTGTCT CTCCGTCCGA CGCAAATCAT TAAAGACATG 8700

AAGCATTTCA TCGTAGATGG TATTTTCAGA CTCAAAACGG CTATCTTGGG CTAGGTAAGA 8760 

CAGAGAAATA TCTTTTTTCT TATTGATTTC TCCGCTAGTT GGCTCCTCTT CTCCAACTAA 8820 

AATCTTCAAA AGAGTAGACT TACCTGCACC ATTTTTCCCA ACAAGAGCAA TCCGATCTCG 8880 

TTCATCAACC TGCAGGTTGA TATTATCGAA AAGAACCTCT CCTGCAAAAG AACGTTCAAT 8940 

TTTATTAGCT TGTAAAATAA TCATACAAGT AGTATAGCAT GTTTCCCTAA GGCATTCAAG 9000 

ATAATCGTAA GTCTTTTAGT ACAACTTTTA TAACATAAAA TAAACTAAAT TATGTATATT 9060 

TTATATTAGA TTACTTCACT ATCTTGTTGG ATTTTCTAAC CAGCTAATCT TGTTTCAAAT 9120 

AGTTATCGCA CAAGTCTATT ATTTAATTCT TTTCATCATT TACGTACGTA TAGCAGATTG 9180 

AAATAAGATG AGAACAAATC GATTGGGAAA GTAAAATTAA TTTCTATAAA TGTTTTAGCA 9240 

ATTGTTTCGT ACTATTTTAG ATTCAGTCTA CTATATACAA TATTTTCGGA ACATTCAACT 9300 
TTTTAACTCT ATTTATTACT AGATTTCATA ATTAAAAAAC CTACTGACCA AGCTAGAAAG 9360

CTTGATACAA TAGGCTTTTT AAAGACTGAT TATTTAACAG CGTCTTTAAG AGCTTTACCA 9420 

GCTTTGAATG CTGGTACTTT AGAAGCTGCA ATTGTCATTT CTTTACCAGT TTGTGGGTTG 9480 

CGACCTTTAC GTTCTGCGCG CTCACGAACT TCAAAGTTAC CAAAACCGAT CAATTGAACT 9540 

T 9541 



(2) INFORMATION FOR SEQ ID NO: 133: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3502 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133:

TTGACTATCC TATCATGCTT TCTAAGGTCT ACTCAAGAAA ATCATTTTCA AGTTTTCACA 60 

CCTTTCTCAA AAAAGTTAAA AAATTTTCTC AAAAACGCTT GACTCTGACC TAAGGCGAAG 120 

GGTTATACTA TCATTGTAAG GAGGAAATCA TGTACCATAT AAAAGAAGCT GCGCAGCTTT 180 

CAGGTGTCTC TGTCAAGACC CTGCATCACT ATGACAAGAT AGGACTCTTG GTCCCCTTAA 240 

AGTCGGAAAA CGGCTATCGA ACCTACAGTC AAGAGGATTT GGAACGCCTT CAGGTCATTC 300 

TT-TACTaCaa ATATCTAGGC TTTTCTTTAG AGAAAATAGC AGAGCTGTTA AAGGAAGAAA 360 

GGACAGATTT ATTGCCCCAT TTGACTAGGC AGTTGGACTA TCTAACTCGC GAAAGGCAAC 420 

ATCTGGATAC CTTGATTTCC ACCTTGCAAA AAACTATTCA AGAACAAAAA GGAGAAAGAA 480 

AAATGACCAT TGAGGAAAAA TTCACGGGAT TTAGCTATCA AGACAATCAA AAATACCACC 540 

AAGTU^GCGGT AGAGAAATAT GGTCAAGAAG TCATGGGACA AGCGCTCGAA CGCCAAAAAG 600 

GTCACGAAGA CGAGGCTACG GCCGCCTTTA ACCAAGTCTT TCAAACTTTG GCACAAAATC 660 

TTCAAGTTGG TTTACCTGCA ACAGCAACCG AAAACCAGGA GCAAGCAGCC AAGCTCTTGC 720 

AAGCCATTCG CACTTATGGA TTTGACTGCT CTATTGAGGT ATTCGGTCAT ATCGGTAAAG 780 

GTTACGTCTA CAACCCAGAG TTTAAGGAAA ACATTGACAA GTTTGGTTCT GAAACAGCCC 840 

AGTACACGTC AGATGCCATT GCGGTTTACG TTCAGACAAA TGCAGAATAA ATAGGCTAGG 900 

AATTTCCTAG CCTATTTTTT ACTTCAAATC ATAAAGCCAG TCGTCACCGT TTTTGTAGTA 960 

AAAGAATTCA CTGAGATCTT CTTCTAGAAA CACACGAAGC ATATCAGACA TATCATCGGT 1020 

TGCAAGTTTT AGATGAGAAA GATTTTCAAA GTCCTCCCAC CAAACTTTCC CTTCGTCTGA 1080 

AGACTGGAGT TCACCAGTAA AGTGTTCTGT CTTGTAAAAA AGGACGACAT AACGATAATC 1140 
CTTGTCGTCA TACCAGTTTT TGATACCACA GAGTTGGGGT TTGGAAATGA TCAGACCAGT 1200

TTCTTCTTTC ACTTCACGAA TGACAGCATC GACAAAGGAT TCGCCACGTT CAACATGACC 1260 

ACCAGGAAAA GTAATGCCAG ACCAGTCGGG ATTAACTCGG TCTTGGACCA GGACCTTATC 1320 

TCCGTTTTTA ATCATACACA TGTTAACAAA TTCGACTGCC TCTCTTCTGT TCATTCTTCA 1380 

CAACCTTTAA TCTTTAATCA TAATGCAGAC TTCCCGCCAC CCAGCCGGTA CAGAGGGCAG 1440 

AAGTGATGTT AAAGCCACCC GTGTGGGCAT TGATATCCAT AACTTCGCCT GCAAAGTGGA 1500 

GGCCAGGTAC CAGCTTACTT TCAAGGGTTT TAGGATTGAT TTCCTTGAGA CTGACTCCAC 1560 

CCTTGGTAAC AAAGGACTTT GCAAGGGACA TTTTTCCAGT TACAGGAATT TTAAGTTCTT 1620 

TAATGGACTG GACAAGTTGT TCTCGTTCCT TTTCAGTCAG TTGTTTGACT TTTTCAGGAT 1680 

ATCCTTGTAC AAAAAATTCG GCCAAGCGTT CTGGTAACAA GGTTTTTAAA GCGTTTTTCA 1740 

AGGATTTTTC CCGATTTTCT TCTAGAAATG TAACCAAGTC CTTCTCAGAA AGTTGAGGCA 1800 

AAACATCGAG TGAGAGAACC TCCCCACCTT TGACAAAGCT AGACATGCGT AGGGCAGCAG 1860 

GACCTGACAA ACCAAAGTGG GTAAAGAGTA AATCATGAGT GATGACATGC TTACCATAAC 1920 

TTAGGGTCAC ATCGTCCAGA GAAATACCTT GTAAGGCTTT ATGTGGAAAA TCTGTTAATA 1980 

AAGGACTTTC AGCAGCCTCA AGATCGGTGA TGGTATGCTT AAAATGGCGA GCAATCTCX3T 2040 

GACCAAAACC AGTCGAACCA GTCG.^.^CC.^T AAGACTTACC ACCTGTTGTG aCAAtGAGTT 2100 

TCTCACAAGT GAAGGTTTGA TCCGCTGACT TAAGGACAAA CTGGTCATCT ACTTTTTTAA 2160 

CAGAAACGAT TTCTATTTGA GTAGCAACTT GACCACCTAG TTCGGTGATT TTCTTTTCCA 2220 

AAGCTTCGAT AATAGTCCGA GACTTGTCAC TGGCTGGAAA GACGCGTCCG TGGTCTTCGA 2280 

CCTTAAGTTT AACACCATTT TCTGTAAAAA AGTTGATGAT GTCATGATTA TCGAACTGGG 2340 

AGAAAACACT GTAAAGAAAG CGTCCGTTTC CAGGAATTCC AGCTAGCAGG TTGTCTAAGC 2400 

TACCATTGTT GGTCACATTG CAACGTCCCC CACCAGTCCC AGCTAATTTT TTTCCAAGTT 2460 

TCCGATTTTT TTCGATGAGG AGGGTTTTCT GTCCATAAAA GCTACTGGAA ATCGTAGCCA 2520 

TCATACCAGC AGGTCCCCCA CCGATGACT^ TAGTATCAAA ATGTTTCATA GCTCTATTGT 2580 

ACCACAAAAA AACAAGAGAT GATGGTCACC TCTTGTCAAG AATGCAATTA ATCAATTTCA 2640 

TAGCCCATCA GCAAACCGCC CTCTTCTGCA TAGAAACTGC AGAGACCAGA GGTTGGTAGA 2700 

ATTTTAATAT CCGCTTGTGG GAAGGTTTGA CGGATTCGCT CTGAGAGCTG TTGACAACAT 2760 

TTTTCGTTAT TGCGTTGGGC CATGACAATA CGGCCACCAG CATATCCAGC TTTTACTAAC 2820 

TCATCATAGG CAGCTTGAAC TGATTTCTTT GATCCCCTTG CTTTTTGTAG CAATTCGAGA 2880 
GTCCCAGTTT CACTAGCTTT TCCGACCATA CGAATGTTGA GAAGGCCAAC GACCGTACCG 2940

ATAAGCTTGC TCAAACGGCC GTTCTTCACC AAGTTATCGA CTTTGGCTAG GACAAAGAGC 3000 

AACTTAGTTT TTTCTTGATA GGCGGTGATA GCTTCAACCA CTTCTTCAAA AGACAAGCCC 3060 

TGGTCAATCA AGTCATTCAA TTTTTCTACG AGTAGGTCAA CTTCACCACC AGCAGATAAA 3120 

CTATCAATCA CATGAATCTT AGTGTCAGGA TGGTCTTCCA GATAAATATT CTTTGCTAGT 3180 

TGAGCACTAT TGTGACTGCC AGAAAGGGTA CCTGTGATGG TTACTAGGAA AATGTTTTTG 3240 

GCACCTTCAA ATGCTCGCAA ATAGTCATCT GGGCTTGGAC AAGCCGATTT TGAAGCTTCT 3300 

GCAGTTGCAT ACATGGTTTC CATCATTTGG TCAATATCGA GACTGGCGTC ATCAACAAAG 3360 

ACCTGATCAG CTACTTGAAT GGTTAAGGGG ACACTTACAA AGGTTGTGTT AATAGCTGGT 3420 

GTTGGCAGTT GACGATAATC ACAACCAGAG TCAGCAATAA TCTTCCAAGT CATAGAAATT 3480

CTCCATCTTT GTCAGGAACG AT 3502 



(2) INFORMATION FOR SEQ ID NO: 134: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12665 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134: 

CGATTGATTT TTTTAAAGCG TTCGATAGAG AATGAGAAAC GAATCCTTAG CAATGGCGGG 60 

AAAGAATTTG GAGTTGAGAA TACAAAACGA TTAACTATGG CTCATATTGT TTTTTATCTC 120 

TCTTGCTTGG TTGAGGCAAT GGTGCACAAG ACAATTTTTG ATGGCATGGG CATGGTTGGT 180 

TTAGTCTTGC TTATTTTTTC TATGCTGATG TTGATGTTGG TGATTCACTT GTTGGGAGAT 240 

ATTTGGACAG TGAAGCTTAT GCTTGTCAAT AATCACAAAT ATGTAGATCA TATCTTGTTT 300 

AGGACAGTAA AACACCCTAA TTACTTTTTA AATATTCTTC CTGAGTTGAT TGGCTTGACC 360 

TTGTTGAGTC ATGCTTATGT GACTTTTGTT TTAGTTTTTC CAGTTTATGC AGTTATTTTG 420 

TATCGACGAA TAGCTGAAGA GGAAAAGCTA TTACATGAAG TTATAATCCC AAATGGAAGC 480 

ATAAAGAGAT AAATACAAAA TTCGATTTAT ATACAGTTCA TATTGAAGTG ATATAGTAAG 540 

GTTAAAGAAA AAATATAGAA GGAAATAAAC ATGTTTGCAT CAAAAAGCGA AAGAAAAGTA 600 

CATTATTCAA TTCGTAAATT TAGTGTTGGA GTAGCTAGTG TAGTTGTTGC CAGTCTTGTT 660 

ATGGGAAGTG TGGTTCATGC GACAGAGAAC GAGGGAGCTA CCCAAGTACC CACTTCTTCT 720 

AATAGGGCAA ATGAAAGTCA GGCAGAACAA GGAGAACAAC CTAAAAAACT CGATTCAGAA 780 
CGAGATAAGG CAAGGAAAGA GGTCGAGGAA TATGTAAAAA AAATAGTGGG TGAGAGCTAT 840

GCAAAATCAA CTAAAAAGCG ACATACAATT ACTGTAGCTC TAGTTAACGA GTTGAACAAC 900 

ATTAAGAACG AGTATTTGAA TAAAATAGTT GAATCAACCT CAGAAAGCCA ACTACAGATA 960 

CTGATGATGG AGAGTCGATC AAAAGTAGAT GAAGCTGTGT CTAAGTTTGA AAAGGACTCA 1020 

TCTTCTTCGT CAAGTTCAGA CTCTTCCACT AAACCGGAAG CTTCAGATAC AGCGAAGCCA 1080 

AACAAGCCGA CAGAACCAGG AGAAAAGGTA GCAGAAGCTA AGAAGAAGGT TGAAGAAGCT 1140 

GAGAAAAAAG CCAAGGATCA AAAAGAAGAA GATCGTCGTA ACTACCCAAC CATTACTTAC 1200 

AAAACGCTTG AACTTGAAAT TGCTGAGTCC GATGTGGAAG TTAAAAAAGC GGAGCTTGAA 1260 

CTAGTAAAAG TGAAAGCTAA CGAACCTCGA GACGAGCAAA AAATTAAGCA AGCAGAAGCG 1320 

GAAGTTGAGA GTAAACAAGC TGAGGCTACA AGGTTAAAAA AAATCAAGAC AGATCGTGAA 1380 

GAAGCAGAAG AAGAAGCTAA ACGAAGAGCA GATGCTAAAG AGCAAGGTAA ACCAAAGGGG 1440 

CGGGCAAAAC GAGGAGTTCC TGGAGAGCTA GCAACACCTG ATAAAAAAGA AAATGATGCG 1500 

AAGTCTTCAG ATTCTAGCGT AGGTGAAGAA ACTCTTCCAA GCCCATCCCT GAAACCAGAA 1560 

AAAAAGGTAG CAGAAGCTGA GAAGAAGGTT GAAGAAGCTA AGAAAAAAGC CGAGGATCAA 1620 

AAAGAAGAAG ATCGCCGTAA CTACCCAACC AATACTTACA AAACGCTTGA ACTTGAAATT 1680 

GCTGAGTCCG ATGTGGAAGT TA_AAAAAGCG GARCTTGAAC TAGTAAAACA GGAAGCTAAG 1740 

GAACCTCGAA ACGAGGAAAA AGTTAAGCAA GCAAAAGCGG AAGTTGAGAG TAAAAAAGCT 1800

GAGGCTACAA GGTTAGAAAA AATCAAGACA GATCGTAAAA AAGCAGAAGA AGAAGCTAAA 1860 

CGAAAAGCAG CAGAAGAAGA TAAAGTTAAA GAAAAACCAG CTGAACAACC ACAACCAGCG 1920 

CCGGCTCCAA AAGCAGAAAA ACCAGCTCCA GCTCCAAAAC CAGAGAATCC AGCTGAACAA 1980 

CGAAAAGCAG AAAAACCAGC TGATCAACAA GCTGAAGAAG ACTATGCTCG TAGATCAGAA 2040 

GAAGAATATA ATCGCTTGAC TCAACAGCAA CCGCCAAAAA CTGAAAAACC AGCACAACCA 2100 

TCTACTCCAA AAACAGGCTG GAAACAAGAA AACGGTATGT GGTACTTCTA CAATACTGAT 2160 

GGTTCAATGG CGACAGGATG GCTCCAAAAC AATGGCTCAT GGTACTACCT CAACAGCAAT 2220 

GGCGCTATGG CGACAGGATG GCTCCAAAAC AATGGTTCAT GGTACTATCT AAACGCTAAT 2280 

GGTTCAATGG CAACAGGATG GCTCCAAAAC AATGGTTCAT GGTACTACCT AAACGCTAAT 2340 

GGTTCAATGG CGACAGGATG GCTCCAATAC AATGGCTCAT GGTACTACCT AAACGCTAAT 2400 

GGTTCAATGG CGACAGGATG GCTCCAATAC AATGGCTCAT GGTACTACCT AAACGCTAAT 2460 

GGTGATATGG CGACAGGTTG GGTGAAAGAT GGAGATACCT GGTACTATCT TGAAGCATCA 2520 



WO 98/18931



PCT/US97/19588 



910 

GGTGCTATGA AAGCAAGCCA ATGGTTCAAA GTATCAGATA AATGGTACTA TGTCAATGGC 2580 

TCAGGTGCCC TTGCAGTCAA CACAACTGTA GATGGCTATG GAGTCAATGC CAATGGTGAA 2640 

TGGGTAAACT AAACCTAATA TAACTAGTTA ATACTGACTT CCTGTAAGAA CTCTTTAAAG 2700 

TATTCCCTAC AAATACCATA TCCTTTCAGT AGATAATATA CCCTTGTAGG AAGTTTAGAT 2760 

TAAAAAATAA CTCTGTAATC TCTAGCCGGA TTTATAGCGC TAGAGACTAC GGAGTTTTTT 2820

TGATGAGGAA AGAATGGCGG CATTCAAGAG GCTCTTTAAG AGAGTTACGG GTTTTAAACT 2880 

ATTAAGCCTT CTCCAATTGC AAGAGGGTTT CAATCTCTGC CAGGGTGCTG GCTTGCGAAA 2940

TGGCTCCACG GAGTTTGGCA GCGCCAGATG TTCCACGGAG ATAGTGAGGA GCGAGACCGC 3000 

GGAATTCACG AACTGCGACG TTTTCTCCTT TGAGGTTAAT CAATCGTTTC AAGTGTTCGT 3060 

AGGCGATCTT CATCTTGTCT TCAAAGGTCA AATCAGGTAG GATTTCTCCT GTTTCAAAGT 3120 

AATGGTTGAT TTGGTTGAAG AGGTAAGGAT TTCCCATGGC AGCTCGGCCA ATCATGACTG 3180 

CGTCAGCACC AACTTCTTCG ATGCGTTGCT TGGCTTCTTG GACAGTACGG ATATCACCGT 3240 

TGGCGATGAA TGGAATCTTG GTTAGAGCTT GGGCAACCTT GTAAAGGGTC TCAAGGTCTG 3300

CGTGGCCAGT ATACATTTGT TCACGGGTAC GGCCATGCAT GGCGAGGGCA GAAACACCTG 3360

CAGCTTCAGC AGCGAGAGCA TTTTCTACTG CAAGAGATGG GTCCGCCCAG CCGGTACGCA 3420

TTTTGACAGT AAGTGGGATA TCAAGGACAG ACTGGACCTT GTTGATGATG GAGTAAATCT 3480

TGTCTGGATC CTTGAGCCAC ATAGCACCAG CTTCGTTCTT CACGATTTTG TTGACAGGGC 3540 

AGCCCATGTT GATATCGACG ATATCGGTCT TGGTGTTTTC TTGGATGAAT TCTGCTGCGC 3600

GTGCTAGGCT GTCTTCATCG CTACCAAAAA GTTGGATAGA GACAGGGTTT TCGCCCTCAT 3660

CGATATGAAG CATGTGCAGG GTTTTTTCGT TGTTGTATTG GATTCCCTTG TCAGAGACCA 3720 

TTTCCATTAC AACGAGTCCA GCTCCGAGCT CCTTTGCGAT AGTACGAAAG GCTGAGTTGG 3780 

TCACGCCAGC CATAGGCGCT AAAACGGTAC GATTGGGAAT CTCAATATTG CCAATCATAA 3840 

AAGGTGTATT AAGATTTGTC ACGAATGAGT TCCTCCAGGT CCTTTTCATC AAAGTTGTAA 3900

GTAGTTTGGC AGAATTGACA AGTGATTTCT GCCCCGTGGT CTTCCTCTTT CATTTCCTGT 3960

AAGTCTGAGC TTGGAAGGCT GGCAAGAGCG TTCATAAAGC GTTCATGGCT ACAGTCACAT 4020 

TGGAAACGGA TTTCTTCTTC AGAAAGACGC TTGTAGGCTT CGTCCCCGTA GATAGCCTTG 4080 

AGGAGGGCTT CGATATGGTC GTCGCTTTCG AGAAGAGTAG AGATAGCTGG CATTTCTTGG 4140 

ATGCGTTTTT CAAAGCGAGC AATCTCTTCT TTCTTGGCTC CTGGCAAGAC TTGAACTAGG 4200 

AAACCACCTG CAACCTTGAC CTTGTCTTCC TCGTCCAAAA GGACATTGAG GCCGACCGCT 4260 

GAAGGCGTTT GTTGGCTTTC AGTAAGGTAA AAGGCAAGGT CTTCACCGAT TTCTCCAGAG 4320 
ATGAGGGGAG TTATAGAGTT GTAAGGATTT CCAGTACCGT AGTCTGTGAT AACGAGGAAT 4380

TGACCATTTC CAACAAAAGG TCCGACTAGG ACTTCACCAG TCGCAGTCTT TTTGATGTCA 4440 

ACACCAGGAT TTTGAACATA GCCTTTGACG TTCCCCTTGG TATCAGCGAC GGTGATAATA 4500 

GCACCTAGAG AGCTAGATCC CAACACCTTA ACTGTAAGTT TGGTATTTCC TTTTTCATTG 4560 

GCTGCGAGAA TCTGGCTAGC GATAAGAGTT CGACCAAGCG CTACAGTTGA GCTAGCTTGG 4620 

GTTTGATGTT TTTCTTGAGC AGTGCGGACG GTTTCAGTGC TATCAAGGAC AAAAGCACGA 4680 

AAGGcTCCGC TTTCTGATAT AGTTTTAATA ATTTTATCCA TAGCTACTAT TTTAGCATAA 4740 

AAATGCCCAA AGGGGGAGCC GTGTGTTTAC TGATTTTCAG GATAATGGAC CAGGAAATCA 4800 

GCATGAAAAT AAAAAGAGAA ACAGATTATT TTAGCATTTG TCAGATTTAT GCTATGCTTA 4860 

AGGTAGAAAA TGAAAGGGAT AACAAATGTA TTTAGGAGAT TTGATGGAGA AAGCCGAGTG 4920 

TGGTCAATTT TCAATACTTT CCTTTCTATT ACAAGAGTCT CAGACGACCG TCAAGGCTGT 4980

AATGGAAGAA ACAGGATTTT CAAAAGCAAC CCTAACCAAA TATGTCACCC TGCTCAATGA 5040 

CAAGGCTTTG GATAGTGGCT TAGAGCTGGC TATTCACTCA GAAGATGAAA ATCTGCGTCT 5100 

GTCTATCGGT GCAGCTACCA AGGGGAGAGA TATTCGGAGC TTGTTTTTGG AGAGTGCTGT 5160 

TAAATACCAG ATTTTGGTTT ATCTTCTCTA CCACCAACAG TTTTTAGCCC ATCAGCTGGC 5220 

TCAAGAATTG GTGATTAGCG AGGCTACGCT TGGTCGTCAC TTGGCTr^GTT TAAATCAGAT 5280

TTTGTCAGAA TTTGATTTAT CCATCCAAAA TGGCCGTTGG CGAGGTCCAG AGCATCAGAT 5340 

TCACTATTTC TATTTCTGTC TTTTCCGAAA GGTCTGGTCG AGTCAGGAAT GGGAAGGTCA 5400 

CATGCAGAAA CCAGAGAGAA AACAGGAGAT TGCCAATTTA GAGGAAATCT GCGGTGCAAG 5460 

TTTGTCTGCG GGGCAGAAAT TGGACTTGGT TCTCTGGGCT CACATCAGTC AACAACGTCT 5520 

TCGGGTCAAT GCTTGTCAGT TTCAAGTCAT AGAAGAGAAA ATGCGAGGGT ATTTTGACAA 5580 

TATCTTTTAT CTTCGTTTGC TGAGAAAGGT TCCGTCCTTT TTTGCTGGGC AACATATTCC 5640 

ACTAGGAGTT GAGGATGGTG AGATGATGAT ATTCTTCTCT TTTCTCCTAT CTCATCGCAT 5700 

TCTTCCTCTT CATACTATGG AGTATATTCT TGGTTTTGGA GGGCAGTTGG CAGATTTACT 5760 

GACGCAATTG ATTCAAGAAA TGAAGAAGGA GGAACTATTG GGGGATTATA CAGAGGACCA 5820 

TGTCACCTAT GAACTCAGTC AGCTTTGTGC TCAAGTCTAT CTCTATAAGG GCTATATTTT 5880 

ACAGGATCGC TACAAGTACC AGTTAGAGAA TCGTCATCCA TATTTACTGA TGGAACATGA 5940 

TTTTAAAGAG ACAGCAGAGG AGATTTTTCA TGCTCTACCT GCTTTTCAAC AGGGGAGAGA 6000 

TTTAGATAAG AAGATTCTCT GGGAATGGCT CCAGTTAATC GAATATATGG CTGAAAACGG 6060 
TGGCCAGCAT ATGCGGATTG GTCTGGATTT GACATCTGGT TTTCTTGTCT TTTCAAGGAT 6120

GGCAGCCATT TTGAAACGGT ATTTGGAATA CAATCGTTTT ATTACCATTG AAGCTTATGA 6180 

CCCTAGTCGG CATTATGATT TGCTGGTTAC CAATAACCCG ATTCATAAGA AGGAACAGAC 6240 

ACCAGTCTAT TATTTAAAAA ATGACTTGGA TATGGAGGAT TTGGTAGCGA TTCGCCAGTT 6300 

ATTATTCACT TAAAAGGCTT GGTTAATCCA GGTCTTTTTT GTGAAATTCA CACAATCTCC 6360 

TCACATTTTT TTAAAAATTA AAAAAAGTTG ATAAACAAGA AAGCGCTTTA TTTTGTATAC 6420 

TAGTAAGTGT AAAGAGGAAA CACCTCAAGA TCTTTATCAG GAGGACAGTA CATGTCACAA 6480 

GAAAAATACA TCATGGCCAT TGACCAGGGA ACTACAAGTT CTCGTGCCAT CATTTTCAAC 6540 

AAAAAAGGGG AAAAGGTTAG CTCGAGTCAA AAAGAGTTTA CCCAGATTTT CCCTCAGGCA 6600 

GGTTGGGTTG AGCACAATGC CAATGAAATT TGGAACTCTG TTCAGTCAGT TATTGCGGGT 6660 

GCTTTCATCG AAAGTGGTGT CAAGCCAAAT CAAATCGAGG CAATCGGGAT TACCAACCAA 6720 

CGTGAAACAA CGGTTGTCTG GGATAAGAAA ACAGGACTTC CTATCTACAA TGCTATCGTT 6780 

TGGCAGTCAC GCCAGACAGC ACCTTTGGCT GAGCAACTAA AAAGCCAAGG TTATGTGGAA 6840 

AAATTCCATG AAAAGACTGG TTTGATTATT GATGCTTACT TCTCTGCTAC CAAGGTTCGT 6900 

TGGATTTTGG ATCATGTAGA AGGTGCTCAA GAGCGAGCAG AAAAAGGGGA ATTGCTCTTT 6960 

GGTACTATCG ATACTTGGTT GGTTTGGAAA TTGACTGACG GTGCGGCTCA CGTGACTGAC 7020 

TACTCAAATG CAGCTCGTAC CATGCTTTAT AACATTAAAG AACTCAAATG GGATGATGAG 7080 

ATTTTGGAAA TCCTTAACAT TCCGAAGgCT ATACTTCCAG AAGTTCGTTC TAACTCCGAA 7140 

ATCTACGGCA AGACAGCTCC ATTCCATTTC TACGGTGGAG AGGTGCCAAT CTCAGGTATG 7200 

GCTGGGGACC AACAAGCAGC CCTCTTTGGA CAGTTGGCTT TTGAGCCAGG TATGGTTAAG 7260 

AATACTTATG GAACAGGCTC TTTCATCATC ATGAATACTG GGGAAGAGAT GCAGTTGTCT 7320 

GAAAACAACC TCTTGACAAC CATTGGTTAC GGAATCAACG GTAAGGTTTA TTATGCCTTG 7380

GAAGGTTCTA TCTTCATCGC AGGAAGTGCT ATTCAGTGGC TTCGTGACGG TCTTCGCATG 7440 

GTTGAAAATT CACCAGAATC TGAAAAATAC GCTCX3TGATT CTCACAACAA CGATGAAGTT 7500 

TATGTCGTTC CAGCCTTTAC AGGTCTAGGC GCTCCATACT GGAACCAAAA TGCTCGTGGT 7560 

TCCGTCTTTG GTTTGACTCG TGGAACAAGC AAAGAAGACT TTATCAAGGC GACTTTGCAA 7620 

TCTATTGCTT ATCAAGTGCG TGATATCATC GACACCATGC AAGTGGATAC TCAGACCGCC 7680 

ATTCAAGTAC TGAAGGTGGA TGGTGGTGCA GCCATGAACA ACTTCCTCAT GCAGTTCCAG 7740 

GCGGATATTT TAGGCATTGA CATTGCACGT GCTAAAAACC TGGAAACAAC AGCTCTAGGA 7800 

GCGGCCTTCC TAGCAGGTTT GTCAGTAGGG TACTGGAAAG ACTTGGACGA GTTGAAACTC 7860 
TTGAACGAGA CAGGAGAACT CTTTGAGCCA TCTATGAACG AATCTCGCAA GGAACAACTC 7920

TACAAGGGCT GGAAGAAGGC TGTGAAAGCA ACTCAAGTCT TTGCGGAAGT AGACGACTAA 7980 

TACTGGCAGA ATAAAGCGAT TTATTTAGAA AGTGTGTAAA TATGGAATTT TCAAAGAAAA 8040 

CACGTGAATT GTCAATTAAA AAAATGCAGG AACGTACCCT GGACCTCTTG ATTATCGGTG 8100 

GAGGAATCAC AGGAGCTGGT GTAGCCTTGC AGGCGGCAGC TAGCGGTCTT GAGACTGGTT 8160 

TGATTGAAAT GCAAGACTTT GCAGAAGGAA CATCTAGTCG TTCAACAAAA TTGGTTCACG 8220 

GAGGACTTCG TTACCTCAAA CAATTTGACG TAGAAGTGGT CTCAGATACG GTTTCTGAAC 8280

GTGCAGTGGT TCAACAAATC GCTCCACACA TTCCAAAATC AGATCCAATG CTCTTACCAG 8340 

TTTACGATGA AGATGGAGCA ACCTTTAGCC TCTTCCGTCT TAAAGTAGCC ATGGACTTGT 8400 

ACGACCTCTT GGCAGGTGTT AGCAACACAC CAGCTGCGAA CAAGGTTTTG AGCAAGGATC 8460 

AAGTCTTGGA ACGCCAGCCA AACTTGAAGA AGGAAGGCTT GGTAGGAGGT GGAGTGTATC 8520 

TTGACTTCCG TAACAACGAT GCGCGTCTCG TGATTGAAAA CATCAAACGT GCCAACCAAG 8580 

ACGGTGCCCT CATTGCCAAC CACGTGAAGG CAGAAGGCTT CCTCTTTGAC GAAAGTGGCA 8640 

AGATTACAGG TGTTGTAGCT CGTGATCTCT TGACAGACCA AGTGTTTGAA ATCAAGGCCC 8700 

GTCTGGTTAT TAATACAACA GGTCCTTGGA GTGATAAAGT ACGTAATTTG TCTAATAAGG 8760 

GAACGCAATT CTCACAAATG CGCCCAACTA AGGGAGTTCA CTTGGTAGTA GATTCAAGCA 8820

AAATCAAGGT TTCACAGCCA GTTTACTTCG ACACAGGTTT GGGTGACGGT CGTATGGTCT 8880 

TTGTTCTCCC ACGTGAAAAC AAGACTTACT TTGGTACAAC TGATACAGAC TACACAGGTG 8940 

ATTTGGAGCA TCCAAAAGTA ACTCAAGAAG ATGTAGATTA TCTACTTGGC ATTGTCAACA 9000 

ACCGCTTCCC AGAATCCAAC ATCACCATTG ATGATATCGA AAGCAGCTGG GCAGGTCTTC 9060 

GTCCATTGAT TGCAGGGAAC AGTGCCTCTG ACTATAATGG TGGAAATAAC GGTACCATCA 9120 

GTGATGAAAG CTTTGACAAC TTGATTGCGA CTGTTGAATC TTATCTCTCC AAAGAAAAAA 9180 

CACGTGAAGA TGTTGAGTCT GCTGTCAGCA AGCTTGAAAG TAGCACATCT GAGAAACATT 9240 

TGGATCCATC TGCAGTTTCT CGTGGGTCTA GCTTGGACCG TGATGACAAT GGTCTCTTGA 9300 

CTCTTGCTGG TGGTAAAATC ACAGACTACC GTAAGATGGC TGAAGGAGCT ATGGAGCGCG 9360 

TGGTTGACAT CCTCAAAGCA GAATTTGACC GTAGCTTTAA ATTGATCAAT TCTAAAACTT 9420 

ACCCTGTTTC AGGTGGAGAA TTGAACCCAG CAAATGTGGA TTCAGAAATC GAAGCCTTTG 9480 

CGCAACTTGG AGTATCACGT GGTTTGGATA GCAAGGAAGC TCACTATCTG GCAAATCTTT 9540 

ACGGTTCAAA TGCACCGAAA GTCTTTGCAC TTGCTCACAG CTTGGAACAA GCGCCAGGAC 9600 
TGCACCCCAT GGACCCCCCA ATAAAGCACC TATCCTACCA ATCATATAAC TGATTCCAGC 11460

ACCAGTCATG AAGTTAGCGA ATGTGTTAGC TTGTTTATTC CCATGTATTG TGTTGACGTA 11520 
ATTCCAAACA TTAGGATCGT ATGATCTAAA AGATATATTT AGGTCGATTT CATTCTTTTG 11580

ATAAGCCATA TAAAATGCCC CATTGATATA GACGCCGTCA GCACGTCGTT CAATAGTGTC 11640 

TACACTTCCA TCTGGATTGA CAACCTCAAG AACTTCATCG CTTAAAATAT TTACTTGCGT 11700 

ATCTCCGAAC CGCACTGATG AGCCATTCTC AAACTGAGCC TCACCAGATA CAACTTTAGA 11760 

GTTTGCCGAT AAGCTATCAT CAGCAAAAAC AAACAAGCGA CGGGGAAATG CTAGACATAC 11820 

AGAAAACAGA CATAACTAGC AAACACATGC ATTTAAACAT CTTAGACATA ACGGAAACTC 11880 

CTTTGTATTT TTGATTTTTT TCAACTTTTA TTATACAATA AAACCAAATA AAAAGAAAGC 11940

GGTAACAATA TGCTTAATGC GAAAATTTTT TATATATTTT TATGTTTGAT CGTTATCGAA 12000 

ACTACAGGCT TGTTGTTGTT GAAAAGAGGT CTCGAAATGG GTTATTTAGA CACAGAAGCT 12060 

ATTATCCTCG CAGTTTTTTC ATTTGCTTTT TACAACCTAT GTTCATTCGC TTGGGTCTGC 12120 

TCTACAATAA AAAACAATAA AAAATAAATA GACGTATTTT CAAAAAAAAC maAATGCATA 12180 

TTTATATTAG CAAAACGACG ATTTAAATCG TCGTTTTTTT GTAGTACGAC GGGCATGTCG 12240 

TATATCTGAG GTGTAAGTCC TCAGCCTGAC TATCGTGAGG TAGCAGGGAG AGGAAGGGAT 12300 

AGCGAAATCG TGGCTCTACG AJVCAGGAACG TnATAGTAAG GCGTATATAC CGGATAAGGA 12360 

GGCTTCAAAC TCTAAAGTCC AAAAAGGTAG TCGTAACCTA TATGTGTAAA TCACGAGAGT 12420 

AATTGAATTC GGACTAAGGT TTGTGTGAAA AAGATAAATC TTTCTAGAGT CTAAAGACTC 12480 

TGCGTCAGAT TTCCTATTTT CACTGTAACC TTTTAACGTC CTCATATCTT GTATAAACGA 12540 

GGAAAGATGT ACGACTTATC CCGTGAGGTT TCATGAGCGT GAAAGCGTAG TAACAACGAA 12600 

TCATGAGAAG TCAGCCGAGC CCATAGTAGT GAGGAAACTT CCGTAATGGA AGTGGAGCGA 12660 

AGGGG 12665 
(2) INFORMATION FOR SEQ ID NO: 135: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5305 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135: 

CGCTAATCAC TACAATCATT TTATTGTACT TTTTCACTCT CAAGAAAAGC AAGAAGTATT 60 
CATTTTAGTT TCATTTAGTA TTATTTTGCA TACCTAAAAT ACAGTAAAAA ATCAGTCATC 120

TTGGTATGCT CCTGCTTTCA CTATTCAACA CGTTTTTGAC TTATACTAGG CTCATTTCCA 180 

AAAGCATTAT ATAATAGTGA TATGAAACCA ACTAAACTAA ACAAGAAATA TAAGCAATAA 240 

AAATTCGTTT AAAAGATCTT ACTAAAGCTA ATACTAAATA AAAATAAAAG AGTAAACTAG 300 

GAAGTTTATT TCAAACAACC TAAAATACTG ATTTTCGGCT GAAGATAATA CTGGAGTGCA 360 

AATTAATGGG GTTATAATAA ATAGCTGATA GCTTGTGTTG GTTTTGGATT TTTTAAGAGT 420 

AGATGAGTAT TAAAACTATA AGGAGGACGA AGGTGGCTAA AAATTTAAAA TTAAAATTAG 480 

CTCGGGTAGA GCGTGATTTA ACACAAGGTC AACTGGCAGA GGCTGTCGGG GTGACACGCC 540 

AGACTATTGG TTTAATAGAG GCGGGAAAAT ACAATCCCAG TCTCTCGCTC TGCCAGTCTA 600 

TTTGCAGATG TTTAGGGAAA ACCCTAGACC AACTATTTTG GGAGGAAGAA GATGAAAAAT 660 

AGATTTTATT ATTCTCAATT ACTAGACGAA AGAGAAGAAC AACTGTTCAA TAAAGCGGGC 720 

TCTGAAAGTT TCTATATCTG CATTGCTTTG TCGCTCCTAT CTTATATCAT TTCAGTATTA 780 

GCACCAAGCC TTTTTAATTC T/ATATGCTG CTAATCGTTA TCATCATAGG GACATTTTAC 840 

TTTTTCAATC GTGCCCGTTA TCTGGGAGTG ACCTACTATG GTCGTTTTCA TTTTACGATT 900 

TTGGGTTGTT TTTTCCTAAC CTTGGCTATT ACGGCTCTTT TGATGTTGCA GAATTATCAA 960 

TTCAACATAG AAATTTATCA GCACAATCCT TTGAATTTTA AATACCTGTC TGCTTGGGTC 1020 

ATTACTTATA TCA'rrrACCT TCCGTGGATC TTTATTGGCA ATCTTGGTCT TAAGAGCTAT 1080 

GGCGAATGGG CTCAGAAAAA ATTTGAACAA GATATGGATG AATTGGAGAG TGGAGAATAG 1140 

CTTGTTACTC TTTTCTCAAT CCAGCTAAAA TGTGATATAA TAGTACTAAT TTATTGGAAT 1200 

ACATGAAAGT TCTTGAAAAT TTTCATGGGT TTCTAGCTAA GGAAGTAGGA AAAGTATGTA 1260 

TCCAGATGAT AGTTTGACAT TGCACACGGA CTTGTACCAG ATCAACATGA TGCAGGTTTA 1320 

CTTTGACCAA GGGATTCACA ATAAGAAGGC GGTCTTTGAG GTGTATTTCC GCCAACAGCC 1380 

TTTTAAGAAC GGCTATGCGG TTTTTGCAGG TTTAGAAAGA ATTGTGAACT ATCTTGAAGA 1440 

CTTGCGTTTT TCAGATAGTG ATATAGCCTA TTTGGAGTCG CTTGGTTATC ATGGGGCGTT 1500 

CTTGGATTAC CTTCGCAATT TCAAGTTGGA GTTGACCGTT CGTTCTGCCC AAGAAGGGGA 1560 

TTTGGTTTTT GCTAATGAAC CGATTGTGCA GGTGGAAGGA CCTCTAGCCC AATGTCAGTT 1620 

GGTCGAAACG GCTCTTTTGA ACATCGTCAA CTACCAGACT TTGGTGGCGA CGAAGGCAGC 1680 

TCGTATTCGT TCGGTTATCG AAGATGAACC CTTGATGGAG TTTGGGACAC GTCGGGCTCA 1740 

AGAAATGGAT GCGGCCATCT GGGGAACACG CGCAGCTGTG ATTGGTGGCG CCAATGGAAC 1800 

CAGCAACGTG CGTGCGGGTA AGCTCTTTGA CATTCCTGTT TTGGGAACCC ATGCCCATGC 1860 
CTTGGTACAG GTTTATGGCA ATGACTATGA AGCTTTCAAG GCTTACGCTG CGACCCACAA 1920

AAATTGTGTC TTTCTTGTGG ATACCTATGA CACCCTTCGC ATCGGTGTAC CAGCTGCCAT 1980 

TCAGGTGGCG CGTGAGCTGG GTGATCAGAT TAACTTTATG GGTGTGCGGA TTGACTCTGG 2040 

GGATATTGCC TACATTTCTA AGAAAGTCCG TCAGCAACTG GATGAGGCTG GATTTACAGA 2100 

GGCTAAGATT TATGCTTCTA ATGATCTAGA TGAAAATACC ATCCTTAACC TCAAGATGCA 2160 

AAAGGCCAAG ATTGATGTCT GGGGTGTGGG TACCAAGCTG ATTACAGCCT ATGACCAGCC 2220 

GGCTCTTGGG GCGGTTTACA AGATTGTTGC AATCGAAGAT GAAACTGGTC AGATGCGCAA 2280 

TACGATTAAG CTGTCTAATA ATGCTGAAAA AGTTTCTACG CCAGGTAAGA AGCAGGTGTG 2340 

GCGCATTACC AGTCGTGAAA AAGGCAAGTC AGAAGGCGAC TATATCACTT ATGATGGTGT 2400 

GGATATTAGC GACATGACAG AAATCAAGAT GTTCCATCCG ACCTATACAT ACATCAAGAA 2460 

GACGGTTCGT AATTTTGATG CCGTTCCTCT CTTGGTGGAT ATCTTCAAAG AAGGAATATT 2520 

AGTTTACAAC TTGCCTAGTT TGACTGACAT TCAGGATTAT GCCCGTAAGG AATTTGACAA 2580 

GTTGTGGGAT GAGTATAAGC GTGTGCTCAA TCCGCAGCAC TATCCAGTGG ATTTGGCGCG 2640 

TGATGTATGG CAAGATAAGA TGGACTTGAT TGATAAGATG CGCAAGGAAG CCCTTGGTGA 2700 

AGGAGAAGAA GAATGAGTTT GCAAGAAACG ATTATCCAAG AGCTGGGTGT CAAACCAGTG 2760 

ATTGATGCCC AGGAAGAAAT CCGTCGTTCT ATTGATTTCT TAAJ^GATA TCTGAAABAA 2820

CATCCCTTCC TAAAAACCTT TGTACTAGGG ATTTCTGGGG GACAAGACTC AACCTTGGCA 2880 

GGACGTTTGG CGCAATTAGC TATGGAAGAA CTGCGAGCTG AAACGGGAGA CGATAGCTAC 2940 

AAATTTATCG CTGTCCGCCT GCCATACGGA GTGCAAGCTG ATGAAGCAGA TGCTCAAAAA 3000 

GCCCTAGCCT TCATCCAGCC AGATGTCAGC TTGGTTGTGA ATATCAAGGA ATCAGCTGAT 3060 

GCCATGACAG CTGCAGTTGA AGCGACAGGT AGTCCTGTTT CAGACTTCAA CAAGGGGAAT 3120 

ATCAAGGCAC GTTGCCGTAT GATTGCTCAG TATGCCCTTG CTGGTTCCCA TAGCGGAGCG 3180 

GTCATTGGAA CAGACCACGC CGCGGAAAAT ATCACAGGTT TCTTTACCAA GTTTGGTGAC 3240 

GGCGGTGCGG ATATTCTCCC TCTTTACCGC CTCAATAAAC GCCAAGGAAA ACAGCTCTTG 3300 

CAGAAACTTG GCGCAGAGCC AGCCCTTTAT GAAAAAATCC CAACGGCAGA CCTAGAAGAA 3360 

GATAAACCAG GCCTAGCTGA CGAAGTCGCA CTTGGAGTCA CCTACGCAGA GATTGACGAC 3420 

TACCTAGAAG GCAAAACAAT CAGCCCAGAA GCTCAAGCGA CCATTGAAAA CTGGTGGCAC 3480 

AAAGGCCAAG ACAAACGCCA CTTACCCATC ACCGTATTTG ATGACTTTTG GGAGTAAAAA 3540 

GGTCCGGGGG ACCTTTTTAG CTTCTTGCCC TGAAATTAAA AAGCAAGAAA AACCTCCACT 3600 




GGAGGTTTTC AGCCTCTCAT CTTGAAATAA GAAAGTGAGA GAAGGTCTGG GGGATCTTGA 3660 

ACCCCGAGTT TAGAAATAAG AAAATGAGGC AGATTCAGTA ACTCGAAGAG TTCGATTTCA 3720 

TCGTCTTACC CCTGCAACGA TGACTAGGTT TGAAAAAGCT TGCTAGAGCG CATTTCAAAC 3780 

CAGGCAGCAA CTGCGTCAAG AAATTAGAAG ACAAACTCGT TTTCTAGCTG TTACTGAGTT 3840 

GAGCCTTTTT ACTACGAGTA TAGAAATAAG GAAGTGAGGT AGCATCATGA AATCTATCGG 3900 

TACGCAAATA TTACAGACAG AACGTTTGAT TTTAAGAAGA TTTGTGGAGA GTGATGCAGA 3960 

AGCCATGTTT CAAAATTGGG CTTCATCCGC TGAGAATCTG ACCTATGTTA CCTGGGATCC 4020 

CCATCCTGAT GTCGAAATCA CTCGAAACTC GATTTGCAAT TGGGTTGCTT OCTATACTAA 4080

TCTCAACTAT TATAAATGGG CCATTTGTCT AAAAGAAAAC CCAGAGCAAG TAATAGGAGA 4140 

TATCAGCATT GTTAAGATAG ACGAGGCTGA TTTAAGCTGT GAAATTGGCT ATGTGTTAGG 4200 

CAAGGCTTAC TGGGGAAATG GTATGATGAC AGAGACTTTG AAAGCTATCT TGGACTTTTG 4260

TTTTACTCAA GGAGGTTTTC AAAAGGTCAG AGCACGTTAT GCCAGTCTCA ACCCAGCTTC 4320

AGGTCGTGTC ATGGAAAAGG CTGGAATGTC CTATCTACAA ACCATTGTTA ATGGTGTAGA 4380 

GAGAAAAGGC TATCTTGCGG ATCTTATTTA TTATGGTATA AGTAGGGAAG AATGTTGAAT 4440 

TCTATTTTCT GTTTCTATCG AAGTCAACTA TTTATTGTAA ATATAATAAT TAGCATTCCA 4500

AGTTTATTTG AAACTTTAAA ATAGCATATT GATTAGTACA AGACAGATGT TCTAGTTCCT 4560 

TCTTTAATCT GGTTTAGTGT TAGTTAAAAA ATCGCTTTAA GCTTGTAACT AAGAGGGAGC 4620

TAATCGACTA GATTCTCCAG CCGAACAGGl GGTAATGTAC TTTTTATAGT GTAATCCTAG 4680

CTGTTGTTAA ATTTAAAATA GAATCCTCTA TCGAGTTAGG GAATTAAATT CAACCAATTT 4740 

TATTCATGTT TTTTCTATCA AATTATCTAA TATTAAAATA GTCTCATTCT GATGAGAAAA 4800

CTATTCCCAA ATCATTCATA CCTCTCTCAA CTAGATGTAA CTTACAAAAC CCCTGACCTC 4860 

ATGAGCCACT TTCTTCCTCC TCATGAGGTC AGTTTTACTT TCTGCTGTTC CAGTATCGTT 4920 

TTTCCTCGCT AGATTTCCTC AAAAGGGCAG ACTCCTCCCT TGGTGCGTCA CACGATTTTT 4980 

TCATCTCGAC TGTTCTTTAA TGCATCATTA ACGACGCTTT TCTTCTAGGT GGTTCATAAG 5040 

GAACAGGAAG ATTCAGGTTG ACTTTTCTAA TCCTAGAATA AAGTGCTGAA AACAATTCGG 5100 

AATAGGCATA GAGACTAGAC AATTTGAGGA GCTGCTTGCG TCCTGTTCGA ACACATTTTC 5160 

CCACCACGTG AAGAAAAAGA TGGCGGAAGC GTTTGATTGT TAAAGTTTGG AAGTCACCTC 5220 

CAGCTAGATG TTTGAGAAAA AGATAGAGAT TGTAGGCGAT ACAGCTCATC ATCATACGAA 5280 

CTTCGTTTTT GATTAAGGTT GAACT 5305 
(2) INFORMATION FOR SEQ ID NO: 136: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3964 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136: 

TGGCAGCTCG TCGTCGTAAA GGACGCAAAG TTTTGGCTGC ATAATCCAAA CGAATTCTAT 60 

CAAAAATCAG TAGGAACTCG AGTCTACTGA TTTTTATTTT TGTAAAAAAG TTCAGTAGAT 120 

GCAAATGGAT TCGGAAGCGA TGTTACAGTA GATTGAAACT AGAATAGTAC ACCTCTGTTT 180 

CTAAAACATT GTTAGAAATC GATTTGACTG TCCTGATCGA TTTGTCCTGT TATTATTTTA 240 

TTTTACTATA AAGTTGAAGT AGGTGGAGAT GGTACAGCAA CAATCGTCTT TAAAGATGGT 300 

TCAGCTATTA CAATTCCAGG AAATCAATTG GTAGCACAAG ATCCAAAAGC ACAAGATAGC 360 

ACTAAACTGA CTGCTGAAAA ATCAACTGTT AAAGCACCTG CTCAAAGAGT AGATGTAAAA 420 

GATATAACTC ATTTAACAGA TGAAGAAAAA GTTAAGGTTG CTATTTTACA AGCAAATGGT 480 

TCAGCATTAG ACGGAGCGAC AATCAATGTA GCTGGAGATG GTACAGCAAC AATCACATTC 540 

CCAGATGGTT CAGTAGTGAC GATTCTAGGA AAAGATACAG TTCAACAATC TGCGAAAGGT 600

GAATCTGTAA CTCAAGAAGC TACACCAGAG TATAAGCTAK AAAATACACC ACCTCGAGAT 660

AAGGGAGGCA ATACTGGAAG CTCAGATGCT AATGCGAATG AAGGCGGTGG TAGCCAGGCG 720 

GGTGGATCAG CTCACACAGG TTCACAAAAC TCAGCTCAAT CACAAGCTTC TAAGCAATTA 780 

GCTACTGAAA AAGAATCAGC TAAAAATGCC ATTGAAAAAG CAGCCAAGGA CAAGCAGGAT 840 

GAAATCAAAG GCGCACCGCT TTCTGATAAA GAAAAAGCAG AACTTTTAGC AAGAGTGGAA 900 

GCAGAAAAAC AAGCAGCTCT CAAAGAGATT GAAAATGCGA AAACTATGGA AGATGTGAAG 960 

GAAGCAGAAA CGATTGGAGT GCAAGCCATT GCCATGGTTA CAGTTCCTAA GAGACCAGTG 1020 

GCTCCTAATG CTGCTCCTAA GACAACAAGT GCACCGCAAG CAACTGCAGG AACAATGCAA 1080 

GATGTTACCT ACCAGTCACC TGCTGGCAAA CAATTACCTA ACACAGGTTC AGCATCAAGT 1140 

GCAGCACTTG CTAGTCTTGG TCTAGTGGTG GCAACAAGTG GTTTTGCTTT GCTAGGAAGA 1200 

AAGACTAGAC GTAGAAAATA GAACAGCTAG AAAATTCTAT TCTCTACTTA AAGTTAGATT 1260 

ATAAGGGGGA TTTTGAGAAG TCATCAATCC TAGTGATGGG TGAGAAAAGT GAGAACCCAA 1320 

GATAATCACA TACTTTAGCT GAATAGGAAT ATTCTATCAA TGTAGCCAAT CTCTTCTGTC 1380 

TCTAACTGTG GAATAGGAGA TGGGCAATAT CGGATAGAAA AGATAGCAGA ATAGCTCTCT 1440 
ATTGAAGAGA GGAGGGGAAA CCGAAAAATT AGGTGCCCCT CCTCTTTTTT GGTATAATAG 1500

AAGATAGAAA ACGAGGTTAG AAGAGATGAT TTTTGATACA CATACACACT TGAATGTAGA 1560 

AGAATTTGCA GGTCGTGAGG CAGAAGAAAT TGCCTTGGCT GCTGAGATGG GTGTGACACA 1620 

GATGAATATT GTTGGTTTTG ATAAACCGAC GATTGAGCAT GCCTTGGAGT TGGTAGATGA 1680 

GTATGAGCAG CTCTATGCGA CTATTGGTTG GCATCCTACA GAAGCTGGTA CTTATACAGA 1740 

GGAAGTTGAG GCTTACTTGT TGGATAAGTT AAAACATTCC AAGGTTGTGG CTTTAGGTGA 1800 

AATTGGCTTA GATTACCATT GGATGACAGC GCCCAAAGAG GTGCAGGAGC AGGTTTTTCG 1860 

CCGTCAGATT CAGCTATCTA AGGACTTGGA TTTGCCTTTT GTTGTCCATA CCCGTGATGC 1920 

GCTGGAAGAT ACCTATGAGA TTATCAAGAG TGAGGGCGTT GGTCCTCGTG GTGGTATCAT 1980 

GCATTCATTT TCAGGGACGC TTGAGTGGGC AGAGAAGTTT GTGGATCTTG GTATGACCAT 2040 

TTCCTTCTCA GGAGTGGTGA CTTTTAAGAA GGCAACTGAC CTCCAAGAAG CAGCTAAAGA 2100 

GTTACCTTTG GACAAGATGT TGGTGGAAAC AGATGCGCCT TACTTAGCAC CTGTACCCAA 2160

GCGTGGTCGT GAAAATAAAA CAGCCTATAC TCGCTATGTG GTCGACTTTA TCGCTGACTT 2220 

GCGTGGTATG ACGACAGAAG AGCTGGCGGT AGCAACGACT GCAAATGCAG AACGAATTTT 2280 

TGGACTGGAC AGCAAGTAAT GAAAGAGAAA ATTTCTCAAG TTATCGTGGT TGAAGGGCGT 2340 

GATGATACGG TCAATCTCAA ACGTTATTTC GATGTGGAGA CCTATGAGAC TCGAGGTTCT 2400 

GCCATCAATG CTCAGGATAT AGAGCGGATT CAGCGCCTGC ACCAACGTCA TGGAGTCATT 2460

GTCTTTACAG ACCCAGATTT TAATGGGGAA CGGATTCGGC GCATGATCAT GATGGTCATT 2520 

CCAACAGTTC AGCATGCCTT TCTCAAGCGA GATGAAGCTG TTCCCAAGTC CAAGACCAAG 2580 

GGGCGTTCTC TGGGAATTGA GCATGCCAGC TATGAAGACC TGAAAACGGC TCTAGCTCAA 2640 

GTGACAGAAC AATTTGAACA TGAGAGTCAG TTTGACATTA GTCGTAGCGA TTTGATTCGC 2700 

CTTGGTTTTC TAGCAGGGGC AGACAGCCGT AAGCGTAGAG AATATCTCGG AGAGACTCTC 2760 

CGAATCGGCT ATTCCAACGG CAAGCAACTC CTCAAACGCC TAGAGTTGTT TGGGGTTACT 2820

TTGGCAGAAG TGGAAGAAGC TATGAAATCT TATGAGTAGG AAAGATGTAG CCGTTACAAT 2880 

TTTTTAAGTT TCACAGTATT TTTCGAAGCA GGTAGAAGAG GAGGCGTCTG ATGTTAATTG 2940 

GTCAAAAAAT TAAAGAGATT CGGATAGAAA AAGGAATTAG TCGTCCAGAT TTTTGTGGAG 3000 

ATGAGCAAGA ACTGACAGTT CGTCAACTGT CGCGAATTGA AAGTGGAGCT TCGCAACCGA 3060 

GTTTGCCCAA GTTAGACTAT ATTGCTCGCC GGCTAGGAGT TCCAGTTTAT AGCCTTATGC 3120 

CGGATTTTTC AGCTCTTCCT TCTGCTTATT TAGAATTGAA ATACCAGATT TTACGTGAAC 3180 

CAATCTATGG TAAAGAAGAG GAGTACGATA AGAAGGAAGC GTGTTTGGAA GAGATTTATA 3240 
AAACATACTT TGATAATCTT CCTAAAGAAG AACAATTAGC ATGTGAAGTA TTGCAGGCGT 3300

GTTTGGATAC TTCTAGAACT AGAAGGCCTG AATATGCAGA GTTAATACTT GAGGAACATA 3360 

TGCCTCAGAT TATAGAAAAA GAAGCTTATT CAATAAATGA TATGTTGTTG ATTCGTTTGT 3420

TTTTTTATCA AATGCTCATT AGAAAAGATC TTGCCAAATT TATAAATCAA ATCGAAAAGC 3480 

TAATGCTCTT TCTTTTGGAA CAGAAGAAGG TAACTCAAAT AGAGAATTAC TTTATAATTA 3540

GAGATACTCT TATTTCAGGA ATGTGTTGTC TTGAAAAGGT AGGAGTAACT GATTGTTTTA 3600 

ATGATTATCT ATCGTGTTTA CAAGAAATTA TGGATAAAAC TCAAGATTAT CAAAAGAAAC 3660 

CTCTTGTATT TATGTTTTTG TGGAAGCAAG CATTAAGAGA AGAAAGAGAT TTTAGTTTAG 3720 

CTGAATCATT TTATCAGTCT TCTAAAACAT TTGCGCAGCT AATTGGAGAT GAATTTCTAG 3780 

TAAAGAAATT GACAGAGGAA TGGCAAGAGG ATGTCAAAAA ATATTTATAA ACATAGTGAA 3840 

TCAGTGACAA AGATGTCCTT GTCCTCGTAT CAAAACAGTT CTAAAGTTCG TCTTTAGGGA 3900 

TGTTTTTTTA GATATAAGCT AAAAATGACA CGAAATGGTT AGATTTTAAG GACATTGATG 3960 

TCCG 3964 



(2) INFORMATION FOR SEQ ID NO: 137: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12666 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137: 



TGAGACCGTT ATTTGTATTA GGGAAATGGG TATCTATTTT TAATGCTGTG GGGATTTTGA 60

TTGTTTCTAT TATTCAAACC AAAAGCTTGT CAGGTATTGG AGCAGGATTG TTTAATCTAT 120

ATAACATTTC ATCTTATATA GGTGATTTAG TTAGTTTCAC TCGATTGATG GCATTAGGAT 180

TATCTGGAGC AAGTATAGCA TCAGCTTTCA ATTTAATTGT TGGTTTGTTT CCGGGAATAT 240

TGGCTAAACT GACAATTGGA TTAGTATTAT TCATTCTTTT ACATGCGATC AATATTTTTC 300

TATCGTTACT ATCAGGATAT GTTCATGGAG CACGTCTGAT ATTTGTTGAA TTTTTTGGTA 360

AGTTTTATGA GGGTGGAGGA AAACCATTTC AACCTTTGAA GGCTTCTGAG AAATATATTA 420

AGGTTATTAC AAAGAATTAA TGGAGGATAT ATATAATGGA ACATTTAGCA ACTTATTTTT 480

CAACCTATGG AGGAGCTTTC TTCGCTGCAT TGGGAATTGT ATTGGCGGTT GGATTAAGCG 540

GTATGGGGTC TGCTTATGGA GTTGGTAAGG CTGGGCAATC TGCCGCAGCT TTACTGAAAG 600

AACAGCCTGA AAAGTTTGCC TCAGCTTTGA TATTGCAATT ATTGCCCGGA ACACAAGGAT 660

TATATGGTTT TGTTATTGGA ATTTTAATTT GGTTGCAATT AACTCCAGAA CTTCCTTTAG 720 

AAAAAGGCGT TGCTTATTTC TTTGTAGCTC TTCCAATTGC TATTGTAGGA TACTTTTCAG 780 

CTAAGCATCA AGGAAATGTA GCAGTAGCGG GAATGCAAAT CTTGGCTAAA AGACCAAAAG 840 

AATTCATGAA GGGAGCAATT TTAGCTGCCA TGGTAGAAAC CTATGCAATT CTTGCTTTTG 900 

TCGTATCATT CATTTTGACC CTTCGTGTAT AAGAAATAAA TTTGCAATTC AAAGGAGGTG 960 

TCTAAATGAG CAATTTAGAA AACTTACGAG AGTCTGTTAT TGAACAAGCT CATGAAAAAG 1020 

GGCGTATGAA ATTATTGGAT TCCAAAAAGA AGATTGATGA TGAATTTGAA ATGCAAAAGT 1080 

CGCTCATTAT AAAGAAAAAA GAAGCTGAAC ATGAACGAAA GTTAAAAGAA TTGCAACAGA 1140 

AATATCAAAT AATTTTTCAA CAATTAAAAA ATAAGGAACG CCAATCAACG TTAGTATCAA 1200 

AACAGAAAAT ATTAAAAGAA CTTTTTCAAT CTGCTTTACT AGAAATGGAA TCTTGGAGTG 1260 

CAGATAAAGA AATGGAGTTC ATCTATCGAA TTCTGGAACG ATATTCACAA CAAGAGGTCA 1320 

TAGTAACCTT TGGGGAACGG ACTTTAGCTA AATTCAATTT GGAACAATTA GAGAAATTGA 1380 

AATTCTCTTT TCCAAATTAT TTATTTAGTG AACAACCTAT CTCAAATGAA TCAGGCTTAC 1440

TTATTTCAAT AGGTAAAATT GATGATAACT ATTTGTATAA AACATTAATT GGATCGATTT 1500 

CTAAGGAAGA AAGTTCAAGT ATCGCAAATC AAATTTTTAT CAATTAAGGA TGAAATTGGT 1560 

•i'AATCCTTCT TAGAAATTTG GAGTATTCCA ATAAAATTAG AAAGGTATTT TATGGATACT 1620 

AATCTTTTTT CAAAAATAAA TACGACGATT TCGGTAAAAG AAAACGATTT TATTACAGAA 1680 

GAAAAATTTC AAAAAATTAT ACAATCCAAA GATACGGAGA CATTGGCATT TATCTTAGAA 1740

TCAACTCCCT ATCATTTATC GATTGACATC TTAGAAGATC CTAGTCAGAC AGAGATTTCG 1800 

CTAATGACAA AATTAGTCAA TGATTATAGA TGGGCCTATG CTGAAAGTCC GTCTGATATA 1860 

ATTGTGACTT TATTTGCTTT ACGATATGTT TATCATAATA TCAAAGTTTT ATTAAAATCT 1920 

AAGGCGGCAA TTAAGAAAGA TTTTTCTAAA TTATTAATTC CAATAGGGAT TTTTGATATA 1980 

GAAAGTTTAA AACATTTAGT TTCTTCCTTA CATTCAGATA CACTTCCTGA TTTTATGGTT 2040 

CGTGAAGTAG AATCAATTTG GAATGAGTAT GAAACTTTTA ATAATATTCG TGTACTTGAT 2100 

GTCGGAGCTG ATCTAGCATA TTTTAAACAT CTGAAACTTT TATCTAATGA GTTAGATGAG 2160 

GTACTGTCTC AGGTTATTGT CGAAATGATT GACTTTTATA ATATTATTAC TGTAAAACGT 2220 

GGTTTATCTC AAAATAAGAG TCATGGGGAT ATTTTACAAT TACTTTCAGA TGAAGGAAGT 2280 

ATTTCTGCTA AAGAATTTAT ATACATTGTA GAAAATCAAG AAATATTTGT GTGGTTCAAT 2340 

AAAATAAATC CAAGCTTAGA TTCAATCTTT TCAACTTATG AATTGAAGAT GCAGGACGCA 2400

ACAATTTCAT CTTCTGAGTT AGAATTTTTA TGTGATTTAC TATTGTATAA AACTTTAGAT 2460

CAAGGAAGGT ACAATGTAGA GGGGCCGTTA GTTCTTGCTA GATATTTATT GGGATGTGAG 2520 

TTTGAAGTAA AGAATCTCAG AATGATCATA TCAGCTCTTC AAAATACAAT TCCCTTTGAA 2580 

TCAATAAAAG AAAGGATACG CCCACATTAT GGAAGCTAAT AAGTATAAAA TTGGCATAAT 2640

TGGTAGCCGT GATATTATTT TACCATTTAG CATGATTGGG TTTGATATAT TTCCTGCCTA 2700 

CCAAGAACAA GAAGCTATAA ATACACTAAG AAAATTAGCT CAATCTGATT ATGGTGTCAT 2760 

TTATATCACT GAAGACATTG CTTCAATGAT ATTAGATACA ATTCGCCATT ATGATTCCCA 2820

AGTTGTGCCT GCTATTATTT TATTACCGAC TCATAAACAA GGTTTAAATT TAGGATTAAA 2880 

ACGTATAGAG GATAATGTAG AGAAAGCAGT AGGACACAAT ATTTTATAAT AATGTACAAA 2940

ATTGTCTGTA ATATTATTCT ATAATTTTTG GACTTAGTAA GGAGAATAAC TTTGACTCAA 3000 

GGGAAGATTA TAAAAGTATC GGGACCTCTA GTTATTGCAT CAGGTATGCA GGAGGCTAAT 3060 

ATTCAAGATA TTTGCCGTGT AGGTAAGCTA GGGTTAATCG GTGAAATTAT TGAAATGAGA 3120 

AGAGATCAGG CATCTATCCA AGTCTATGAA GAAACATCTG GTCTTGGTCC GGGAGAACCT 3180 

GTTGTTACAA CTGGAGAACC TCTCTCGGTT GAATTAGGGC CAGGATTGAT TTCTCAAATG 3240

TTTGATGGCA TACAACGCCC ATTAGATCGA TTTAAATTGG CTACTCATAA TGATTTTCTA 3300 

GTTCGTGGGG TAGAAGTTCC AAGTTTGGAT AGAGATATTA AGTGGCATTT TGATTCC.\CT 3360

ATAGCAATTG GTCAAAAAGT GAGTACGGGT GATATTCTTG GAACTGTCAA GGAAACCGAG 3420 

GTAGTTAATC ATAAAATTAT GGTTCCTTAT GGAGTATCTG GAGAAGTCGT TTCTATTGCA 3480

TCTGGCGATT TTACAATTGA TGAAGTTGTA TATGAAATAA AAAAATTGGA CGGTAGTTTC 3540 

TATAAAGGAA CGCTTATGCA AAAATGGCCT GTCCGCAAGG CGCGTCCTGT TTCTAAACGT 3600

TTAATTCCAG AAGAACCATT AATCACAGGT CAACGAGTTA TTGATGCATT CTTTCCAGTA 3660

ACCAAAGGGG GAGCTGCAGC AGTTCCTGGA CCGTTTGGAG CAGGAAAGAC AGTTGTACAA 3720 

CACCAAGTAG CTAAATTTGC CAATGTTGAT ATTGTTAnT ATGTCGGTTG TGGAGAACGT 3780

GGAAATGAAA TGACGGATGT ACTGAATGAG TTTCCTGAGT TGATTGACCC TAATACCGGA 3840 

CAATCAATTA TGCAACGGAC AGTTCTGATT GCTAATACTT CAAATATGCC TGTTGCTGCT 3900 

CGTGAGGCTT CAATTTATAC AGGAATTACC ATGGCTGAGT ATTTTCGTGA TATGGGCTAC 3960 

TCTGTCGCCA TTATGGCTGA TTCAACTTCA CGTTGGGCAG AAGCGCTACG TGAAATGTCA 4020 

GGACGTCTAG AAGAAATGCC TGGTGATGAG GGTTATCCTG CTTATCTGGG AAGTCGTATC 4080 

GCTGAATATT ATGAAAGAGC AGGACGTTCT CAGGTTCTAG GGCTTCCAGA ACGTGAAGGA 4140 






ACGATTACTG CTATTGGAGC TGTATCGCCA CCTGGTGGAG ATATTTCAGA ACCAGTTACT 4200

CAAAACACTT TACGGATTGT GAAAGTTTTT TGGGGGCTTG ATGCTCCGTT GGCACAGCGA 4260

CGTCATTTTC CTGCAATTAA CTGGCTTACA TCTTATTCAC TATATAAAGA CAGTGTGGGC 4320

ACTTATATAG ATGGTAAAGA GAAGACAGAT TGGAATAGTA AAATAACTCG TGCGATGAAC 4380

TACTTACAAC GGGAATCTAG TTTAGAGGAA ATTGTTCGTC TTGTTGGAAT TGATTCTCTG 4440

TCTGATAATG AACGACTAAC GATGGAAATT GCTAAACAAA TTCGAGAAGA TTATTTGCAA 4500

CAGAACGCTT TTGATTCGGT AGATACATTC ACTTCGTTTG CAAAACAAGA AGCAATGCTA 4560

AGTAATATTC TCACTTTTGC TGATCAGGCA AATCATGCTT TAGAGTTGGG TTCTTACTTT 4620

ACAGAGATTA TGGAAGGTAC CGTGGCAGTT CGAGACCGTA TGGCGAGAAG TAAATATGTT 4680

TCAGAAGATA GATTAGATGA AATCAAAATT ATATCAAATG AGATTACACA TCAAATTCAT 4740

TTGATATTAG AAACAGGAGG TCTATAAATG AGTGTTATAA AAGAATACAG AACTGCTAGT 4800

GAAGTTGTTG GGCCTCTTAT GATTGTTGAA CAAGTAAATA ATGTGTCTTA CAATGAGTTA 4860

GTTGAAATTC AACTTCATAA TGGAGAAATT CGTCGTGGAC AAGTTTTAGA GATCCACGAA 4920

GATAAAGCAA TGGTTCAGCT TTTTGAAGGA TCTAGTGGAA TAAATTTAGA AAAGTCTAAA 4980

ATTCGTTTTG CTGGTCATGC ATTAGAATTG GCTGTATCTG AGGATATGGT TGGTCGTATT 5040

TTTAATGGGA TGGGAAAACC AATTGATGGT GGACCAGATT TAATTCCAGA GAAATATTTA 5100

GATATTGATG GTCAAGCTAT TAATCCTGTA TCTAGAGATT ATCCAGATGA ATTTATTCAG 5160

ACAGGGATCT CCTCTATTGA TCATTTGAAT ACTCTTGTAC GTGGTCAAAA ATTACCAGTA 5220

TTTTCAGGTT CGGGCTTACC TCATAATGAA TTAGCTGCTC AGATAGCAAG ACAAGCGACT 5280

GTTTTAAATT CTGATGAAAA TTTTGCGGTT GTATTTGCAG CAATGGGTAT TACTTTTGAA 5340

GAAGCTGAGT TTTTTATGGA AGAACTCAGA AAAACAGGAG CGATCGATCG TTCGGTTTTA 5400

TTTATGAACT TGGCAAATGA TCCTGCAATT GAGCGTATTG CAACTCCCCG CATTGCTTTA 5460

ACTGCGGCAG AGTATCTAGC TTTTGAAAAA GATATGCACG TTCTAGTTAT CATGACGGAT 5520

ATGACTAACT ATTGTGAAGC GTTACGTGAA GTCTCGGCAG CTCGCCGTGA AGTTCCAGGG 5580

AGACGAGGCT ATCCGGGATA TTTATATACA AATTTATCAA CTCTATACGA AAGGGCTGGT 5640

CGCTTAGTTG GTAAAAAAGG TTCGGTGACA CAGATTCCTA TTTTAACAAT GCCAGAAGAT 5700

GACATAACAC ATCCAATTCC TGATTTAACT GGATACATTA CTGAAGGGCA AATTATTTTG 5760

TCGCATGAGT TGTATAATCA AGGTTATCGT CCACCAATCA ATGTTTTACC TTCTCTCTCT 5820

CGATTAAAAG ATAAGGGATC TGGAGAAGGT AAAACTCGTG GAGATCATGC TCCAACTATG 5880

AATCAACTGT TTGCAGCCTA TGCCCAAGGG AAAAAGGTTG AAGAGTTAGC AGTAGTATTA 5940

GGAGAATCGG CTTTATCTGA TGTAGATTIAA TTGTATGTGA GGTTTACAAA GCGTTTTGAA 6000

GAAGAGTACA TAAACCAAGG ATTTTATAAA AATCGAAATA TAGAAGATAC GTTGAATCTT 6060 

GGGTGGGAAT TACTATCAAT TCTTCCTAGA ACAGAGTTAA AACGTATCAA AGATGATTTG 6120 

CTTGATAAAT ACTTACCTTT GGTAGAAGTT TAATCCGGAA ATGGAGTGAT TATCTATGGT 6180 

ACGTTTGAAT GTAAAACCAA CTCGTATGGA ATTGAATAAC TTAAAGGAAC GTTTGACAAC 6240 

AGCTGAACGT GGACATAAGT TATTAAAGGA TAAAAGAGAT GAATTGATGA GGCGATTTAT 6300 

TTCTTTGATT CGTGAGAATA ATCAACTTCG GAAAGAAGTG GAAAGTTATC TAATTGATAA 6360 

TCTA7VAATCC TTTGCAGTTG CTAAATCATT AAAGAATTCT CAAATGGTGG AGGAATTATT 6420 

TTCAATTCCA TCGAAAGAAA TTGAATTATT TGTTGAGAAA GAAAATATCA TGAGTGTAAC 6480 

AGTTCCTAGA ATGCATATGA ATATTACTTC TCAAAATGAG AACAGTGAAT ACAGCTATTT 6540 

ATCTTCTAAT AGTGAAATGG ATGATGTATT TGCTACAATG AATAGTTTAA TTTATAAATT 6600 
ACTAAGACTG GCAGAAGTTG AAAAAACGTG TCAGTTAATG GCTGATGAAA TAGAAAAAAC 6660

ACGTAGACGT GTAAATGGTT TAGAATACTC GATTATTCCA AACTTGTCGG AAACTATTCA 6720 

TTATATAGAA TTGAAACTAG AGGAGGCAGA AAGAGCCAAT TTAGTTCGTA TTATGAAAGT 6780 

GAAGTAGATC CTTTATTTAG ATTATTAATT AGATGAACAA ATATCAGCTT GGATAAGGCT 6840 

TTAAGCCTTT CTAAGCTTTT TTTATTGACA CTATCAGGAT ATCTTTTTCA AAATT-rrtJ^T 6900 

TTGTTAGATA ATGAAAATGT TTCTACTAAT CTAGATTTAG GATTAGTAAA TCGTAAATGT 6960 

AATTATATAG AAAGTAAGCG CGTCATAACA AGGTATCTAT CATTCATGGA GCTCCTCCTG 7020 

TATACTATTA GTAAAGTAAA ACTATTGGAG GATATTTTAA TGCCACAACC TATTGTTCCT 7080 

GTAGAGATTC CACAATCTCG TCGTTTTGAT TCTAAAAAGA GAAATGATAT TCTGCTTAAA 7140 

ATTCGTATTG GCAAGCTTGA AGTAAGTTTT TTTCAATCTC TCAATCTCGA AATGGTAGAA 7200 

CAGCTTTTGG ATAAGGTGTT GCTCTATGAC AATTCATCTA TCTAGCCTAG GGGAGGTCTA 7260 

TCTCGTGTGT GGGAAAACTG ATATGAGACA AGGAATCGAT TCACTGGCTT ATCTGGTTAA 7320 

AACCCACTTT GAATTGGATC CTTTCTCCGG TCAAGTCTTT CTCTTTTGTG GTGGACGTAA 7380 

AGACCGCTTT AAAGTCCTTT ACTGGGATGG TCAAGGATTT TGGCTACTAT ATAAACGCTT 7440 

TGAGAACGGC AGATTGATTT GGCTAAGTAC AGAT^GGAT GTCAAAGCTC TCACACCAGA 7500 

ACAAGTAGAC TGGCTTATGA AGGGCTTTTC TATCACTCCA AAAATATAGT AGATTGAAAC 7560 

TAGAATAGTA CACCTCTGCT TCTAAAACAT TGTTAGAAAT CGATTTTACT GTCCTGATCG 7620 

ATTTGTCCTG TTCTTATTTC ATTTTACTAT AAATCCATCA GAAAGTCGTG ATTTCTATTG 7680

AAATGAGGAC TTTCTTTTTA TACTCATCTG CTTTCAAAAA GCATTCTAGT CCATCTCCGA 7740

TTAACGATGG ACTTTATCAC CTCCTTCTCC AGTCCTTGTA TAACATCTTG GAGTTGATTC 7800 

ATGACATCTT CCAAAGTTTA AAAGGCTTTA TTCTTAAATC CACGTTTACG AATCTCTTTC 7860 

CACACTTGTT CAATGGGGTT CATCTCTGGT GTGTATGGAG GAATAAATGC AAAGCCAATA 7920 

TTAGTCGGAA TCTTTAAGGT ACTTGATTTA TGCCATATAG CATTGTCCAT AACGAGTAAA 7980 

AGATAATCAT CTGGATAAGC TTGTGAAATC TCCTATTCCT AAAGCCCCTT TAGCGCATAA 8040 

CTTTGGCTCA GCTTCTATTA TCGCTCACAC CATCCATCAG AAGTTTAATC TGAAGGTACC 8100 

CAATTATCGC CAAGAAGAAG ATTGGGCTAG GATGGGTTTA CCT^TCACAC GTAAGGAAAT 8160 

CTCTAATTGG CATATCAAGG CGAGTCAATA CTATTTGGAG CCCCTTTATA ACCTCTTGCG 8220 

AGAGAGACTA TTGACTCAGC CCTTACTTCA TGCGGATGAA ACTTCTTATA GGGTGCTAGA 8280 

GAGTGATAGT CAGCTGACTT ACTATTGGAC TTTTTTGTCA GGTT^GCAG AGAAACAAGG 8340 

GATTACGCTT TACCACCATG ATCAGTGTCG AAGTGGTTCA GTAGTACAAG AATTCCTAGG 8400 

AGATTATTCT GGCTATGTGC ATTGTGATAT TTTGCGGCAG TAACTTAGGA CTTTAGTCCT 8460 

CTAGTTCTGC CTATGCGATA GCAGTCCAAG GTTTAGGAGC AAGGCGACGC TAAGCTTGGT 8520 

AAACTTCGAA CCGCTCGTCT GCTTATCGTC AACTGGAAGA AGCTGAACTT GTTGGATGTT 8580

GGGCGCATGT GAGAAGGAAG TTTTTTGAAG CGCCCCCCCA AGCAAGCGGA TAAATCATCC 8640 

TTAGGAGCTA AAGGTTTAGC TTATTGTGAT CAGTTATTTT CCTTGGAAAG AGACTGGGAG 8700 

GCTTTGCCAG CTGATGAACG ACTACAGAAA CGTCAAGAAC ATCTCCAGCC CTTAATGGAA 8760 

GACTTCTTTG CTTAGTGCCG GCGTCAGTCA GTTTTAGCAG GTTCAAAACT AGGAAGGGCA 8820 

ATTGAATACA GCCTCAAGTA TGAAGAAACC TTTAAGACCA TTTTGAAAGA CGGACATCTG 8880 

GTCCTTTCCA ATAATCTAGC TGAACGCGCC ATTAAATCAT TGGTTATGGG ACGGAGTAAA 8940 

AGAGTCCAGT GGACTCTTTT AGCCTAAGCT CAGTTTAAAA AAGCGAGGGT GGTTATTTTC 9000 

TCAAAGTTTT GAAGGAGCTA AAGCAAGAGC TATTATTATG AGTTTGTTGG AAACAGCTAA 9060 

ACGTCATCAA TTAAATAGCG AGAAATATCT ATCCTATCTT CTAGAATGTC TTCCAAACGA 9120 

GGAAACTCTC GTAAACAAAG AGGTTTTAGA GGCTTATTTA CCATGGACTA AAGTTGTACA 9180 

AGAAAAGTGC AAATAAGAAA TCTCCAGATT AGGAACTATC CGTGAGTTCT CCAGTCTGGA 9240 

GATTTTTCAA TAGACTTCCT GCGAAACAAA ATATGGTATA ATAGTTCTAT GAATGATGAA 9300 

GCAAGTAAAC AACTAACCGA TGCACGATTT AAGCGTCTTG TTGGTGTTCA ACGCACGACT 9360 

TTTGAAGAGA TGTTAGCTGT ATTAAAAACA GCTTATCAAC TTAAACACGC AAAAGGTGGA 9420 

CGAAAACCTA AATTAAGTCT AGAAGACCTT CTTATGGCCA CTCTTCAATA TGTGCGAGAA 9480

TATCGAACTT ATGAACAAAT TGCGGCTGTT TTTGGTATTC ACGAAAGCAA CTTAATCCGT 9540

CGGAGCCAAT GGGTTGAAGT AACTCTTGTT CAAAGTGGTG TTACGATTTC AAGAACTCCT 9600

CTCAGTTCTG AGGACACGGT AATGATTGAT GCGACGGAAG TAAAAATCAA TCGCCCTAAA 9660

AAAAGAATTA GCGAATTATT CTGGTAAAAA GAAATTTCAC GCTATGAAGG CTCAAGCGAT 9720

TGTCACAAGT CAAGGGAGAA TTGTTTCTTT GGATATCACT GTGAACTATT GTCATGATAT 9780

GAAGTTGTTC AAAATGAGTC GCAGAAATAT CAGACAAGCT GGTAAAATCT TGGCTGACAG 9840

TGGTTATCAA GGGCTCATGA AGATATATCC TCAAGCACAA ACTTCACGTA AATCCAGCAA 9900

ACTCAAACCG CTAACAATTG AAGATAAAGT CTATAACCAT GCGCTATCTA AGGAGAGAAG 9960

CAAGGTTGAG AACATCTTTG CCAAAGTAAA AACGTTTAAA ATGATTTCAA CAACCTATCG 10020

AAATCATCTA AACGCTTCGG ATTACGAATG AATTTGATTG CTGGTATTAT CAATCATGAA 10080

CTAGGATTCT AGTTTTGCAG GAAGTCTATT ATCAAAAATA CCATCAAGAT TATATAAGAT 10140

TGATACAGGA AAAGTTTTAT TTGATGGTGT AAATATTAAT CAAATAGATA AAAAAATATT 10200

AAGTCAAAAT TTAGGAGTAG TTCCACAGGA TTCATTTTTA TTGAACCGAA GTATTCTTGA 10260

TAATATAACT TTAAAGCACG AAGTTACTTC ACAAAAGATA GAGGAAGTTT GTAAAGCAGT 10320

TCAAATCTAT GATGAAATCA TGGCTATGCC GATGAAATTT AATACTATCA TCTCAGAGAT 10380

GGGGTCAAAT ATTTCAGGTG GGCAAAGGCA ACGGATACCA CTGGCACGTG CATTAATAAA 10440

TAATCCTAGT ATTGTAATTT TAGATGAAGC AACTAGTGCA TTAGACACTA TTAATGAGGA 10500

AAGAATAACA AAGTATATAC AAAGTCAGGG CTGTACTCAA ATAATTGTAG CTCATAGATT 10560

GTCAACGATT AAGGATGCGG ATGTTATTTT TGTAATGAAA GGTGGTAAGA TTGTTGAGTC 10620

AGGAAATCAT AAGTACTTAA TGGATCTTGG TGGAGAGTAC TACAGCTTAT ATACAAAAAG 10680

GAAATGAGGT GTAAAGAAAA TGAAGAAAGA AAATGAATAT GTAATTTTAA CAACAGCCTC 10740

ACTAGGGGTG ATGATTGGAA TAGTGTTTGC AATTTTTTTA GATTTTCCAG TTGAATATGG 10800

TATTTCTTTA GGCTTGTTGA ATGGAATAGT ATTGGGTTCG CTGATTGTTT ACAAAAACAA 10860

TAAGAATTAA GCATAATTTT TTGCTGTAAA CTAAGGAGTA GAGATGGCTA TAGTTGAAAT 10920

TATAAATCTA ACAAAAAGCT TTAAAGATAT TGAAGTTATT CATAACACTT AAATAATAGA 10980

GCAACTACAG TAGTAGCTTA AAAACATGAT TAAATCGCTA TTCTTAGGAG TAGCGGTTTT 11040

TCTTTTTGTT TAATACTCTT TGAAAATCTC TTCAAACCAC GTCAGCTTTG CTTTACCGTA 11100

CTCAAGTACA GCCTGCGGCT CGCTTCCTAG TTTGCTCTTT GATTTTCATT GAGTATAAAA 11160

AGGGTCAAGT AAGTATAGTA AATTGAAATA AGATATGAAC AAATCGATTA GAAAAGTCAA 11220

ATTAATTTCT AGAAATATGT TAGAAATTGG TTTGAATTCC GCAATCAATT TGTTCAGTTT 11280


TTATTTCATT TCATTTTATT TAATTAGATT TTCCAATTTT TTAATTCAAG CTAAAAATCC 11340

CCAATCGTAG TGATTGAGGA TTGAGTAAAT AAATCTTAAA CAATACCTTG TGCAATCATG 11400

GCATTTGCTA CATTTTCAAA GGCAGCAATG TTAGCTCCTG CAAGGTAGTC TTTATCAAGA 11460

CCGTATGTTT CTGAAGTCGT TTTAGCTGTG TTGAAGATGT TTGTCATGAT GTCTTTGAGA 11520

CGGCCATCAA CTTCTTCACG AGTCCATGAG AGGCGAAGAC TGTTTTGGCT CATTTCAAGA 11580

GCTGAAACGG CTACACCACC AGCGTTGGCA GCTTTTGCAG GTCCGTAGAA GATACCATTT 11640

TCTTTGTAAA CTTTGATGGC ATCAAGGTCG CTCGGCATGT TGGCACCTTC AGATACACAG 11700

ATAACGCCTT GAGCAACCAA ACGTTTAGCT GCTTCACCGT TGATTTCGTT TTGAGTGGCA 11760

CATGGAAGAG CAATGTCATA GTTTCCAGCG TAAGTCCATA CAGTACCTTC GTGGTAGGTT 11820

GCAGTTGCTT TTTCAGCTGC ATACTCAGTC AAACGAGCAC GACGTTTTTC TTTAACATCA 11880

ACCAAAAGAT CGAAGTCGAT ACCATTTTCA TCGATGACAT AACCATTTGA GTCAGAAACA 11940

GAAATAACAG TTGCACCGAG TTCAGTTGCT TTTTGAAGAG CATATTGAGC AACGTTACCA 12000

GAACCTGAAA TAACGACTTT CTTACCAGCA AAGCTGTTAC CGTTAGCTTT GAGCATTTCT 12060

TCAGTATAGT AAACCAAACC GTAACCAGTT GCTTCTGGAC GAATCAAGCT ACCACCAAAT 12120

CCAAGAGGTT TACCAGTCAA GACACCAGCA TCAAATTGGT TAAGACGTTT GTATTGACCG 12180

TAAAGGTAAC CAATTTCACG TCCACCAACA CCGATATCAC CAGCAGGTAC GTCAAGTGAT 12240

GGTCCGATGT GTTTTTGCAA TTCAGTCATG AAGCTTTGGC AOAAtat, (jC A T CACTTCAGCA 12300

TCTGTTTTAC CTTTAGGATC GAAGTCTGAT CCACCTTTAC CTCCACCGAT AGGAAGTCCA 12360

GTCAAGACAT TTTTAAAGAT TTGTTCAAAT CCGAGGAATT TCAAGATCCC TTGGTTTACA 12420

GTTGGGTGGA AACGAAGTCC ACCTTTGTAT GGTCCAACAG CTGAGTTGAA TTGAACACGG 12480

TAACCACGGT TTACTTGAAT TTTTCCATCA CGGTCAACCC AAGGAACACG GAAAGAAACC 12540

ACGCGCTCAG GCTCAGTAAT ACGTGCCAAG ATATTTTCTT CGATATACTC AGGGTGTTTT 12600

TCAAATACAG GTTCTAAAGT GTTGAAAAAT TCTTCAACAG CTTGGAGGAA TTCAGCCTCG 12660

TGCCGG 12666


(2) INFORMATION FOR SEQ ID NO: 138:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3083 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138:

AGCAACTGTT GTGAACCAAT TCCGATAAAT TCCAAGAATT GGTTAATAGA GCCATTTTGA 60 

CCAAAAATCC CGATAAAAGC ATAGGCTTTA AGGAGCAAAT TGATCCAGGT AGGAAGGATA 120 

ATCAGCATGA GCCAGAGTTG ACGGTGTTTG AGACGGGTCA AAAAGAGGGC CGTCGGATAA 180 

CTGATAAGCA GTGCCACAAA GGTCACAATG CCTGCATAAA GCACTGAGTT GAAACTCATT 240 

TTAAGATAGG TCAAGTTTTG TGACGCAAAG TAAGATTTGT AATTTTCTAA ACTGAACTGG 300 

CCTTCGATGT TGAAAAAGGA TTGACCGAAA ATCAAGACCA AGGGTGCCAA TACAAAGAGC 360

GCAATCCAAA GCATGTAGGG TACTACAAAG AGTTTAGAGC TTGTTTTCTT CATCTCTTTC 420 

CTCCTCGATT GCATTGATCA AACCTGCTTC TTGCTCTTCG ATTTCTACGT ACTCCTCAAT 480 

ACGAGCATCG AACTCTTCTT CGGTTTCATT GAGACGCATG ATGTGGATGT CTTCTGGTTC 540 

AAAGTCCAGA CCGATTTCCT CACCCACGAT AGCCTTACGG GTTGAGTGGA TCATCCATTC 600 

ATTTCCAAGT TCGTCATAGG CGATAATTTC ATAATGAACT CCACGGAAAA GCTGGGTATC 660 

GACCTTAACT TGGAGCTTGC CTTCTTCAGG AAGGGTAATG CGCAAGTCCT CTGGACGAAT 720 

AACGACCTCA ACAGGTTCAT TTGGCTTCAT CCCACCATCA ACCGCTTCAA AGCGTTTGCC 780 

GTTAAATTCG ACCAAGTAGT CCTCAATCAT GGTACCTGGC AAGATGTTTG ACTCCCCGAT 840 

AAAGGTGGCA ACAAAGTGGT TGATTGGCTC ATCGTAGATG TCCACAaGQG TTCCAGACTG 900

GACAATCTCG CCATCATTCA TT^CGAAAAT CCAGTCACTC ATGGCAAGAG CTTCTTCCTG 960 

ATCGTGAGTG ACAAAGACAA AGGTAATGCC CAATCGTTGT TGTAATTCAC GCAATTCGTA 1020 

CTGCATGTCT GTTCTCAATT TCAAGTCCAG CGCTGATAAA GGCTCGTCCA ACAAGACCAC 1080 

ACGGGGTTGG TTGATGATAG CACGGGCGAT GGCCACACGC TGACGTTGTC CTCCAGAAAG 1140 

TTTGCGGATG GAACGTTTTT CATAACCTTC CAACTGAACC ATCTTGAGAA CTTCCGCTAC 1200 

ACGCTGCTCG ATTTCTTTCT TATCAATTTT ACGCAAGCGA AGTGGAAAGG CAACATTTTC 1260 

AAACACATTC ATATGTGGGA ACAAGGCATA GGATTGGAAG ACGGTATGTA CGTCGCGCTT 1320 

GTTGGTTGGA ATATCATTGA TACGAACACC GTCTAGCATG ATATCTCCTG TCGTCGCATC 1380 

CAGTAAACCT GCAATAATGT TTAGGATAGT TGATTTCCCC GAACCAGATG CACCTAGAAG 1440 

GGTGTAGAAT TTCCCTTCTT CCAACTCAAA GTTGATGTCT TTGAGAACCT TGGTGTTGCT 1500 

GTCTTCAAAA ACTTTAGAGA CGTTTTTGAA TTCGATAATT GGCTTTTTCA ATTGGCATAA 1560 

ATTCCTTCTT TTTCATAGAT TAACCGATCG GGGCTCTGTC AGGTCCCCAC TACCTCTTGC 1620 

AGGGAGTAAA ACCACCTGCA TACATCTTCG CTACCGATAG GCTTTCACCC AAGATCCGGA 1680

CTTCTCTTTC AAGCGTAATA CCTGAGTGTT CCTTGACTTT TTCGATAACC GATTGGATCA 1740

AGTCCTCGTA GTCTTTGGCC GTTCCATCTG CGACATTGAT CATAAATCCT GCATGCTTTT 1800 

CTGACACTTC TACGCCACCG ATACGATAGC CTTTCAAGCC AGCTTCTGAA ATTAACTGAC 1860 

CTGCAAAATG CCCGACTGGA CGCTTAAAGA CCGAGCCACA AGATGGGTAT TCCAAAGGTT 1920 

GCTTGAGTTC ACGTAGGTGC GTCAAGCGGT CCATTTCCTG CTTGATAACC TGATGGGTTC 1980 

CTGGAGCTAG GGCAAATTTA ACTGACAAGA CAACTGCACC AGACTCCTGA ATAGCTGAAT 2040 

GACGGTAACC AAAAGCCAAG TCTTTAGCAG ACAGGGTTTC GATTTCTCCA TCCTTGGTCA 2100 

AGACCTTACA AGACTGCAAG ATGTGAGCAA TCTCGCCACC ATAGGCACCC GCATTCATAA 2160 

AGACAGCACC GCCAACGCTT CCTGGAATAC CACAAGCAAA CTCAAAGCCA GTTAAACTAT 2220 

GACGGAGGGC AATGCGAGTT GTTTCAATCA AGTTAGCCCC AGCTTCTGCT TCAATGGTAT 2280 

AGCCATCAAC AGAAACGTTA TTGAGCTTGT CACACAAGAT GACAAATCCA CGAATCCCAC 2340 

CATCACGAAC GATGATATTG CTTGCATTGC CAAGAACCAT CCAAGGGATA TTTTCTTGGT 2400 

TGGCAAATTT CACAACGCGA GCCAACTCAA AACGATTTCG TGGAAAGACC AAATAATCAG 2460 

CCTCTCCACC TACTTTTGTA TAACTATAGC TATGCAAGGG TTCCTTAAAA CGGATATCAA 2520 

TTCCTTCTAA GATTTCAAGC ATTTTTTCTC TTACAGACAT GTCACTCTTC CTTTTACAAA 2580 

ATTCATTCCA TTATACCATT TTTAGAGACA TTTGACGACC ATAAAAATAC CTTGTTTGGA 2640

TTTTGCATAA GAAAAAGAGG TTCCCCCCTT TTTATGATTT TTTACAAAAG ATTTCCTTGG 2700

TTCCATAGGC GACCAGAACG AGCTCCAGTG CTAGAATCAC TTCAACCAAG ACTGGATTTG 2760 

TCAACCAGCC TACTTGGAAA AGAGATGGTG CCAGATCAAA GAAGGCATGC AAGCCATAGG 2820 

CTGCTAGGAG ATAAATCCAT TTCTTCTGGC GAACAGCTTG GTAAACCCAA ACTGTCAAAA 2880

GTAATTGGAA ACCAAGCGCC AAGATTCGCT CAAAACCAAG CAAATAAATC TGCCAGACCG 2940 

AAAGTGACTG AATGGTTTTT AACATATTTT CAGACAGTAA TTGCATAACC TGTGGATTC;- 3000

GAGTTTGAAC TGCCGAAAGA ACAATGTAAA GATTGAGTAA ACTAGTAAGG CCTAGAAAAA 3060 

TCAACTCCAA GCCACCATGC CCC 3083 
(2) INFORMATION FOR SEQ ID NO: 139:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15363 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139:

CCGGAGGATA TTGACCACCA CCAAAAGCAG GGGGAAAATC GAAATCAACC AATAGTAGGC 60

TACTGCGACA CTGGTCAACT CACTATCTGA TGCTTGATAA TAATGCAAAA AAGCTTTTAA 120

TAAAGGTTTG TCTATCAGCT CTTTCCACCA CTTTTTCATG TCATACTCCT TCACTTATAA 180

TCTTATACTC AATGAAAATC AAAGAGCAAA CTAGAAAGCT AGCCGCAAGC TGCTCAAAAC 240

ACTGTTTTGA GGTTGTAGAT AAGACTGACG AAGTCGATCA CATACATACG GTAAGGCGAC 300

GCTGACGTGG TTTGAAGAGA TTTTCGAAGA GTATTAACTA ATTTCTTCTT ACCAATTCCA 360

CCATATCATA CGGTAGGGTA TTGGCAGCTT CCTTCAAGGA ATAGTTCTCT AAGTTATTTA 420

CATTTTGTCG TAATTTCTTG GCATACTTAG TCGTAATCAA TCGTTTTTCT TCGTATTCGA 480

AAATCAACTT GCGCTCCAGA TAATAGCCTC TCAGCATTTC ATCGATATTG TTGGGTTTGA 540

CACGATTGAT AACCCGTTCG ACAAAGGCAC CACTGCTGAT AATAGCTGTT TCTCGAAGAC 600

GAGACTCCTG CATAAAACTA ATCAAAGAGC GTCTGTAGAC TCCCTTCAGG TTTTCCAAAC 660

TTTCAATAAT CATCTCTGTA TTGGCAAGAT AGAGCTCTGC AATTTGGTCA TAATCAAGAG 720

CACGGAGACG GCTTTGCTCC TTGTTCTTCC AGCTACGGAA GGTCTTTCCG AGAGTAAAAA 780

CTTCATGAAG GAGAAAACGT AAAATCCTCA AGGAAACAAG AAAATAATAG GTCAGTCTTG 840

AGGCAAGTTT ACGATTGATT CCTTGTTCTA TATTTTTCAG ATAACGTTGG TAAACTCGGT 900

AAGCACGATT GCTAATGTTC CCCTCTTCAT AGRrcTGTTC CAAACCATCA CTTTC*^»AT*iC 960

TAAGAATCAA GAGTTTCAAA GCAGCCCAGT CTTCTTGATC ATCCTGGTTT TCTTGGCTTA 1020

AAATGAGATT TTCAATACGT CCATGATAAT TGTCAATAGC CGCATAGAGG GGAAGTTTAT 1080

TTCTGGTGTC TTCCAACTCT TTTTCCAACT CTAGCGTTAC TTCATTCAAA ATGGCGATAT 1140

GCATAAGATA ATCCTTGCTT TCTTCCTCTT CATCAGAAAG ATGAGGCAAG ACCAAGAGAC 1200

CTGTTAAAAA GCTAACAAGC GTCACACCTG CAACAAGGAA AAGCAAAAGA GGATACTCCT 1260

GTTCTAGATT ACTTGGTATC AAGAGAATCG TAGCAATCGA CACCGTTCCC TTAACACCTG 1320

AAAAGGTCAA GAGAAACATG TCCTTCATAT ACTTATTTAG CTTTTTCTTG AGGCGTCGGG 1380

TTCTATAGGC ATAATAGCCA TAGATCATAA TAAAACGAAT GACAAAAAGG ACAAAGGTAA 1440

GGGCGATAAG AGATAGCAAT AAAAGTAGAG GATTATAGAT TGGATTGGTC AAGATAGGTT 1500

CTGCTATCAT TTCCAACTCC ATCCCTAAAA TCACAAAGAC AGAACCGTTG AGCATAAAGG 1560

TCACTGTATG CCAGACCGTC TCGGTCACCG TATCCACTTG GGCTTCGAGG AGCGTGATTT 1620

TCTTGAAGCG ACTTGCCTTT AAAATTCCAG CAACTACGAC GGCAATAATA CCTGAAACAT 1680

GAACTTCTTC TGCCAGAAAG AAGGTCACTA GAGGCAAACT CAATTCTAAT AAAAGTTCAC 1740

TGGCAATATC CGTTGCGCGC ACACTTAGCA AGAAGGTATG GAGGAAGCGG TTGGTCATGG 1800


CTGTTAAAAA TCCAATTAAA AAACCGCCTA GGATTGAAAA GATGAGCGAA CTGCTAGCTT 1860

GCCCCAGAGA AAAAGCTCCA GTTGTCCAAG CTGTCAAAGC TACCTGAAAA GCCACCAAAC 1920

CAGAAGCATC ATTCAAGAGT CCTTCGCCCT TAAGAATATT GGACACGCGC TTAGGAAAGC 1980

TAAAACGCTC CGAAAGAGAG GCAAAGGCCA CCAAGTCCGT AGGACCAAGG GCTGCCCCAA 2040

CAGCCAAGCA AGCTGCCAAG GGAAGGCTGA ACCAAAGAAG ATGGGCCAAG CCACCCAAAC 2100

TCAGGGTCGA GATAAAAATC ACTGGAAATA TGAGATAAAC AATGATTCGC CAGTGTTTTA 2160

AAATAGCCGT AACATCTGCT TCTTCAGCCT CTCGGAAAAG CAAGGGTCCG ATAACCAGTG 2220

CCAAAAACAA CTCCGTATTA AGGTGAAAGT CAGTATTGGG TAAAAAGAGA CCAATCACAA 2280

TTCCCAAAAG AATTTGGACC AAAGGGAGAG GCAAAAAGGG CAGGAGCTTA TTGGTTGTAC 2340

TTGAGACAAT CAAAACCAGT AAAAATAGGA TGAGGTAAAT CAGTAATTCC ACGCACGTCC 2400

TCCTTAATCT TTTTTACAAC AGGATTCAAA TATCTCCTTC TGCTCTTTGA TTTTTTGGTC 2460

AATCTTGGAA CAGTCTTTGT GCTCAATTTT TCTCTGGCAC CGTTCCATTT CAAGAGCAAC 2520

TAATTTTTTC TTGATTTTAA GCATTTTTTT GCTCATATGC GCTTGGTCTA GCACGCCCAT 2580

CGCTCGTTCG TGGTGGGTTG ATTCAACAAA ATTCTGGCGC ATGGCATCCA GCTTTTCGTG 2640

TAAGTATTGT TTATCCATGT CTGTATCTCT CTAATTTTTC AATCATCACT AAAAACGGCG 2700

GGT-TGl'TGAC TTGGTTTAAA GTTCGGTAAA TGGCAGCTGT GTACTCTTGT TGGTTCAACT 2760

GGATCACAAA ATCCAAGACA GCATCTCTCT CGAGATCGCC TCCTTCATGA CCATAGTAAA 2820

TCATAATAGC AATTCGTCCA CCTTTGACAA GTAAGCCACA TAGCTTTTCT AATGCCTCAA 2880

TCGTTGTCTG CGGTCGGGTG ATGACAGACT TATCAGCTGC CGGCAAATAG CCCAGATTAA 2940

AAATCCCTGC CTTAGCTTTT ATCACAAACT GGTCCAGTGT CTCATGGCCT TGCAAGATTA 3000

ACTGGGCATT TGTCAAGTCA GCCTGATGCA AACGCTCTTG GGTCTTTTCC AAGGCTTGCT 3060

TCTGAATATC AAAGGCATAG ACTTGCTTGG CTAGCTTGGC TAAAAAAAGC GTGTCATGAC 3120

CATTTCCCAT AGTCGCATCC ACTACGACAT CCTCTTTTGT CACGACCTCA GCCAAAAAAT 3180

CATGTGCCAT CTCAAGTGGT CTTTTCATTT TCAAACTCCT GTTTTACAGC CTTGCATCCT 3240

TGAACACTTC CACGACGTCG CATCTCCATC TCAATGCTGT TGAGGACTTC CCATTTATTG 3300

AGGCTCCACA TAGGACCAAG CAGCATATCC CTAGGCGCAT CTCCTGTAAT TCGATGGATG 3360

ACGATATGTT TGGGAATAAT TTCCAGTTGG TCACAGATGA CCCTGACATA TTCGTCCTGA 3420

CTCATCAATT GTAAACGCCC CTCATGGTAA TCTCGTTGCA TACGAGTATT TGTCATAAGA 3480

TGGAGCAAAT GCAGTTTAAT CCCTTGAATA TCGTTATCCG TGACACAACG GCGGACATTT 3540

TCAACCATCA TCTCATGGGT TTCACCAGGC AAACCATTGA TCAAATGGGA AACAATCTCA 3600


ATTTTTGGAT ACTTTCTCAA ACGCTTGACC GTTTCCACCT ACAATTCATA AGAATGCGCA 3660

CGGTTAATCA GGTCAGAGGT TGCTTCATAA GTAGTTTGCA AGCCCAATTC AACCGTCACA 3720

TGCATGCACT CCGATAACTC AGCCAAATAT TCGATGGTTT CGTCTGGTAA ACAGTCTGGG 3780

CGCGTTCCAA TATTGATTCC TACCACACCT GGCTCATTGA TAGCCTGTTC ATAACGCTCT 3840

CGAATAACTT CCACCTTTTC ATGGGTGTTG GTAAAATTTT GAAAATAAAC CAGATACTTC 3900

CGAACATCCG GCCACTTGCG GTGCATAAAG TCAATTTCCT TATAAAATTG CTCACGGATA 3960

GGCGCATCCG GTGCCACAAT GGCATCTCCA GAACCAGAAA CCGTACAAAA AGTACAGCCC 4020

CCATGAGCCA CAGTCCCATC ACGATTGGGA CAATCAAATC CCGCATCAAT AGGGACTTTA 4080

AAAGTCTTTT CTCCAAAGAG TTTTCGATAA TAATCATTCA AGGTATTATA AGATTTCATG 4140

ACTTTCATTA TAACAAAAAT CACCCACAAT CTCAAAAGCC TGACTTTCCT ATAAATTCCT 4200

CTGTTTCTCG TTTCCATTAG CCTTTTTTTA TGATACAATA TGGGTATGAT TTTAATGAAA 4260

TTAGCATCTA TTTTATTATT GATACTGACC TTAGTCGTCT GCATTATCCT AACCAAACTT 4320

TTTAGATTAA AAAAACTAGG ACGAAACTTT GCGGATTTGG CTTTTCCAGT CTTGGTATTT 4380

GAGTATTACT TGATTACAGC TAAAACCTTT ACCCATAATT TCCTCCCTAG ACTGGGGCTA 4440


GCCCTCTCGA 


TCCTAGCCAT 


TATTCTCGTC 




TTTTGAAAAA 




AT n/\ 

•M -/ \J\J 


TACTACCCTA AATTTATCAA ATTCTTCTGG CGTGCAGGAT TCTTATTAAC CCTTATCATG 4560

TATATAGAAA TGATTGTTGA ATTGTTCTTA ATGAAATAGT CGAATCCCTA AGCATTTTCT 4620

AGGGATTTTT GCTTTCTCTA CAAAATAGTA TAGACAATAA CACTATACAA TTTTATACAA 4680

AGAAAAGAGT CTGGGACAAT AGTCTCTTAT ATCCAAAAAG GCAACGGATT TGCCGTTGCT 4740

TTTTTGGATG GTTACGATAG TCTTGGTAAA ATAGAATTGC CCAATAAACC ATTTAGAAAG 4800

GCTATCCCAT GCATATTCAC TATAACACAA ATCAAACAAC TTTACCACTA GAAATCAGTT 4860

CCTTCTTACC ACAAGATCAT CTCGTTTTTA CTATTGAAAA AGTGGTGAAT ACCTTGGAGG 4920

AACGTCACTT CTACACCTCC TATCATGCCT TTGATCGCCC GTCTTATCAC CCT7VAAATGC 4980

TTGTATCTAC TCTTCTATTT GCCTATTCAC AAGGGATTTT CTCTGGTCGA AAAATTGAAA 5040

AATGGAAGAG TTAGTGACCT TAGATTGTTT GTTTATTGAC AGAACTAAGA TTGAAGCCAA 5100

TGCCAACAAG TATAGTTTTG TGTGGAAGAA AACGACAGAG AAATTCTCCG CCAAACTTCA 5160

AGAACAGATA CAGGTCTATT TTCAAGAAGA AATCACTCCC CTTCTGATTA AATATGCCAT 5220

GTTTGATAAG AAACAAAAGA GAGGGTATAA AGAGTCAGCT AAAAACTTAG CGAATTGGCA 5280

CTATAATGAC AAGGAGGATA GCTACACACA TCCTGATGGC TGGTATTATC GTTTTCACCA 5340

TACCAAATAT CAGAAAACAC AGACAGACTT TCAACAAGAA ATCAAGGTTT ACTACGCCGA 5400 

CGAACCTGAA TCAGCCCCTC AAAAGGGACT GTATATGAAC GAACGCTATC AAAACTTGAA 5460

AGCTAAAGAA TGTCAGGCGC TTTTATCTCC CCAAGGTAGA CAGATTTTCG CTCAACGCAA 5520 

GATTGATGTG GAACCTGTCT TTGGGCAGAT AAAGGCTTCT TTGGGTTACA AGAGATGTAA 5580 

TCTGAGAGGG AAGCGTCAAG TGAGAATTGA CATGGGATTG GTACTTATGG CCAATAACCT 5640 

CCTAAAATAT AGTAAAATGA AATAAGAACA GGACAAATCG ATAAGGACAA TCAAATCGAT 5700 

TTCTAACAAT GTTTTAGAAG TAAAAGTGTA CTATTCTAGT TTCAATCTAC TATACAATAA 5760

GAGAATGACT CAAAATTAAA AAGCTAGAGT TCCACAATTG GAAATATCTA GCTTTTTTGT 5820 

GGTTGAGAAC TATTTTGTCT CAGGCTCTTT ATCTTCTATT TAGGACAAGA GTTTTTCTTT 5880 

GGTCTTTAAT GATAAAGAAG GTATCAAAAT TTCTAGTCTT CTTTTTTACC TTTAGTAACT 5940

ACTAATCCTG CACTCAAACC TAGAAGAGTT AAACCTGCTG CTACTGCTGC TTGGCTTGCC 6000 

GCACTACCTG TACTTGGTAA CTGGGCTTTA TTAGTTTGAC TAGCTTCACT TGAATCAATT 6060 

GGTTTTGTAT CTGCTTTTTC TGACACTTGT GGTTTTTTAG CTTCTTGAGC TACTGGTTTG 6120 

GTTCCAACCA AGACGATGCG GTCTGTCGGA ACTTCTACCA CTTCACGGAG TTTTTCTTCC 6180 

TTACTTCCAT CAGGATTAAT CGCTGTAAAG ATACGTTCTT TTCCAACTTT TCCTTCTTGT 6240 

TCTACACCAG TTTCACCTAG ATACAGT'CiTr GAATCTTTTT TCTCAACTGT CTTGTATGCC 6300

AAATCTTTTT CAACAAATTC GATTTTTGGA AGATCTTCTT GTACAGCAGC AACTGTCTTC 6360

TCAGAAACTG GTTTTTCCTT AGTCAAGTGG ATACGGTATT CCTTGACTTG TTTTCCACTT 6420 

TCTGAAACGA GGCGAACAAG TACTGGAAAG CTATCTTCTC CACTATCTAC CACAGTTGAA 6480 

GCTACTTGAT TGTTTTCTTC AACTGAGACT TTTGGCCGTT GACCTTTATA GGTAATTTGA 6540 

TAGTCTTGAC GATTTTCAGC GAAATCAGCA AGTTCTTTTC CATCTACAAG AATCTTTGAT 6600 

TGAGTGCTTT CTTGAGGCAA TTCACTTGGT GCAAGGAAGG TCATCTCAAT CATCGCAACA 6660 

CCGCTCTTAT CTGCTTTACG CTCCATACGC CATCTCATAG CTTTGGCTTT GATAGCTTTA 6720 

AATGTTACGT TGATTTCATC ACCAGCTGCA ATGTCTTTAT CCGCACGATA AGGAACAGCT 6780 

TCCCAATTTT CTGGATTGTT GAATGGATGG TCTGCGTCGT AGGCTTGGTA GTTTGAATAG 6840 

TAGGTTGGCA CTTCAAACTC TGGACCGACA TAGCGTTCTA AAACGAGTTT AGATGGTGCA 6900 

TCCGTACCAC TATCTGCAAA GAACTGAACT TTTCCTTGTG TAACAGTCCG TTCTACAATC 6960 

TTACCATTTT CACGGAAAAT CACACCCGCT GATACTTCTG GATTAGAAGA TGGTGTTGGT 7020 

GACCAGTTTG TCCAACGACG ATTTTCTGAA TGATCTCCGT CATTGAGATA GTCAACGCGG 7080 
TCATGAGAGT TTTTGTCAAT ATCATTGGTT GCTGAAGCAA AGGCCTGGTT ACTGTTTTCA 7140

TCATAGTTAG GGTTATCTGA AAGAGTCTCA CCAAGTTTGT CTGTCACTCG TACAGTGATC 7200 

TCAGCAACAA GGTTACTACC AAGGACACGG CCTCGAACAG TAAATTGACC TGCTTTTGTC 7260

AGATTTTCCG CTGGAACTTC TTCCCATTCA ACTGTCAGGT CTTTTGTTTC GTAGCCGTCT 7320 

TTACCTGTGA AGTAAACTGG AACCTTAGTC GGCAATTCAA GTGCTTGACC TACTTGTAGC 7380 

AAGCGAGCTT GTTTAACCGC AGCAACTGGT TTATGAGAAA GTAAGCTCTT ATCCTTAGTG 7440 

AAGTGCAGAC GGTATTCTCC TAAGATGTCG CCATTTTCAG CTTTCGCGAT GACACGAACT 7500 

GGCTCACCTT CACGAACGCT TGGAACGACG GTAGCGAGAC CATTGTTGCT AACACTTGCT 7560 

GTGACTGCCG GAACTTTTCC ATCTACAGAC TCAAGGTAGT AGTCTGTCAA ATCAGGGTTG 7620 

AAGTTTGCTA AGTCTTTGCC GTCAACTTGG ATTCTTGTTT GTCCTTGCTT GGCTGCCGCA 7680 

ACTTGTTTCG CAAAGATTTG TACCTCTGTG ATAGACGTTC CACGCTTGTT ATCTGCTTTA 7740 

ACCATGCGAA TACGAACAGC ATAGGTTTCA ACTTTATCAA AGCTAAAGTG GTTCATTTCT 7800 

CCAGCCTTGA GTTGAGCAGG GGCTTTTAGA TTAGTAACTG GTTTCCAGTT GGCAGAATCA 7860 

TTAAAGACAT GGTCCTCATT ACCAACAAAA CTAGGGTTTT TAGGAGCTGT TGGGACAGTC 7920 

TTACCAACAT AATACTCAAT CACATAAGAC TTCGGTACAC CAACTCCATG GTCTTCATGG 7980 

AATCCGACAC TTAGATTATC AACGGAGCGT TTGCTCAAGA TACCTGAATC TCCAAACAGA 8040 

ACACCGACTG AAGCTTCTGG ATTAGTACGA TTCCAGTTTG TCCAACGATT GGCTGGTTGG 8100 

TTATTGTAGG AAATGAGCTT GTCATTAACA TTTGAAACTG GGTCGCTTGG ATTTGAGTCT 8160 

GAAGCAAAGG CAAGTGGCAA TTCTGAACCG GTCCATTGGT CAGAAATGTT TGCACCTTGC 8220 

TCAGTTTGAG CAGATACGCG AACATGAAGT TTAGTTGTTA ATTGCGTACC TTCTAAGCGA 8280 

CCATTAACTG TAAAGACACC TTCCTTAGCG TATTGCTCTG GACGAATCGC ATCCCATGCA 8340 

ACCTTAGCTG ATGAAACGTG ACCATTTGAA TCATATGTCC GAACACTTTC TGGTAATTGT 8400 

GGTGCTTCTG CGATTGGAGT TGTCACACTG ACTTCTTCAA CTGAAACGAT ACCTTCTACA 8460 

GAGACTTTTG CACGCGCTTC AAGGTCAATT CCTTCAACTT TACCTAGTAC TTCAAATGTT 8520 

TGATAGGAGT CTAGTTTTTC TTTCGGAATA GCTTGCCAAG TGACTTTATG AGTTTTAGGG 8580 

AAACCTTTGT CATACTCAAC TGTTACTGTT GCTGGAAGAC TTGGTTCCTG ATGCAAATCT 8640 

GTCACTACAT TTACAGGACG GATGGATTGC GCAATCTTCT TCTCAGTATT GGCTTGGATA 8700 

GTGAGTTCAA CTTGGTCTTT AGCTCCCTCA TATTCAGCGT TCAGAGTGAC TGCTCCTGGC 8760 

TTATGCAACT CAAGCATTCC TTTACGAATT GCGACTTCCC CTTCACCACT TGTAGAGAAG 8820 
GTTACTTTAT CAGCTGGTAA TACAGCTTGC GTTCCATCTT GATAGTGAGC TCGAACCGAC 8880

AATTTGACAG TTTGGTCTTC TTTGAGACTG TCAGCTTTTT CCACTTGCAA GCTCAAGTGA 8940 

GCAATTTTTG GCGCTTCTTC T^GGAATTGA ATTGCATAGG TTTGAAGAGG GCCACCATCT 9000 

TTAGGCTGAA TAAAGATGCT CGCACGCATG CCGTTTGCTG CGCTTGCTTG AAGAACTGTA 9060 

ACAGCTGCAT TTTTAGCACT TGCTGTGACT TCTGGCAACT TAGCTCCATA AGCAAGAGTG 9120 

CGGTATTGCA TTGGTTTTTG ACTAGTAAGA CCTGTTACTG CCTCACCACC AACCGTTACA 9180 

GTTGGTACTG CAGGTGCCGC AGGATTGCCT TCTTCTACCA CAAGGGTTGC ATGAATTGGT 9240 

TGACCTTCTA AATAACCGGT CGCTTGAATA CGAGAACCTG GAATTGCTAA CTTAGCTTTA 9300 

TCTTCTTCGG CAATCTCCCA CTTGTCCACT TCATACTCTT CAACACTTCC ATCAATCAAA 9360 

ACATAGGAAA CAGATTTGTC TACAGAATTC AAGTCAGTAT TTGGAGCAAT ACGTTTCACA 9420 

ACTGGTAGCT CTGATTTAAG AGCAATCACT TCTACACGAG CTTCTACTTC TCGTCCGTCA 9480 

GCCATACCTT TCACCGTTAC AATACCAGGC TTGCTCACAT CTACTGAAGA CCAGGTTACA 9540 

GGACGTTCTG CACGGCTACC ATCACTGTAT ACAAACGGAA CAGTGGTAGG CATTTCAGGT 9600 

GCCTCTCCAA TAATGGTCTG TACTTTTGGC ACTTCTGTCC CCAAAACAGT CTTCTCTTGT 9660 

CCTTCTTTCT TACCAGTAAA GACAGTGACT TGGTTCGATT TCAAGAGATC AGAGTGGGCA 9720 

GTCAGGGTGA ATTTCCCTGC TTGTTCAGTT GATTTGACAA TGGCAACACC TTTACCATTA 9780 

AATGCTTTAC GAATCCaAGa ACCATCTGCT TGCGCCTTAT AGCGTTCACG GCTGGCTTGT 9840 

TCTCCGTTAT CTACACCGAC CAGTTGACCT TGGCCATGCA ATTGGAAGCG AACCAGATTA 9900 

TTAGCAGTTG GAACCACATT CCCCTGGCTG TCAACAATTT CATAGTAGAT GTAAGTCAAG 9960 

TCTTTTCCAT CTGCTGCAAT CGCATGGTCT TCCTTAATAA GACGAACTGC CGCTGGCTTA 10020 

CCAGCAGTCG TAATCTTATC TCGAGCAATT TCCTTGCCAG ATTCATCACG AGCAATTGCT 10080 

TCCAAGGTAC CTGGTTGATA GGCAACTTTC CATTCAAGAT AAAGTTCATT AGCATTTGCA 10140 

CCTTCTTGGT AAGTCCGCCC ATCGCTGGTT TGTTTTTTAT TGAAAGTCTT AAGACCAAGA 10200 

GATTTTCCAT TCAAGAACAA TTCTACACTA GAAGCATTCG AATAAGCACG AACTGGAATC 10260 

TTACCTTCTG AGTCAGCTAC TTTGGATGCT AATTCTTTGT TTTCCCAGTT CCAGTGAGGA 10320 

AGAAGGTGTA CCATCGGTTT CTTCTTAACA GAAACCCATT GGCTTTGGTA GAGATAGAAG 10380 

TCATGTTTTG GAATGCCGGC TGTATCTACG ATACCAAAGT AAGAGCTCTT AACAGGAGTT 10440 

TGATTTTGGT TGTGCCATGG TGTAGGTTCA CCAATATAGT CCGTACCTGT CCAGATAAAC 10500 

TGTCCAGCAT AGCCAGCGTT GTCACGGTCA AAAGTCCATG AAGCGGTTGC TGTTTTCCCC 10560 

CAACCCACAC GATCATTTCC ATAATCTGAC TGTTCATAAT TACGCTCAGG TCCATTGCTA 10620 
TGTTTCAATT CACGTTCAGG GCGATAGTAA CTTCCACGTG TACGGGTAGC TGAAGATGTT 10680

TCTGATCCAT AAATCAACCA TTTTGGATGC TTAGCTCTAA GGGCTTTGTA ATTATCTTCA 10740 

GAATAGTTAA ATCCAACAGC ATCGAGTTCA TCAGCAATTT TCTCATGCCC TCCGCTACCA 10800 

TTACCGAAAC GGAATTTATC TGCTCCCATG GTAACATAGC GAGTCTTATC AACATCCTTG 10860 

ATAACCTTAA CCAAACGTTT AACAGTTGCT AAAGAGTGGG CATCACCATT AGCTTCACCT 10920 

ATTTCATTAC CAATTGACCA CATGAAGATA GCAGGGTTGT TTTTGCCTCT TTCGACCATG 10980 

GTACGTAGGT CAAAATCAGA CCATTTTTCA CCTTTTCGAG CTTCTGGGTG AGTGGCATCT 11040 

TTTTCAAAGA AACGTCCATA GTCATAAGGT TTCTTGCCAC CATACCACGT ATCAAAGGCC 11100 

TCTTCCTGAA CGAGTAAACC TAGTTCTGCT GCGATTTGCA AGGTTTGCTC ACTAGCAGGG 11160 

TTGTGGGTTG TACGGATGGA GTTAACTCCC ATCTCCTTCA TTTGTTTGAG ACGGCGATAT 11220 

TCTGCTTTAT AGTTTTCTTC TGCTCCAAGC GCCCCATGGT CGTGGTGCAA GGATACTCCA 11280 

TGGAATTTAA TACGTTCACC ATTCAAAGAG AAACCTTCAT TTGGAGTCCA GTGATAGTAA 11340 

CGGTAACCAA ACAAATCCTT CTTAGCATCA ACCAATTGAC CGTCACGGTA AACACGCGTA 11400 

ATCAATTCGT ACAAGGCAGG TTTGTCATTT AAAACAGTCC AGAGTTTTGG TCTTTCAACT 11460

TCTAAAATCG CATCTAGGCT TGTTGATTCA TGTGCTTTTA AGGTACGACT CGCTGTACGA 11520 

ACTAAGCCTG TTACAGCATG ACCACCTCGT TCAACGATTT GATATTCGGC TArAAGTTCA 11580

TGGTCTTTGT CGTCCGTATT GACGATTTTG CTGGTCACAT GAGTTTCAAC CTTGCCATGT 11640 

TGTTGTTCTT CAAGTTTTGG TGTTAAAATA GTTGTCCCAT TTTTCTCAAC ATGCACCTTA 11700 

TCTGTCACTT GTAAAGTCAC ATCACGATAG ATACCACTTC CTGAATACCA ACGGCTACTT 11760 

GGCTGTTTGT TGACTGCATG GACAGCAATC ACATTCTCAC GACCATCTTT TTGAAGGTAT 11820 

TTGGTGATAT CATATGAGAA CTGGTTATAA CCATTTGGAT AATGCCCCAC TAACTGACCA 11880 

TTGACATAAA CTTGAGAATC CATGTAGACG CCATCAAAAG TAAGGCGAAC ATTTTTCTTG 11940 

AGGTCTTTTT CATCTAGTTT GAAAGTCTTG CGATACCAAG CTTCCCCACC GTTGAGCTGT 12000 

CCACCTTCAT TTTGTGCAGG AGATTCATGA TCGAAATCGT TAAAGATACT CCAGTCATAC 12060 

GGTAAATCTA ATTTTTTCCA CGTAGATACG TCTGCATCAG GTTTAATGGC TTCCTTAGAA 12120 

TTTGCATTGA GTTTAAAGTA CCAATTTTGA TTAAAATCCA CTTTCCTGTC TTCAATCATT 12180

TGATTCACTT CTTCATTTGT TACAGCTTTA GCATCTTCCT TGAGCGGTTT TTCTTGATTT 12240 

GAAGCTTGTG ATTCTATCCT TGGAGCTTTT TCTTCCGGTT TAGCAGACAC TTTTTCCTCT 12300 

TTTGGAGTTA CGGCTTCATC TTCTTTCTTC TCAGATGCAA TAGCCTCAGT TGAACTAGGT 12360 
TCACTTTGTT CTGTCCTTTC AACTATATTT TTAGTTTCCA AAGCTTTATC AGCCTTTTCT 12420

TCTACTATCA TTTTTTCCTC TTTAGGTTTC TCAGCAGTAT GAGTAATAAG TGTTTCATCC 12480 

GCATAAACTA CAGATTCTCC AGCTATATTT CCTCCTAATA AAACTGCACA AGTCCCAATC 12540 

ATTACTGAGC AAGCTCCCAC AGCAAACTTA CGAATGCTAT AAACTCTTTT CCGATTCCAA 12600 

TGGCCTTTCC CCATAAAACC CTCCTTATAT TATATTTAGT GCAGTTAGCT ACTACCAAAG 12660 

CCCAAGTGGT ATACATGGTA TGACAACCTA GTTTCAACAA TTTACACTCT GCGAAAATCC 12720 

AATTCAAACT TCGTCAGTGT CGCCTTGCCG TAGATATGAT TACTGACTTC GTCAGTTTCA 12780 

TCTACAACCT CAAAACCATG TTTTGAGCTG ACTTCGTCAG TTTCATCTAC AACCTCAAAA 12840 

CCATGTTTTG AGCTGACTTC GTCAGTTTCA TCTACAACCT CAAAACCATG TTTTGAGCTG 12900 

ACTTCGTCAG TCTTATCTAC AACCTCAAAA CTGTGTTTTG AGCAACCTGC GGCTAGCTTC 12960 

CTAGTTTGCT CTTTGATTTT CATTGAGTTT ATATTTTATA GGAGCGCATT ATTTTGCTTT 13020 

TGCTGCGTAC TCTTCGTTAC GTTTGATCAT TTGTTTTCTG TACCAAGCAA AGATACCGAT 13080

ATAGAATACA AGGAAGACTA CTGCACCAAG GATTGCTTTG ATATCACCAG TTGTAGTGTT 13140

ACCAATTGTC CAACCAAGAA GTTTTTCGAT TGGTCCTTCA AGAGTAGAGT GAGTAATCAA 13200 

TTGAGTTTGG CTCACACCTT CTGGGAAGGC ACCTACACCT TTAGCAAGTT CTGTTGCAAA 13260 

TGGTGCAATA AGTGTACCTG AAAGAAGGAA GAGTGGCAAC AAGAGTGTTC CGAAGATAAT 13320 

CATACGGAGC AATTtACCAC GAGTTACAAC CAAGAGAGCT GGAGTAACAC CCATAGCGAT 13380 

GATACCTGCA AGTGGCAAGA TACCATTTCC AACTTTTGAA AGAAGCACTG CTTCAATCAA 13440 

CATGATTGGT GCAAGTACGT TGGCACAAGC CCAGATTTCA GCACGACCAG CGATGAATGG 13500 

CCAGTCAAGA CCGATATTGA ATTTACGTCC TTGAAGACGT TTAGTAGCAA CGTTTGTAAT 13560 

ACCTTGTGAT AGTGGTTCTA CGGCTGCGAT GAACCATGAA CCGATAAGTG AGAAGAGTTC 13620

CAAAGATACA CCGGCAGTCA AACCAAGAGA CAACCATCCT TTGATAACAA GACGCCATTT 13680

ATCTGCATCT GCAACACCTG CAATTGGATG TGGAGTTCCC ATAATACCGA TAACGATACC 13740 

AAGGATGAAA CCGATGAAGA ATTTAGATCC CCAGAAACCG ATTTTCTTGT TCAATTTAGC 13800 

AGCATCAAAG TCATATTTAT CAAGGCCTGG GAAGAATTTT TCAAAAATCT TATCCAAAAC 13860 

CATGATAACT GGGTTCATCA TGTAGTTCAT GTGAGTTGAT GTCATTGGTG ATGAACTTGG 13920

GGCGTTAAGA AGGTCATCAA ATGTAGGTTT CATCAAGTCA GAGTTGATAA TTTTCAACAC 13980 

ACCGACAAGG ACGATAGCTG CTGTAGCAAT AAAGAGTGAA ACCCCTTGAC TCACACCATT 14040 

GTTATCAGCA TACCATTTAA TCAAGAGACC TGTGATAGAC AAGTGCCAGA TATCAAAGAT 14100 

ATCGACATCA AGTGTATCTG TTTTCTTCAT AGCTAGCATC ACTATGTTGA CAATCAACAT 14160 
GATGAGCAAG AAGTATAGTG TCCAAGCAGA ACCCCAAGTG ATTGTAGCAA GTGGTGCCCA 14220

ACCAACGTCG GTAATACTCA ATTGGATACC AGTGTTTTCA ACGAATTTTG CTAGTGATGC 14280

TGAGAAAGCA GTGTTTAGCA TACCGATGAT AGCACCGATA CCTGTAAGAG CGATGGCAAG 14340 

TTTGATACCA CCTTCAAGCG CTTTGGAGAA TTTCACTCCA AAAAGTAAAG CCAATACTGT 14400 

CAAAATGATT AACATGATGA CAGGTCCACC CATTTCTAAG ATGGGATTGA AAACCTTTCC 14460

GATTAGGTCA AAGATTGCAT CCATAACAGT TCCTCCCTTT TTGATGTTAT ATGAATGTTA 14520 

ACAAATTAGA ATTAGCTTAA TCCGTCTTCT TTAATAGCTG CTTCAATATT GTCAAATACT 14580 

GGAGCGCTCA TTGCTGGGAT ACGGAATAAG ATTGGCCCAG CTTCGATAAC TGGGATACCT 14640 

GGTTCAAAAC CAAGGTCTGT TGCAGCGATT GGTGTAAAGA TATCGTAACC TTTCATAAGG 14700 

TCTTCGTTTA CATCTTTCAC CATGACTGCA TCACAGTGAA CATCATAACC ACGGTTTGAA 14760 

AGTTCTTCTT CTAGAGCACT TTTAATTTGG TGACTTGAGT TAACACCTGC ACCGCAGGCA 14820 

GCAAGAATTT TAATCATTTA GATTTCCTCC GATTTTATTT TTTAATAGAC AAGATTAAGC 14880 

GGTTGCTTCA GCAATGTAAG TATAAAGGGC TTCTGGTTCA GAAATTTTTG ATAGGTCTTC 14940 

AAGATGACCA TTTCCTGTGA AGAAGTCCAT TAACTGAGCA AGAATGTTCG TTTGACTTGA 15000 

ACTTGAATTA TTAATGATAA AGAAGAGTAG GGATACTTCT ACTTCCTTAT CAGGAGCTAT 15060 

CATATTGTGA AAAGTTATTG GTTTTTCTAA TCGAACAACC ACCACTTTCT CAGCTAGATT 15120 

ATGAACAATA TCTGTGTGAG GAATCGCTAC ATTTGGCAAG TCCTTTCCTA GAAATTCCAT 15180 

ATCTAAACCA GTTGGAAATG ACTTTTCACG CGTGATCAAG GCTTCACGAT AAGTTGGAGT 15240 

GACAATTTCT CGTTCTTCCA ATAAAGTTGC AACCTGATCA AAGAGTTGTT CTTGACTATC 15300 

CGCTTCTAAG CAAAACACAA GGTTTTTGTC AAAGAAATAA TCTAATACCA TAAGTTTTTC 15360 

CGG 15363 
(2) INFORMATION FOR SEQ ID NO: 140: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28882 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140: 

TAAGACTATT TAATAGTGGA GTGAAATAGG ATACGAACAA ATTGATTAGG AAAATCAAAT 60 

GAATTTATAG AAATCTTTTA GCAGTTATGT TATCCTATTC TAGTTTCAAA ACGCTATAGA 120 
AGCAGCATTG TGCTAGTCkA GATTCAGTTT ACTATACTAA AACGAGTAGC TTGAAATCAA 180

AAAACCCACC CTCACAGGCA GGTTTTATCT GTATTATTCA GCTAGATTAT GCTTTACCTT 240 

CTGAACCGAA TACGTCGATA CGTTCTTCAA CCGATGCTTG GATAGCTTTT ACACCGTCAG 300 

CCAAGAATTT ACGTGGGTCG AAGAGTTTTT TCTTGTCGTA TTCTGCTTCG TTTGCTTCGT 360 

AGTCACGAGC AAATTTACGA GTTGCGTTAG CGAATGCGAT TTGGCATTCT GTGTTAACGT 420 

TAACTTTGGC AACACCAAGT TTGATAGCTG CTTGGATTTG CTCATCAGGA ATACCTGATC 480 

CACCGTGCAA TACGATTGGG AATCCTGGAA GAGCTTCTGT CAATTTTTGC AAGTGGTCAA 540 

GGTCAAGACC TTCCCAGTTT ACTGGGTAAG GACCGTGGAT GTTACCGATA CCAGCTGCCA 600 

AGAAGTCGAT ACCAGTTTCA ACCATTGCTT TAGCGTCTTC GATTGGAGCC AATTCACCTT 660 

TACCGATGAT TCCATCTTCT TCACCACCGA TAGTACCAAC TTCAGCTTCT ACTGAGATAC 720 

CTTTAGCGTG TGCTTTTTCA ACAACTTCTT TAGCCAATTT AAGGTTTTCT TCAACTGGAA 780 

GGTGTGAACC GTCAAACATG ATTGAAGTAT AACCAACTTC GATACACTCA AGTGCATCTT 840 

CGTAGTGACC GTGGTCAAGG TGGATAGCTA CTGGTACAGT GATACCCATT GATTCAACAA 900 

GGTTAGCGAT CAAGTTGCGA GCAACTTTGT AACCACCCAT GTATTTAGCA GCACCCATTG 960

AAGTTTGGAT CAAAACTGGA GCTTTTTTAG CTTCTGCTGC GCGCAAGATA GCTTGAGTCC 1020 

ACTCAAGGTT GTTTGTGTTA AATCCACCAA CTGCATAACC GTTGTCACGG GCTGCTTGGA 1080 

CaAATTTTTC TGCTGAAACG ATTGCCATTT TATCAGGCCT CCTGTATATT TTTATGGGTC 1140

ATCCCATTTA CATTGTTCAT TTTATCACTT TTTGCCAAAA AAATCTAGTT TTTCCCGCAG 1200 

TTTCGATTGA TTTTCTTCTA ACTCCATCTA TGTAAACCCT TTCTCTCCCT AGTCTTGGAC 1260

GACTTTTGGA AAATCTATAA AGAAGGTTAA ACTATTCTCC TCCATCTCGA AACGATAAGC 1320 

TAATTTTTCA TGTTCTAATA GACTCTTAAC CACAAAGAGC CCCATACCAG ACCCCTTGAC 1380 

CTTGCGACTG GCATTGTCAG AAAAAGACTG GGCTAGTTTT TCTTGTTCCT CTGAGCTACA 1440

GCTATTTTCG ATAAAAAGTT CTCCTTCTCT TTCTCCAATT CGAACTAAGC CACCTGGAAC 1500 

AGAGTGCTTA ATGGCATTGC TGATGAGATT AGAAAGAATC AACTTCATAA CTGATGGGTT 1560 

TAGATAAGCC TGCTGATGGG TCAAACTATT GTCTATCTGG AGCTCTCTTT CCTTGGCTAG 1620 

CAAGGCATAA TCTTTGACCA GATTTTGCGT CATCTGGAGG AGGTCAATTG TTTCCCTATC 1680 

ATCTCGCAAT TCCTGCACAG AAGAGAGGGA AAGTATCTGC AGAACATGGT GATTGAGTTC 1740 

ATCCACAATC CCCAAGGCAA CTCCCAGATA CTGGTCTCTA TCCTTATAAC GACCGATATT 1800 

CTCTCTCATA TTTTCGATTA GGATTTTCAA ACTAGCCAGC GGTGTTTTCA ATTCATGAGA 1860 

AGCTCCTCGT AGGAATTCGA CCTTCATCTT CTCCAGCTGG AGAATGGCTT CATTCTTTTC 1920 
ATGCAAGTCC GCAATAACAG TCAAGAGATG CTGGTAGAGG CTATTGATTT GTTCCTTGAG 1980

ATTACCTATC TCATCCTTAG AATCCACGCG CAATCGCACT TGGGAATCCA GGTCCATCAT 2040 

CCGACGGGTC ACCCGCTTGA TTTCCAAAAT CGGTGCAACA ATAGTCCGAG CGTAGATGTA 2100 

GGCCACCAAA AGGGAAATCA GAAAGGAGGC CAGCAAGGTA TAGGGAAGAA ACTGGAGACT 2160 

GATTTGCTCC GCTTCCTTTT GTAAATCCAT GGAAGCTAGA AACTGGAGAA TCATAGTACC 2220 

ACCGTCTTGC GTTTTCACCT CGCGCTCCTC AATAAAGAGA GAGGTTGTCT GGCGGTCTGT 2280 

GTCCAGAGGA AGACTGTCCT TGACTTCTAA CTTGTCCTCG GTCATCTCAC CTTTGACGGT 2340 

CCCCTTGATA TCACTAGTCT GGGAATACAA GTCTAACACT TGCTCGATAC TCTGCCTATC 2400 

TTTCCCTTCT AGGGACTGGG CAATGGCTGT TGCCTTTTGA CCAATGGTTT CCTGACGATG 2460 

ACTCAGATAA GTCGAAGGAA AAAGAAAATA AATAGCTAAA TGAAGGCAGA TAACCAGAAC 2520 

ACTAAATATC GAGAAGGTAT AGATAAATAT CTTTGCAAAT AAACCTGTTC GTTTCATTTT 2580 

CGCTCCAATT TATAACCAAC ATTGCGCACA GTGAGGATAC AATCCAAGTC TAGCTTTTTC 2640 

CGCAATTCCT TGATATAAAC ATCAATAACA CGGTCAAAGG GAACCTCATC TGTCGCTTTC 2700 

CAGACGGCAT CGATAATCTG AGATCGAGTC AAGGCCCGGC CTTCATTTTT CACTAGATAG 2760

TCCAGAATTT CCAACTCTTT GGCATTGATA GGCACTTCTT GACCTGCGAG GCTTGCACTG 2820 

TAGCTTTCAA AGTCCACCTT GGTATCCTTG TAAGAAAAGA TTCGTCCTGT ATCCTAGTAG 2880

CGCTTGAAAA TCGCGTCCAC CCTCACTTTT AAAAGGGAGA GGGAGAAAGG TTTTTCCAGA 2940 

TAGCCATCTG CCAAAGAGGC AAAGGCACTC ATCTTGTATT CCTCATCTTG AAAAGCTGTC 3000 

AACATCAAGA CAGGAACCTG ACTGGTTTTA CGAATCTCAG CTAGGACTTC TAAGCCGTTG 3060

AGCTTGGGCA TCTGGATATC CAGTAAAACC AGGGCCACCT CATAGCTAGA AAATTGCTCC 3120 

AGAGCTTCCT GACCGTCCGC TGCCTCAATA GTTTCATAGC CACAATCCGT CAAATAATCA 3180 

CTGACCCCCT CACGGATCAT CTCTTCATCT TCTACAATTA AAATTTTCAT ACTTTAACTG 3240 

CTCTCTATTT TTTATTTTTC TTAGAATAAA TACCTACCCT ATTTTCTATT ATAGTCTCTT 3300 

GCTGGCCTTT TGTCTGCAAG CAACTGACCA CTAGATAAAA CGTTGTGAAA TTCCTTTCTC 3360 

ATAAATTCCA TAACTTTAGT ATATTATATT TAAGCACTAA AGTACAAAGA AAGCAACTGA 3420 

AAGCAATGAT TTTCACCACT GCTTTCGGAT TTATTTTGAA TTGTTAAATA GCCATTCCTA 3480

TCCACTATTC TTGAATAGAA ACACAAGATG CAATCTTTAT TCTAGACTCA TTTTTTCAAA 3540 

TTTATTCACC ATCCAGCAAG AGCTCTTTTG GTTGTTTTCT AAGGAGATTG CTTGAAGCAA 3600 

GCGCCATAAC GAGAACCACT AGAACCAAGG CAAGGACAAA AATGATGATA AAGTCTGATG 3660 
TCTGAATGGA AATGTCTAGG CTCGACAAGG TCTTGCTAAA GCCATCTACT TCTGCACCAC 3720

CACCAAGGTT AGAGGCTTGA GCCGCCTTAC TAGCCTGTTT GGCAACACCT GAAGTCACAT 3780 

TGGCAAGGAC AGTGTTTCCA ATTGCACGGG CAGTGTAATT AGCTAGGAAG TAAGCAGAAA 3840 

CTAGAGCAGG GATAGCAATC AAGATAGATT CGGTGATGAA TTGACCCAAG ATACTTGCCT 3900 

GCTTGAGGCC GATAGAGAGG AGAATTCCCA CTTCCTTGCG ACGGGCGTTG ATCCAAAGGC 3960 

TGAGCAAGAG GGCAAGGAGG AGAACTGAGA AGCTCAAGCT ACCCCAGAAG AGGAGGTTGG 4020 

CCATCTTGTA CATACCAGAG ATAGATTGCT CAAGAGCTGG GTAGTTAGAG GAGCTCTTGA 4080 

CGAGTGTGTA GCTCTTCCAG TTGATACCAC TGATGCCATT CAACTCTTTC ATAACATCAT 4140 

CCAAGTTCTT GTCTGCTGTT ACAAAGAAGG TTGCGTCCCC ATAAATGGCT GTGTCTTCTG 4200 

TGTATCCATA AAGTTTTGCA GCAGTGTGAA TGTCTGTAAT AGCTGTGTTT TCGTAAAGTT 4260 

CTTGTGAGTA GGTTACTGCT GACTTATTAT GACCATCAAA GAGTCCCTTG ATTGTCACTT 4320 

CAACTGTTTC CTTGGCTCCT TTTTCATTAT CTGCATCGTA GATATTAGAG TCCAGTTTAA 4380

CCTTGTCCCC TACTTTCCAG CCGTGTTTGG CTGCCAAGTC CTTGTGCAAG AGGATTTTAT 4440 

CCTTGTCGTC GTTGGTTAAG TGCTCTCCTT CGACTAGTTT ATAAGAACCA GAGACAAACT 4500 

TGTCTTCTTT AGAGGAGTCA TTGACACCTG TAATCATCAA GCTACTTCCA AAACGCTTGG 4560 

CACGATCAGC AGTGAGATTC TTCTTGGTTT CTGGCGTTTC AATCAGGTCA TATCCAGTCA 4620

AatcTCCGAT AGCGTTGATA CGTTTGACAT AAGACTCAAT GGCCTTGTTT TCGGTGATTT 4680

TTTTGATGTC TTCACCCTTG ATATTCCCAG CACCACGAGG CGTTCCTTGG TTGACGCGAC 4740

GATTGATTTG CATGGAGAAG CTATTGGTGA TATTTTTAAA GGTCTCCTGA GAAGCCTTGG 4800 

CAGTAGCTCC CTTGATTGAC AAGCCGACCA AACTCAAGCT CGCCATGAGG AGAATAATCA 4860 

GGAAGATGAC AATCGATTTG AAAAACTTCC TTGTAACATA GGCAAATGCG TTGTGTAACA 4920 

TAGATTCCCT TTCTAGATTT TGTTTTAATC ATTCTATTAA AATAAGCTCA AATTATTTAC 4980 

TAGTATTGCG CGTTTCAGTC AGTTTCTTAT CCTTTAATTC AAGTGTAATA TCTGACGCTT 5040 

GTGCCACTTC TTTACTGTGA GTTACGACAA TCACACATTT ACCTGTTTTC TGGGCAAGTG 5100 

ATTTGAGTAG TTCGACAATA TCTCCAGCAG TTTTAGGATC CAGATTTCCT GTTGGCTCAT 5160 

CAGCTAGAAT AACTGGAGCT TCTGAGACCA AACTGCGAGC AATGGCAACA CGTTGCTGTT 5220 

GACCACCTGA TAACTGGAGA ACATTCCGCT TGATCTGGCT TTCATCCAAA CCAAGCTCAA 5280 

GAAGTGTATT CTTGCTTGCC TTTTTGTTGA CCAATCGGAT ATTTTCCAGC GGAGAAAGAT 5340 

AATCTATCAA GTTATAATTT TGAAAGACCA GGGAAATATG GTGCATGCGA TGGTAAGAAT 5400 

AGCCCTTCTT ACGAATATCC TCTCCTTGAA AAAGGATAGA ACCTTCAACA GGACTATCTA 5460
GACCAGCAAG TAGGGACAAG AGTGTGGATT TTCCTGCTCC TGACTCCCCA ATAATACTGT 5520

AAAATTTTCC GGGTTCAAAA TTATAATTGA TCTGATATAG GACTGCTTCA GCAGTATTCT 5580 

TATAACGGTA GGTAACATCT TGTAATTGTA ATAAAGTCAT GATTTCTCCT TCTTAACTAA 5640 

TAGATGATAA AATTTCTTTC GGTGATTTTC TAAATAAGAA TAGGAAACAA AGGGCTACAG 5700 

ATAAGCAACT AAGCAGAACT AGAAAAACAT AGGATTCTGC AAAAGATAAG ATGCTAGTTG 5760 

ATAAACTGCT TGCTTTGGCT AGTGTATCTT GTAAGCTTGC CTGATCTCCA CTTGCTAGTA 5820 

GAGTTTGGAG TAGGTAAGTT GTGATTGCGT TTCCTGCAAC AAATGCTGGA AGCAAAGCTC 5880 

CAAGAGATAC CAAAACTACC TCTAAACAGA ATTGTAGGAA GATCGAGCTC TTGCCTTTTC 5940 

CAAGTGCAAG TAAAATCCCC ACTTCATAGA CCCGTTCTCT CAACCAGAGA GACAAAACCA 6000 

GAATTAAGGC TCCAGCTCCT GCTATCAACA TCCCATAAAG GAAGATGGTC AGGAAGGTTT 6060 

GGAAAGTTGC AACTGAGTCT TTGATTTGTT CAAAAGCCTT GTTTTCCTTT TCGACTTGGT 6120 

AGCCTTGATT TTCCAAGGCC AAGTTTTCTA CCTGCTTCAT GAGTCCGTCC ATTTCCTTAG 6180 

GATTTTCTAC ATAGAAGCGT GCTGCACTGA CTTGAGCTTC ACTATTGCCC AAAAGGGTTT 6240 

GGCTACTTTC ATAGTCTGTA AAGACTTGAT TTTCACTGAA GTCAGAAGAC AAGCCTGTGA 6300 

ATTTCTCTTG TTTTTTACCA GAAAAGATGC CGATAATCTC AAACTCTACT GTTTGTCCTT 6360 

TTCCAGATTC AGACTGACCA GCATCCAAGC CAATCTTGTC ATGAAGCGAA AGACCGTTCT 6420

TCTTAGCCAA TTCTTCGTGG ATAAGGATTT TCTTGGAATC CCCTTTTTGA AGGTGTCGCC 6480 

CTTCTTTTAG ATTGAAAGCC GAACTGGTAA AGGTTACATC CTTGGATGAA TCCTCAAGAG 6540 

CCGTTAAGCT AACCAAGTTA TTGTCTGCAG CTGATAAATC ATCACGCTCC ACGCTCTGCT 6600 

CGCCAGTCAC TGCTTCCTTG TCTTTTAGTT TTGCGACCGT CTCAAGTTCA GGAGAGACAT 6660 

TTTCCAGCCC CTTAATCTTG CTTACAGATG CTAGGTCTGA CAACTTGAAT GTCTGACCAT 6720 

TCTCTATCTT CTTAATAGAA AAAGATGTAT TGAGTGATTT ATAAAGATTG CTTTCTACTG 6780 

TTTTGTTGGA CTTCATCAGA GTCAAACAGG CTGAAATTCC GGCCAATAAG ACCAATAAAA 6840 

TCAGAAATAA AATAAAACTT CTCAGTCGCT TTCTGCTGAC ATAAGCCCAA GATCTTTGGA 6900 

TTGGATTCAT TTGTCACCTC CATATTTGTA AGACTATTAT AAAACCCAAA TATGAAATAT 6960 

TTATGAAATA CGAAAAAAAA ATATCGAGTA GGGGATAATC TCTAGCCCCT CTCACACCAC 7020 

CATACGTGCC GTTCGGCATA CGGCGGTTCA ACTAACTTTT AACGCATGTC GTTCAAGGTA 7080 

ATAATCCAAA CACGAAACCA GTCCACGTTT TTCAAGGACT GGTTTTGATA TAGCACGTTT 7140 

AAGTACCGAC TTCTGAGCTA CTATAGTAGA TTGAAACTAG AATAGTACAC CTCTACTTCT 7200
AAAATATTGT TAGAAATCGA TTTGACTGTC CTGAACAATT CGTCCTATTC TTATTTCATT 7260
TTACTATAAT TGATAGTGGT CGCCCCAGCC AGATACCTTA TCTGCTATCC ATTTAGGAAC 7320
CCCTAACTTA AGCAATCCCC ATAATCGTCT CGATTTCTTC TTCCATTGCT TCCAGATAAT 7380
CACTCGTAGG CGAGTACGCA AGCGCTCATC TATGCTAGTG ACTATACTTT TCATATTTAT 7440
AATTCATTCC TTTCGTTTCA CTCAAGGCAC AACACAGAAT GAAAAAGTGT TGTGATCTTT 7500
ATTTTGTTTT ATAATAATAG TGAGAAAACC TATCACTACT ACAAATCACG GGGAGGTGAA 7560
TAAGTGAGTG GTACAGCCAC TACCTCGCAT ATTTTGTCAC ATCATTTAAC GGTACATAAT 7620
AAGTTGTACC ATCTGAATAA GTTGCTACAA TATCATTTGC ATGCTCTCCT TCACCTTTAG 7680
CAAAGGTTGG AGCTCCTGCT GGATGATTTT TATTTGCCTC TTTCAATTTT TCAATAATGG 7740
CATTTTTTCT GTATCTTTTA TATTATCAGG ATTTTTCACT AAGATTTTGT CTGGATATGT 7800
CGGTTTAGCA GAAACAATTT TTACTGTTAC TTCTTTTTTA TTCGAAGCAC TTGTCCAGTT 7860
TCCAGCATTA TCTTTAGCAT TTAATTTTAC AGTAATTCCT GAACTAGGAA CTTCAGTAGC 7920
AGGTTGATTA TCAACATTAT TCAACTTTAA TTTCAAAAGA GCTGTTGCAT CAGACGTTTT 7980
ATCAATCGTT ATATATAATG ATGAATTGTT ATTATAAACA GTTCCTTCAT ATTTAGCTGT 8040
TTGTGAGCTA CTTGATUVCAG AACTGAAATT ATACCCACTA CCTCCCTGAT TATCTTCAAT 8100
GCTTACGTCT AAATGAACTT CCCCACTATT ATTTGGCTTA GCAACAACTG TTATAGTAAA 8160
ATAACATAAA ATTTGCATAA ATAGATTAGG GAAATCAAAG CAGCTTCTAG GAATGTTTTA 8220
GCAGTCACAG TGTACTTTCC CAGCATCAAG CCACTATAAC TCTGCACATA AAAATGGAGA 8280
AGATGGCAAT CCTCTTCTCC AAATATTAAC TTCTTTACAA ACCAACTATA GTTGACAAAG 8340
AACCTAAAAT CAATTGATAA CACAAGGTCA GGTCGGTCAA CTCTTTCAAC TGAAGCCCTG 8400
TCAACTCTTC CCATTTATCA ATCTTGTATT GGAGAGAATT GCGGTGCAGA TAGAGTTGCT 8460
GGGCTGTTTT AGTGAGAACA GCACTATTTT CCCAAAGAGA GAGAATGATT TCCTGAATCT 8520
GATCTTGATC CAAAATCATC TGGTGTAGAC ATTCCTTGAT TGGCTTCAAG TCCACGAGTC 8580
TTTCTCCCAT ACTCCAAAGA TAGAGCTGAG AAAAAGTATG AACACCTTGG TGACCCTGAC 8640
GCCACCATGT CTTGAACAAA TCCCGCTCAG CTTTGATTAA GTCTGATAGG GCTTGATGTC 8700
CCGTCTGAGA CCAAACCTGA CCCAACATGA TAGAAAGACG AAGTCCAAAG TCATACTCAA 8760
CCGCTTCAAT CGTATCACTT AAAATATCTC TTACAGAAGT GTATTTGTCT TGTTGAAGCA 8820
CGAAAACATA ATCCTGAGCT CCGACCTGTA GCACTGTCTG ACAATTCGGA AAAAGAGTCC 8880
GCATCATATC TAGCCAAGAA GCCAGATTTT CCTGCTGAAA ATAAGAAAGA TGGCAATAAA 8940
CCAACTGAAT CTTTTTAAAA ACTTGCGGTG CCTGTCCCTT GCCCTCAACC AGATAGGAAT 9000
ACCAAGGGTT TAGCGAACGA GCCTGCTCCT GCTGGGTCAA AAGGGCAACC AACTGCTTTT 9060

CACGCTCGCT GAGCCCAGCT TCCTCCAGCA AAATCCACTG CTGAGAAGCT AAAGGGAGCG 9120 

TGAGATAGCC CTCTTTCTCT ACTGGTTGGT CTGAAATCCG AGCCTCAGGA AACCAGTCTT 9180 

GTAGTTCTTT TGCCCTCATG TTCTAGCCCT CCACTTTTTG GATGCACCAT GAAACCAAAC 9240 

TCTCAAGACG TTCCAGATTC TCAGTCATAT GGAGATAGCC CATAACCGCT TCAAATCCCG 9300 

TGGACATACG ATAAGTCACG ACATCTGCAT TTTTAGCCTT TGTGTGGCTA TTGGTATTGC 9360 

GGCCACGTTT GTAGATTTCT TCTTCTTTTT CCGTTAGGAC CTGCTCCTCC AACATGAGAG 9420 

CAATCAGGCG AGCCTGAGCC TTGGCTGACA CGTACTTAGT TGCTTCTTGA TGGAGTTTAT 9480 

TGGGTTTGGT CATACCTTTG AGGATGAGGT GACGGCGAAT ATACATAGAA TACACCGCAT 9540 

CCCCCTCAAA GGCTAGCGCA ATCCCGTTAA TGAGATTGAC ATCAATCACG TGTCCACCTC 9600 

ACTCCATCCT TGGTATCAAG GAGCTTAATT CCTTGAGTAA CCAATTGGTC ACGGATTTGG 9660 

TCTGCTGTCG CAAAGTCACG ATTGGCACGC GCCTCTTGGC GTTTTTGAAT CAAGTCTTCA 9720 

ATCTCTGCAT CCAAAACTTC CTCAACAAAG ACAATTCCAA AAATTTCTAA CATATCTGCA 9780 

AGAGCTTGCT TGACACTTGC ATCATAGTTC CCTGAGTTGA TCCATTTGGC CATTTCAAAG 9840 

ACAACTGTGA TACCGTTGGC AGCATTAAAA TCTTCATCCA TAGCTGCTAC AAACTTATCT 9900 

TTAAAGTTTT GTAACTCTTG GGCATCCACA TTTCCTGTAA ATGGTTGTTC GTAAGTATTC 9960 

TTGAGATACT TGAGATTGGT CTCGGCATCG CGAACTGCCT TTTCCGTGAA GTTGATAGGC 10020 

TTACGGTAGT GCTGGGTCGC AAAGAAGAAA CGAAGTACTT GCCCATCAAG AGTTTTAAGG 10080 

GCATCGTGTA CCGTAATGAA GTTACCCAAG GACTTAGACA TTTTGACATT GTCGATATTG 10140 

ACAAAGCCAT TGTGCATCCA GTAGTTAGCA AAAGCCTTGC CTGTTTTAGC TTCAGACTGG 10200 

GCAATTTCAT TGGTGTGGTG TGGAAACTCT AGGTCAGCTC CACCACCGTG GATATCAATG 10260

GTATCACCTA AAATCTCTGT CGACATGACT GAACACTCAA TATGCCAACC CGGACGTCCA 10320 

GGTCCCCAAG GACTATCCCA AGAAATCTCA CCTGGTTTGG AAGATTTCCA TAGAGCAAAG 10380 

TCTACAGGAT TTTCCTTACG AGCCGTTTCT TCATCGGTAC GACCTGAAGC ACCTAGCTCC 10440 

AAATCTTCCA AGGTTTTATT AGCCAATTTA GCATAGTTGT GGGATTTTTC TACACGGAAA 10500 

TAGACATCCC CTTGACTCTC ATAGGCAAAG CCTTTCTCGA TCAAGTCTTC CACAAAACGG 10560 

ATGATGTCTG CCATAAACTC CACTACACGC GGATGGCGAG TCGCAGGTTT CACGCCCAAT 10620 

GCCGTCACAT CCTCACGAAA GGCAGCGATG TACTTATCCG CAACCTCCTG AGGCGTGATA 10680 

CCTTCTTCCC TGGCACGGTT GATAATCTTA TCATCCACAT CTGTAAAATT GGAAATATAG 10740 
GCAACCTTAT ACCCACGGTA CTCAAAATAG CGACGAATCG TATCAAAAGC TACCGTCGAA 10800

CGGGCGTTTC CTACGTGGAT ATAGTTGTAC ACCGTTGGCC CACAAACATA CATCTTGATC 10860 

TTGCCGTCCT CAATCGGGAC AAATTCTCGC AAATCACGAG ACATGGTGTC ATAGATTTTA 10920 

ATCATAAATC ATAATCAGGA AAGCTGAAAT CCAAGAACAA TTAGTTTCAT CACTAAAAGT 10980 

TCAAGTAAAT TTCAGTCCGA ATATCTCTAC ACTTCGGAAT CCCTTGCTCC TTTCTCATTC 11040 

AGATAAACCA CCTGAGTCTG TTTGACAAAG CCAATTTTTT CATACAAACG TTTGGCACCT 11100 

ACATTGCTAT CTTCCACTGC AATCTGAAAT TCCTTGTCAT TTTGCTCAAT TAGTTGGTTG 11160 

ACGAGGGATT TTGCTAAGTA GCTTCCATAG CCTTTTCCAC GTTCAGGTTC CAATATTGCT 11220 

AAACCGTAGA GGTAATTCGT ATTAGTCGAT AAATCAACCG TACAAGTTCC AATAACCTGA 11280 

CCAGCTTTTA ATAAAATATA TAGTCGGCTT TCTGGATCTT TCAGAGCTTC AGCGACATAT 11340 

CTATCCACAA CTTCTCTCGA TTCATGTTCC TCTGAAAATG CCTGAAATTT TAATTGACTA 11400 

ATTTGATCCT GATACGAACT ATCTGCTAAC AAAACTTCAA GATGGGAAAC ATTTGCTAAC 11460 

GGATAAGGTC TTCTATCCTT ACCTAACCAA GTTTCTGTCT CTTCATCCTC GATTAGTCCC 11520 

CAGTTACTGG CAAAGTCAGG ATGATTCTCT AAAAAAATAC GTTCTGTCTG AAAAGTGACT 11580 

GACCGAATGG GGAAAGAAGC TGTTTCTCTC TCAAAACTAG TAAACAATGC ACGCGCAATC 11640 

CCCTGACGGC GATGACCTGG ATGAACCAGT ATCGTCACTT CTACATCTTG GTCATCTGCA 11700 

TAGACAGTTA ATAAACGAAC AAGrrCGCCT TTTTCATAAT AAAGGAAAAA GGCGGGCATG 11760

TTTGGGTCAA AATTAAGCAT GTTAGAGAGA TAGGGATCGC GATAGGTACC GTCATAGTTT 11820 

TGGCAACAGT TAATTACTTT TTTCGCCTCA GATAGCTCCT CTTGGCTTAA CTTGTTTCTT 11880 

GCTTGAATCA TATAGGTATC CTCTACAAAC CAGACGATCT GTGACTGGCA TCTTTAGCCT 11940 

GCTCGAGTTT ATTGACATAA TACTCTCGTT TTTCTTCGAC TTCGTGAATG ACAGGCTCAT 12000 

CTTTCTTACC ATGAAGACGG ACAATCTTGG CCGGAATACC GACAACCGTC ACGTCACTAG 12060 

GTACATCTGC TACGACAACT GCTGCAGCAC CGACCTTGGC ATTTTCACCA ATTTCCACAG 12120 

GCCCGATAAC TTGGGCATGG GCTGATATGA GGGCTCCCTT TCGTACAGTC GGATGGCGTT 12180 

TGCCACAGTC TTTCCCTGTT CCCCCGAGAG TCACTCCGTG ATAGAGAAGA ACGCCTTTTT 12240 

CAACAATCGC TGTCTCTCCA ATCACCAGAC CAGAACCATG GTCAATAAAA ACACCTGAAT 12300 

CAATCTGGGC TCCTGGATGA ATCTCAATCT GAGTCCAAAA GCGCCAAAAC TGACTGTACA 12360 

TACGAGCTAA TAGTTTGAAG CCGTGCTTCC AGAGAAAATG CGAGAGACGG TGGGCCGCCA 12420 

AGGCCTTGAC ACCTGGATAA GTCAGCAAAA CCTCCAAAGT GGTGCGGGCC GCTGGATCAT 12480 

TTTCTTTTAC AATATCAATG GTTTCGCGCC ACCACCCCAT ACATTTCTCC TTTTCTTATT 12540 
CTGAATCTTT TGATGTTTCT GTAAATTCTT TCTTAGGTTT GTAATCCTTT TGATGACGTG 12600

GGCGGTGAGG GCGCTCAGAC TTTTCACCTT TTTCATCATG CTCAGGTTTT GGCGGACGAG 12660 

GTAGAAGAGC CTTCATAGAG GCATCGATAC GGCCTTTTTC ATCAATTTTG ATAACCTTAA 12720 

CATCAACTTC ATCCCCGATT TCTACCAAAT CCTCTACACG ATTGGTACGA GTCCAAGCCA 12780 

TCTCAGAGAT ATGAACAAGG GCATCTGTCT TATCAAAGAG GTTAACAAAG GCACCAAATT 12840 

TCTCGATACG AACGACTTTA GCACGGTAAA CTTCATCCAC TTTGGCTTCA CGAACCAAAC 12900 

CAGCAATAAT TTCTTTGGCA CGGTTAATAG CATCTTGGTC ACTAGAGTAG ATAGACACAT 12960 

TTCCTTCTTC GTCTATATCA ATCTTAACAC CTGTTTCAGC GATAATCTTG TCGATGGTTT 13020 

CTCCACCCTT ACCGATGACA ATCTTAATCT TGTCCACATC AATCTTGATC GTATCAATTT 13080 

TCGGAGCAGT TGGAGCCAAT TCTGGACGAA CTTCTGGAAT GGTTGCTTCA ATGACATCAA 13140 

GGATTTCAAA ACGCGCTTTC TTGGCTTGAG CAAGAGCCTC CGTCAAGATT TCTGCAGTAA 13200 

TCCCTTGAAT CTTGATATCC ATTTGAAGGG CTGTAATCCC ATCACGAGTA CCTGCAACCT 13260 

TGAAGTCCAT ATCTCCAAAG TGATCTTCCA AACCTTGGAT ATCTGTCAAT ACTGTGTAGT 13320 

TATTTCCATC TGAGATAAGC CCCATAGCAA TACCAGCTAC TGGCGCCTTG ATTGGCACAC 13380 

CACCAGCCAT AAGGGCAAGA GTTCCCGCAC AGATAGAAGC TTGAGATGAA GAACCGTTTG 13440 

ATTCCAAAAC TTCTGCTACT AGACGGATAG CGTAGGGGAA TTCTTCCAAG CTTGGCAAGA 13500 

CTTGAGCAAG AGCACGCTCA CCAAGGGCAC CGTGACCGAT TTCACGACGA CCTGGCGCAC 13560 

CGTAACGACC TGTTTCCCCT ACAGAATATT GAGGGAAGTT ATAGTGGTGC ATAAAGCGTT 13620 

TCTTGTACTC TGGATCCAAA CCATCAATGA TTTGAGTTTC TCCCATCGGA GCCAAGGTCA 13680 

AGACTGAAAG AGCTTGAGTT TGCCCACGAG TAAAGAGACC TGAACCATGT ACACGAGGAA 13740 

GGAAGTCAAC AACCGCATCC AAAGGACGGA TTTCATCGAC CTTACGACCA TCAGGACGCA 13800 

CCTTGTCTTC TGTAATTAAA CGTCGCACTT CTGCGTGTTC CATTTGTTCC AAGATTTCAG 13860 

CCACATCACG CATAATACGG TCAAATTCTT CGTGGTCCGC ATATTTTTCT TCGTAAACGG 13920 

CAGTCACTTG GTCTTTCACT ACTTGAGTCG CAGCTTCACG GGCCAATTTC TCTTCTACTT 13980 

GAACTGCCTT TTGGAGGTCA CTGTTGTAGG CTGCAATGAT TTCAGCTTGC AATTCAGCAT 14040 

CCACGTGAAG CAATTCCACT TCTGCTTTTT CTTTACCGAC AGCAGCAACG ATTTCTTCTT 14100 

GGAAGGCAAT CAATTCTTTG ACAGCTTCGT GCCCTTTAAG GAGCGCTTCC AACATGATTT 14160 

CTTCTGACAA TTCTTTGGCA CCAGACTCTA CCATGTTGAT AGCGTGCTTG GTTCCAGCTA 14220 

CTGTCAATTC AAGAAGAGAT TGCTCTGCTT GTTCTTGACT TGGGTTGATG ATGATTTGGC 14280 
CATCTACATA TCCCACTTGT ACCCCAGCAA TTGGTCCGTC AAATGGAATA TCTGAAATAG 14340

ACAGTGCCAA AGATGAACCA AACATAGCAG CCATTGGTGC AGATGCATTT TCATCATAAG 14400 

AAAGCACTGT ATTGATGACT TGGACTTCAT TACGGAAACC TTCCGCAAAC ATAGGACGAA 14460 

TCGGACGGTC AATCAAACGC GCTGTCAAGG TCGCATCTGT TGAAGGACGT CCTTCACGTT 14520 

TCATAAAGCC ACCAGGAAAC TTCCCAGCCG CATACATTTT TTCTTCGTAG TTGACTTGGA 14580

GTGGGAAGAA ATCCCCAGTT GCCATTTTCT TAGACATAAC GGCAGCAGTC AAGACAGTTG 14640

ACTCACCGTA ACGTACGACA ACAGATCCAT TTGCTTGCTT AGCAACCTGA CCAGTCTCTA 14700 

CAATTAACTC ACGACCCGCA AAAGTCGTTT GAAACACTTG TTTTGCCATT TTAATCCCCT 14760 

TTGGATTGAT GAAATTATAC GCCTTGCCTA CAAAGATCAA GATACCAAGG ACGTCAAAAG 14820

CAAAGTAAAA ATAGGAAACT GACGAAGTCT TCGATGAAGA CAAGACAGTT TATCTTTTTT 14880 

ACACAGCTTT TCGGCCGTGT TCAATTACAC AAGATATTTT GGACGGTTCG GCTTGCCGAA 14940 

CATTTCTGTA GAAAAATAGG AAGGTGACGT CGCACTCGAC GAGTGCTAGG AAGCTTATCT 15000

TTTTTCCTAA GAAATGAGAC CAAAATTCAA GTCATCAAGA TACCAAGCCG TCAAGCAACT 15060 

CAAAGGAAGA TAGGAAATCG AACGACGGAG CGACTACTCC TAGGGAGATT TATCTTTTTC 15120 

CACAGAGTTG TAGGCAAGTT CAGTTTTCAA GATACATCAT TAGAAAGGTT TAATACTAAA 15180 

GTATCTAAAG CTTTCACGCT AATCGCTATC GGGCGATTAG CTAAATGCTT TACTAACTCT 15240 

CTGGxCAAAt AACATCGATT TGACTCACTC GTGTCGTTAA ATCTTACAGT TTAAATGCAT 15300 

TGTATTATTT AATACCTTCA TCTTTGTATC AAGTACGTAC AGAATTTATT TTATCATATT 15360 

TTTCTTAAAA AGTGAGGTCT TTACCATTAA AAAGGAACCA TTCCCCTCAC CTGAGAAGAA 15420 

TGGTTTGCTT TTATTATCCT AGAGACTGGT GATTAAACAA GGCATGGGTT GCTTGATGGA 15480 

TGTATTTTGC TGTATCAGCA TTATTCATCG TATAGAGATG CACACCGGCA ACATCCTGAG 15540 

TTACCAAGTC CACGATTTGG TCCACTGCAT AGGCAAGTCC TGCTGCTCTG AGCGACTCAG 15600 

GGTCATGCTC ATACTTGTCT AAGATGGCTT TAAATTTGCG TGGAAGATGG ATATTCTCAC 15660 

AAGTCTTCAA GAGTCGGAGA GCCTGATTTC GATTCAGAAT TGGCATAATT CCTGCATGAA 15720 

TGGGAACATC AATCCCAGCC AAGATACACT TGTCCTGAAA ATCATAGAAG CGCTCATTGT 15780 

CAAAGAAGAG CTGAGTTACG AGGCTCGAAC AGCCTGCATC CACTTTCTTC TTAAGATTTT 15840 

GAATATCTGA AATCTGATTT GGCGAATCTG GATGCCCTTC TGGATAGCAA GCTCCAATAA 15900 

TATCAAAGTG AGGGGTTTGT TCCTTGATAA ACTCAATCAA GTCGGTTGCA TAGCGGAAAT 15960 

CCTTTTGTGG TTCCACGTCT GGAATAATAT CCCCACGAAG AGCCAAGATT TTCTGCACCC 16020 

CAACTTTGTC CAAGTCAGCA ATAGTTTCAG CAACCTTGTC CTTAGTTAGA TAAATAGCTG 16080 
GCAAGTGGGC AATGGTCGGA ATCGCCAAAT CATTTTGGAT AAAGTCAGCC AAACGAACCG
TCGTTTCCTT GATATTAAAT TTATTATTGC TGGCAGTTAC ACTGATAAAA TGGGGAGCCA 
ACTCCTGCAT ATCCTGCAAG GCTGAAATAA TGTTATCATT ACCCACGGCT GGGTTTGGAG 
GGAACACTTC AAATGAGAGT GACGGTGTTT GGCGTGACAT ATGTAATAAC CTTTTCTAGT 
TGATTTCTTT TTGAACAACC ACTGTATGGA GAGAAATCCA ATCTTACAAT TTCTCACGCG 
CAGCTTTAGC TGCTTCAACA AGGCGGATCA AGCTTTCTTT TGTTTCTGGG ATACCACGTG 
TTTTCAAACC ACAGTCAGGG TTGATCCAAA CTTTCTTGCT TGGCACTTTA GCAAGGATGG 
CTTCGATTGT GTTGTCGATT TCGCCTTCAT TTGGTACACG AGGTGAGTGG ATATCGTAAA 
CCCCAGGTCC CACTTCTGTT TGGAAGTTTT TCGCTTTGAG TTCGTCCAAG ATTTCAAGGT 
TTGAACGGTT AGCTTCAAAG GAAATAACGT CTGCATCCAT GTTATCGATA GCTGGGATGA 
TATCTGTAAA TTCTGAGTAA CACATGTGAG TGTGGATTTG TGTGTCTGGC GCTACTGTTG 
AGTGTACCAA GCGGAAGGCA GGAATAGCCC AGTCAAGGTA GTCTTCGTAC CAGTCGCTAC 
GGCGGAGTGG CAATTTTTCA CGAAGAGCAG CCTCGTCGAT TTGGATGATT TTCACACCAG 
CAGCTTCAAG GTCAAGTACT TCATCCTTGA TAGCAAGGGC GATTTGGAGA GTTGAATCCT 
TGATAGAGAT GTCTTCACGT GGGAATGACC AGTTAAGGAT GGTAACAGGT CCAGTCAACA 
TACCTTTAAC AGGTTTGTTT GTACGACTTT GTGCATAGCT AGACCATTTA ACAGTGATAC
GGTTAAGACG AGTGACATCA CCCCAGATGA TTGGTGGTTT TACCCCACGC ATACCGTATG 
ATTGTACCCA TCCATTTTTA GAGAAGAGGT ATCCTGACAA GTTTTGACCG AAGTACTCAA 
CCATGTCATT ACGCTCAAAT TCACCGTGAA CAAGGACATC AAAGTCAATA TCTTCTTGCC 
ACTTGATCCA TTCGTCAATC GTTTCAGCAA GGAAAGCGTC GTACTCTTTT TGAGACAATT 
CACCTTTACG GTAAGCCAAA CGTTTGGCAC GAACTTCTTT TGTTTGAGGG AATGAACCAA 
TCGTTGTTGT TGGAAGAGCT GGAAGTTTGA AAGCTTCTTC TTGGATAGCT TCACGTTCTG 
CAAAGGCTGG CAAACGAGTG TAGTCTGCGT CTGTCAAGCC AGCGATACGC GCACGAAGTT 
CAGCATTTTC ACCAACACGC TCAGTCGCAA AGAGTTCTTT GTTGGCTGCA AGAGCTTCTG 
AACCTTGACC ATTTCGGATA GCATCCAAGT CACGGATTTC ATCCAATTTT TCAACTGCAA 
AGGCAAAGTG GTTCAAGAGT GCTGGTTCAA ATTCTTCATT AGCAGTTGTA AATGGCACAT 
GAAGAAGTGA GCAAGAGCTT GTCAAAACGA TGTTTTCAGC TGGAATTTGC TCAAGAACAG 
CCAAGCTCTT TTCGTAGTTG TTGCGCCAGA TGTTTTTACC ATTGACAATA CCTACATAGA 
GAGTCTTGTC AGCTGGGAAG CCACCTTTAA CGAGTTCAAG AGTTTTCTTA CCTTCAACAA 
AGTCAAGACC GATAGCATCT ACTGGTAAGT TTACAAGGTC AGCGTATACG TCACGAACAT 17880
CACCGAAATA AGTTTGAAGC AAGACTTCAA GACCTTTTTT GTCAGCCAAG AGTTTGTTGT 17940
AAAGGTTCAA GAAGAGAGCT TTTTCTTCAG CTGTCAAGTC TTTTACAAGA GCCGCTTCAT 18000
CCAATTGGAT GCGAGTCGCA CCAAGTTCAG CCAATTTAGC AAAAACTTCT TGGTAAGCAG 18060
CCACTAAGCT ATCTACGAAG TCGTCTGCTT TCACGCCTTC TTCAAAGTCT GACAATTGAA 18120
GGAAAGTGAA GGGACCTACA AGAACAGGAC GAGTGTTCAA TCCAAGTTCT TTGGCTTCTT 18180
GGAACTCATC GAAAATCTTG TGACCAGCCA ATTTTACTTG AGTGTCTTTT TCAAATTTAG 18240
GAACGATGTA GTGGTAGTTA GTGTTGAACC ATTTCTTCAT TGGAAGGGCG CGAACGTCCC 18300
CTTTTTCTCC CTGGTAACCA CGTCCCAAAG CGAAGTAGCG CTCAAGGTCA GACAAGTCCA 18360
AGTTTTGAAC GGATGCAGGC ACCACGTTGA AAAGGAAAGC CGCATCTAGG AAGTTATCAT 18420
AGTGAGAAAA GTCATTTGAT GGAATTTCAG TGATGCCTTT TTCTTTGACA ATGTTCCAGT 18480
GTTTAGCACX; CAAGTCTTTT GCTGCTGCTA AAAGTTCTTC TTCTGAGATT TCTTTTCTAA 18540
AGTATTTTTC AGTTGTAAAT TTTAATTCAC GGAATTCGCC CAAACGAGGG AAACCGATGA 18600
TTGTAGTTGA CATGATGTGT CCTCCAAAAT TTGTTGTTGA AACTATCTTA ACAGAAAAGA 18660
AAGCGTCTGT ATAATTGTAA AAAATTAGGG TTTGATATAG TTTGAAACTA TATATCTGTT 18720
TCGGACAAAA GAAAAAGACT TGAAGCAAAC GTCTCAAATC CTTTGTAATT CTTACTTTAC 18780
AGCTATATTC CAATTAGAAT ACTAAAACAT GTTATTAGTA ATTCTTATAA GTGACTATGA 18840
CCTGTTATTA GAAAAGACTA TAACTGATTC TAGTCAACTT TTTCCCTGTT CAAGTGGGAC 18900
GATTGCTAGT GTCTTTCCTA AACTGGCTAG GACTTTTAAG ACTGTATCCA ACTGAGGACT 18960
AGTCTTTCCT GTCTCCATCC TAGCTATGAC AGGCTGGCTT ATTCCACTGA CTTCTTCCAG 19020
CTTTTTCTGA CTGATTCCTT GTTCATACCT AGCCTCAATC AACTCGCTCA TGATAGCCAC 19080
TCGCATATCA CTTTCAAGGA TTTCCTCCTT GCTAAAGAGC TCAGATGGAC ATCCTTCCAA 19140
TTACTCCCAA TAGCACTATT CTTCATCACT TAACCCTCTT TTTTTTACGT CTATGTATTT 19200
TTAAAAAAAT GAGCGAATTA TGATTCGATA GATTGACCAG TGGGTTTAAA GTTGGTGCTA 19260
GCCTATTTCT TAAGCGATTT TCCTTTTCTA GGATAAAGCA GTTCCTGCTT GCTTAACCCC 19320
AATTTTCCAC GATGAATCCA ATAGTAAATG GTTGAAATTC CCACGTTAAC CCCTTTAGCC 19380
ATCACCATCA TTTCAGGCGA AAATTTTTGG TTATGTTTTT GGTTATGTAT AGTGGAGAAT 19440
CTTTTCCTTT AGTTTCTTT^ GACTGTTGAG CGTAGTCGGC AGAATAAATC TCTTTGAAGC 19500
GCCCTTTTCC AAGACATTGT CGGACTGTCC CACGCTTGAT TTCAGTGTGG ATAGTTTGAG 19560
GAGCTTTTCC AAGTAGAGAG GCAATTTCTC TATTTGATTT TCCTTCTTTT TTCCATCGTT 19620
CGATTAAGCG ACGGCTATCG ATTGTCAAAT GTTTGCCTTT TGTAGTATAA TTGTCTTGCA 19680

TTTCTGTGCC TTTTAATCAT TTCAATCTTA AATTGGACTT TTTTTACTTG GGTTGTACTT 19740 

AATCTATGAG GAAGACAAGA AAAAGAATAT CAATCAAGTA AAGTCACAAA GTCACATTAG 19800 

CTCCGAGCAA CCATTGCAAA TTGAGGTACT CACACAATGA TTAAAACATT TCTCTCTGCC 19860 

CTTTCGGTCA TTCTCTTTTC TATCCCTATC ATAACTTATT CTTTTTTCCC ATCTTCTAAT 19920 

CTTAACATTT GGCTATCTAC CCAACCTATC TTGGCACAGA TTTATGCCTT CCCCTTAGCT 19980 

ACTGCAACTA TGGCTGCTAT TTTAAGTTTC TTATTTTTTT TCCTATCTTT TTACAAGAAA 20040 

AATAAACAAA TACGGTTTTA CTCTGGCATT TTGCTCTTAC TATCGCTCAT ATTACTATTA 20100 

TTCGGAACAG ATAAAACCCT TTCTTCTGCA TCAAATAAGA CTAAAACCTT AAAATTAGTA 20160 

ACTTGGAACG TCGCTAATCA AATAGAAGCA CAACATATTG AGCGAATTTT TAGCCATTTT 20220 

GACGCCGATA TGGCTATATT CCCTGAACTA GCTACCAATA TCAGAGGTGA GCAAGAAAAC 20280 

CAGAGAATCA AACTATTGTT TCATCAAGTT GGACTTTCTA TGGCCAACTA TGATATTTTC 20340 

ACTTCTCCAC CTACCAATAG TGGAATAGCT CCTGTGACTG TGATTGTCAA GAAAAGTTAT 20400 

GGTTTCTATA CAGAAGCTAA AACTTTTCAT ACAACACGGT TCGGGACAAT TGTATTACAT 20460 

TCGAG7VAAAC AAAATATACC AGATATCATT GCCTTGCATA CTGCGCCTCC TCTGCCAGGT 20520 

TTAATGGAAA TCTGGAAGCA AGACTTAAAC ATCATTCATA ATCAATTGGC TTCAA^J'.TAT 20580

CCAAAGGCTA TTATTGCAGG TGATTTTAAT GCAACTATGC GTCATGGAGC ACTTGCAAAA 20640 

ATAAGCTCTC ATAGGGACGC ATTAAATGCA CTGCCACCTT TTGAAAGAGG AACTTGGAAT 20700 

AGCCAAAGTC CAAAACTTTT TAATGCAACA ATAGATCATA TTTTATTGCC TAAAAACCAC 20760 

TACTATGTTA AAGATTTAGA CATTGTAAGT TTTCAAAACT CTGATCATAG ATGTATTTTT 20820 

ACAGAAATCA CATTTTAATT ATTTTATATA AAATCACCCC TCTAATGTTC ATAAACTAGA 20880 

GGGGGAATTT GTATCCTACT ATCGTTTAAC GCACTTCTGC ATTGACTTTT TCTTCGAGAG 20940 

ACGCTTGGAT TTTTTCCATA TAGCGTGCGA CTTCTTCGTC CGTTAAGCTG TCTTCTGGAT 21000 

TTTGGAAGGT CAAGCTATAA GCCATTGACT TCATACCAAG TCCCAGTTTT TCACCTGAGA 21060 

AGACGTCAAA GAGTTTGATA TCTGTCAAAC GTTTCACGCC GGCAGCTTGG ATAGCATCTA 21120 

CAACTTCTTG GTGAGTCACT TCTGCCTTGA GGAGAAGGGC AACGTCACGG CTGACTGCTG 21180 

GGAATTTGGT GATTTCCACA AATGGAACAG CAGGTTGGAG CGCCCCTTCG ATGGCTGAAA 21240 

GGTTAAGCTC AGCTACATAC GTTTCTGGAA TATCGTAAGC CTTGGCAGTG ACTGGATGCA 21300 

CTTGGCCAAG GAAACCAAGA ACTTGGTCAC CGAGTGAAAT CACGGCTGTA CGACCTGGAT 21360
GAAGGCTAAC GATTTCAGAT GTTGCTGTAT AGGTTACTTG GAGTCCCAAA CGAGTAAATA 21420

GGGCTTCAAG GATTCCCTTA GCATAGAAGA AATCAACTGG AACTGCTGCT GTTTGGAAAT 21480 

CTTTTTCAGC AACCAAGCCT GTCAAGGCAA AGGCAAAGCT GTTGATCTCA TTTGGAAGTT 21540 

CTTCTTTTGG ATTACCTGTT TGTTCAAAGA CTTTTCCAAT CTCATAAAGG GCCAAGTTTT 21600 

TATTCTTACG AGCCACGTTG TAGGCAACGG TATCAAGGAT CCCTGAAATC ATATTTTGAC 21660 

GGAGGACTGA ACGATCCACA GTCATTGGCC ACATGAGTTC AGTAAGGTTA CTTGGTTGAG 21720 

CTGTGAACTC AACTGCTTTT TCAGGAGTTG TCAGAGCATA GGTGATGATT TCTGTCAAAC 21780 

CTGCTCCTTC AGCAATGGTA CGAACTTGAC GGCGGAGTTT TTGTATCACA GTCAATTCAC 21840 

CAGCTGTACC ATCGTCTTTT GGAAGGCTGG TTGGCAAGCG GTCATATCCA TAGATACGAG 21900 

CGATTTCTTC AAAGAGATCA GCTTCGATTG TGATATCCCA ACGACGACGT GGTACGCTGA 21960 

CTGTAAAGCT GTCTGCATTT CCAGAAAGAC CAAAGCCAAG ACGACGGAAG ACGTCTTCTA 22020 

CATCAGCATA AGACAGCTCA GTTCCGAGGA CACGGTTAAC ATCAGCAAGG GTTGAAGAAA 22080 

CTTCCACATC AGAGGTATCA AGCTCACCCG CTGAAACGAT ACCCTTACGC ACCGTCGCGC 22140 

CTGCAAGCTC TGCAATCATG CTAGCTGCCG CATCAAGGGC TTCATTAACT GTTGCCACAT 22200 

TAATTCCTTT TTCAAAGCGA GAAGATGACT CAGAACGAAG GTTCAGGCGA CCACTTGTCT 22260 

TACGGATAGA TTTGCCATTA AAAACAGCAG CTTCAAGGAT AACACGACTA GATTTTTCAG 22320 

AAATTTCTGT AGCCTGACCA CCCATAACAC CGGCAAGGGC TACTGGTTTG TCAGCAACTG 22380 

TAATCACGAG GTCTGTCTCA GCCAAGTCTC GTTCTTCACC GTCCAGGGTC ACTAATTTTT 22440 

CACCATCACG CGCTTCACGC ACACGGATGT CAGTCCCTTC AAATGTGTCC AAGTCAAAAG 22500 

CATGCATAGG TTGACCAAAG TAGAGCAGGA TGTAGTTTGT CACGTCTACA ACGTTATTGA 22560 

TGGGACGGAT GCCTTCGTTC ATGAGAAGGT ITTGCAACCA TTGTGGACTT GGTGCGATAG 22620 

TCACATTGTC CAAGATACGA GCTGCATAGT AAGGCGCCTT GTCTGTCTCA ATGCTGACAG 22680 

AAAGGGCATC TGCCGCAGCT TCATTAGTTT CTGTTAGAGT AAATTTTTTA AAGTTGACTG 22740 

CCTTGTCATA GATGGCTGCC ACTTCGTGAG CCACTCCACA CATAGAAAGG GCATCTGCAC 22800 

GGTTTGGTGT GATGGAAAGT TCGATGATTT CATCATCCAA GTCTAGGTAA GAAAAGACTT 22860 

CCTCACCTGG CACGGCATCT TCAGGCAAGA TTTGGATGCC ATCTGCGAAT TCCTTAGGCA 22920 

CAACTGAGTC AGAAATTCCC AATTCACCAA GTGAACAGAT CATTCCAAGT GACTCCAAAC 22980 

CACGGATTTT TCCTTTTTTG ATTTTGTAGT TATCAGCGAT ACGAGCTCCT GGAAGAGCCA 23040 

CCATGACCTT GATCCCAGCA CGCACATTTG GGGCACCACA AACGATCTGA CGCTCTTCTT 23100 

CTTCGCCAAC GTTAATCTGA CAAACATGGA GGTGAGTCTC TGGCACATCT TCGCAAGACA 23160 
AGACCTCACC GACGACAATT TTTGAGAGAC CAGCAGCTGG TGATTCGACA CCCTCTACCT 23220

CGATCCCTGT AGTTGACATT TTTTCAGCCA ACTCTTGTGA TGGCACATCA ATGTCCACCA 23280 

ATTCTTTTAA CCATTTATAA GATACAAGCA TAATTTAGTT CTCCAGAATG ACAGTTGTCA 23340 

CTCTAGTTCT TTTCCTTTCC TATCATTTCA ATAGAAGAAT CCTCTTCTTA CCTTAATTTC 23400 

TTTCTCAGTA ACCAATCCGT ATCTACTTTT TGACCAACCA TAAAATGATG TTGGCTAAAT 23460

TTTTCAAAAC CATATCGGTT ATAAAACGCT TGAGCTTTTG TATTATGCTC CCAAACACCT 23520

AGCCAAGCCC AAGAAAAACT ATTTTTTGTA GCAAGTTCAA GTGCGAATTC AAACAGTTGC 23580 

TTACCTAGTC CAAATCCTTG GAATTTTTGT AGCACATAGA GACGTTGAAT TTCAAAAGCG 23640 

TCCTCTAATT CTCTCTCAGT TTGAGCACTT CCCCAGTTGA CTTTGAGAAA ACCAGCTATC 23700 

TCCTCCTCAT GCATAATGAA ATAGGTTTCA GAGTCAGGAT TTCCCAACTC AGTTGACAAA 23760 

GTTTTCAGAC TATAAGCCTC TTCAAAGTAT TCCTGTAACT GCTCTTCCGT ATTATCATAC 23820 

GCAAAGGTTT CACGAAAGGT TTGTTTGGCA ATTTTAGCCA ACACCTCAAC ATCTGCCATT 23880 

TCTACTTTTC TAATCATTAT TTAAACTGTT CTGAGAAGCG GACATCTCCT TGGTAGAATC 23940 

CACGGATATC GTTGATTCCA TAACGGAGCA TAGCTACACG CTCTTGTCCA AGACCAAAGG 24000 

CAAAGCCAGA GTATACAGTC GCATCGATAC CACTCATTTC AAGGACACGT GGGTGAACCA 24060 

TACCGGCCCC CATAATTTCG ATCCAACCTG TTTTCTTACA TACATTACAG CCTTCTCCAC 24120

CACACTTGAA GCAAGAAACA TCCACCTCAA CAGATGGCTC TGTGAATGGG AAGTAAGATG 24180 

GACGCAAACG AATTTGACGC TCTTCACCAA ACATTTTTTG GACAATCAAC TGAAGCGTTC 24240 

CTTGAAGATC AGCCATAGAG ATATTTTTCC CAACTACCAA GCCTTCGATT TGGTGGAATT 24300 

GGTGACTGTG GGTCGCATCG TCCGTATCGC GACGGAAGAC ACGCCCTGGC GAGATCATCT 24360 

TCAAAGGACC TTTAGAAAAA TCATGGGCAT CCATAGCACG CGCCTGAACT GGAGACGTGT 24420 

GGGTACGGAG CAAGATTTCT TCAGTGATAT AGAAAGTATC CTGCATATCA CGAGCTGGGT 24480 

GGTCTTTTGG AAGGTTCATA CGTTCAAAGT TATAGTAGTC TTGCTCCACT TCAAAACCAT 24540 

CCACGACTTG ATAACCCATA CCGATGAAGA TATCTTCGAT TTCTTCACTG GTTTGTGTCA 24600 

AAACGTGACG GTGACCAGTC GCAACTGGAC GACCTGGAAG CGTCACATCT ATACTCTCGC 24660 

TAGCCAGTTG AGCCGCGACT TTCTTTTCTT CCAAGAGCTT AGCTGTTTCT TCAAAAGCAG 24720 

CAGTCAAGAC ATCACGAGCT TCATTGACGT GTTTCCCGAT GATTGGACGC ATCTCAGCAG 24780 

AAACATCTTT CATCCCTTTG AGGATTTCAG TGAGCGAACC CTTTTTACCA AGGACAGAGA 24840 

CACGCAAATC TTGCATCTCT TTTTCATTTC CAGCAGTAAT CTGCTTCAAG CTAGCCAGCG 24900 
TTTCTTCGCG AAGCGCTTTT AATTGTTCTT CAATAGTTGA CATATTTCCT CCATCAGTCT 24960

CTCGTAGATA AAAAGAAAAC CACATGCCAA AAACTCCACT CGGAGCGTTG ACACGCGGTA 25020 

CCATCCGTTT TCATCTGACA AGTCAGACCT TCATTTCTAA ATCCATGCGC AAGTGAATTC 25080 

ACCCAGCTTT CATATAGAGA GCTTGCAGTC ACGGCTCTCC TCCCTGATAT ACTTCCCTTG 25140 

GGCTACTAGT CTTTCAGATT CCTATTCAAT TACTACTTAG TTTATCAGAT TTTTACCATT 25200 

CTTGCAAGAC CTATCTTACT TCTGCTTGTT AGCTTATTCT TATCTAAATT TATATAAACc 25260 

TTATCTAAAT TAACTATTTA TAATTTTTGT AACAAAATTA AATTAATTGA CACTCCCCTA 25320 

TAAAATAAAG AAGTTTAGAA TTTAATGTCT TCCAAACTTC TTTATTCCAT ATTTAATGAA 25380 

ATGCCACCTT AACCGTGATA ATAGCTAGTC ATCAATAAAA AACTATTTGA ATAAGGATTC 25440 

TCCATTTGAT TCAATCACTT CTTTATACCA AGTAAAAGAC ATTTTCTTAT ATCGATTTAA 25500 

TGTACCACTT CCATCATCGT TTCGATCAAC ATAAATGAGA CCGTACCTTT TAGAAAGTTG 25560 

TGCAGTGGAC ATAGAAACAC AGTCAATACA TCCCCAAGAC GTATAGCCCA TAATTTCAAC 25620 

ACCATCCTGT AGAGCTTCAG CAACTTGCAA TAAATGTTCT TTCATATACT GAATTCTATA 25680 

ATCATCTTGG ACGGTTAAGT TATTAAGTTC ATCTTTTATT AGTTGATCTT TAGCACCTAA 25740 

TCCATTTTCT ACTATAAATA ATGGGATTTG ATAACGGTCA TAATATCTAT TTAAAATTAT 25800 

ACGTAGTCCA ATTGGATCAA TTTGCCATCC CCACTCTGAA GACTCTAAAT AAGGATTTAC 25860 

TAAACCACCA ATAATATTCC CTTCTCCTGA ATTATACTGT GTTGGAAGAG CAGATTGAGT 25920 

CACACTCATG TAATAGCTAA AGGATAAAAA ATCTACGGTA TAATTTTTTA ATAACTCTGC 25980 

ATCTTCAGCT GCA/ACTCTA TGTTAATGTC ATTTTCCTTA AAATATCTTT TTGCATAATT 26040

CGGATAATAA CCTCTAACAT GCACATCTGA AAATAGATAA TTTAGATTCT CATACTCATG 26100

AGTCGCCCAT ACATCTTTTG GATTTGGAGT CATTGGATAA GCTGGCATAG CTAATACCAT 26160

ACATCCCACC TTAAACTCTG AATTAATCTC ACGAGCAATT TTTGTAACCA AACTTGAGGC 26220

GACTAATTCA TGATGTATAG CTTGATATAA TTCTTGTTTC GAAAGATTCT CCTTAGGTAT 26280

ATCTATTCCT CCACTAGTAA ATGGTAATTC CAAAACAGAG TTTACTTCGT TAAATGTAAG 26340

CCAATATTTA ACTTTATCTT TATACCTTTC TAAAACTGTT CGAGCAAATT TTTCATAAAA 26400 

ATGAATCATT CTCCTATCAA CCCATCCATG ATATTTTCTT GCTAAATATA ATGGAGTCTC 26460 

ATAGTGTGAA AGAGTTACAA CTGCTTCTAT CCCGTGAGCA TGTAGTTCAT CAAACAATTC 26520

ATCATAATAT TTCAACCCAG CTTCGTTAGG TTCTTCCTCA TCTCCTTTTG GAAAAATTCT 26580 

ACTCCATGCA ATAGAAGTAC GAAAAACATT AAAGCCCATT TCAGAAAACA AGGATATATC 26640 

TTCCTTATAT TTATGATAAA AATCAATACC TATCAATTTT AAGTTATCTT CTGTAGGATT 26700
TTCTGTTGCT TCTCCTAATC CACCTTTGGG TAACACATCC TGAACTGATA AGCCCTTACC 26760

ATCTTCATTA TATGCTCCCT CTACTTGATT AGCTGCAACA GCTCCACCCC AAAGAAAATC 26820 

ATCTGGAAAA ATGGTCATAA CTTTCCTCCA TTATAATATT ACCAGTAATT CCTTAGAATG 26880 

CTCGATTGTC TGATTATTAG GTAATACTAA TACATCTAGA AAATCATTGG TATTCGTTAC 26940 

AATTACTGGT GTAACTGTTT CGTAGCCTTT AGTCTTGATT AAATTCAAGT CCATTTCAAA 27000 

AATCAACTGA TTTTTGAAAA CTCTGTCTCC TTCTTCTACA TGACT7ATAA AACCTTGACC 27060 

TTTTAGCTCA ACAGTATCTA ATCCAATATG AATTAGTAAC TCAACACCCT CATCACTCTT 27120 

CAATCCAATT GCGTGCTTAG TCGGAAAAAT ATTTGTAATT TTCCCATCAA ATGGTGCATA 27180 

AACCTTACCT TCACTTGGGA TAATCGCTAC TCCGTCTCCA ATTAGTTTAT CTGAAAATGT 27240 

TTTATCCTGG ACATCGCTTA ACGGAATGAT TTCTCCTGAT ATAGGAGAAA ATATCATTTT 27300 

TTTATTTGAA ACTCCAGCTT CAACTTCTAA ATTGCTAGAA CTCTCTTCTT CATCGATTCC 27360 

AAATATATAA GCTAATACAA AGGTAATAAC AACCGAAATG ACCGCCACAA TTAAAGCATT 27420 

TACAATATTT GATGGCACAT CAGAATAAAT AAATTGAGGC AACGCTATCA AAGATGGGAC 27480 

AGCAAATAGA TATGCTTTAA CACTAGTAAG ACCTGCAAAT AATCCCGCTA ATCCACCACC 27540 

AATCATAGCT GCATAAAGCG GTTTTTTATA TTTTAAAGTC ACACCATATA ATGCAGGTTC 27600 

GGTAATCCCT GCAAGTAAGG CTGAGAAACC TGCTGCAAAA GC7VATTTGTT TTGTATTATT 27660 

ATTTTTACTC TTTAATGCAA CAGCCATCGA AGCAGCCCCT TGAGCTAAGT TTGACCCTAA 27720 

CATTGCTGGA AGAATTAATA CGTCTGGAGT AGCAATAGAT GCCGCCAAAA AAATAGGTGC 27780 

AAAAGCCCAA TGCATTCCAG TCATAACAAT AAATGGCATA ATAGCACCAA GAATAGCTAA 27840 

TGTAAGCCAT CCAGCTACAC CATACATTTG CCCAACTAGA TTTGATAATC CTTCACCAAC 27900 

AATTACTCCA ATAGGTCCGA CTACAACTAA GGCAATACAG CTTGATACTA ATAATACTAG 27960 

CGTAGGTTGC AAAAAACTCT TAGTAATAGC TAGTGTTAAT TTAGCAATTA TTTTTTCA^T 28020 

ATATTTCATC AACCAAACCA TAATAAGAAT TGGAACGACT GATGAACCAT AACTAGCTGG 28080 

TGTCACAGGT GCACCAAATA AACTAAGAGG ATTCCCTGAT TGCACCATTT GAACAAAATT 28140 

TGGATGGAGA AGTACACCTG CTACAGACAT AGCTAATGTA GATGTTACTT TTAATTTTTG 28200 

TGATGCAGAA TAAGCTAATA ACAGCGGTAA GAAATAATAT GGAGCATCCC CAAAAAATGT 28260

CAAAAAAGCA ATAGTCTGAG AATCTGATTG CAATATACCA AGCATTGGTA AAATGATTAC 28320 

CAAGACTTTC AACATACCTC CCCCTAACAT TGCTGGAATG ATTGGAGTCA TGGAACCAGC 28380 

GATATACTCA ATGATTCTTT CTAAAATATT CCCTTTGTGC CCTTGAACAA CTGAATCGGA 28440 
TTCAAAATTG CCAAGTTTAA CGAATTCTTT ATAATAATTA GCTACATCAT TACCAAGTAT 28500

AATTTGATAT TGTCCATTCT TTTTCATAAT ACCTATTACA CCTGGTATCT TCTTCACATC 28560 

ATCATCATTG ACTAAATTTT CATCTTTTAA TTCTAATCTT AAACGTGTTA CACAATGGGT 28620 

AACTCTATTG ACATTTTTTT CACCTCCAAT TACATCGAGG ATTTTTTGTA CCGTATCTTT 28680

ATAACTCATG GTATTCTCCT ATTCTATTAA TCTAAATTTT TTGTTAAGCG ACGAATATGA 28740 

GCCATCAAAT AAACTAATTC ACTAGAAGTC AGCAAATAAT TGTACTCCGT TTGTATAAAC 28800 

ATTGCTACCT GTTCACCACA TTCATATTCT CTAGGATATT TATTTTTCAT TAATGCTAAC 28860 

AAGTCTTCAT CATCATCGTC GG 28882 



(2) INFORMATION FOR SEQ ID NO: 141:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12835 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141: 



GCCTATGTCT TTTTCAAAAA AATGCTTGAC TTGAGACGGG AACTAGGGAA GTCTAAAGGC 60

GGAAGGCATT GATTTATACT CTTCGAAAAT CTCTTCAAAC CACGTCAACG TCGCCTTGGA 120

                      GTCGATGCrr ATCTACAACC TCAAAGCAGT GCTTTGAGCA 180

ACTTGCGGCT AGTTTCCTAG TTTGCTCTTT GATTTTCATT GAGTATTATA TTACTTTCTA 240

TTTGTAGGAG GTGGCTTATG AAGATTCCTC TCTTAACTTT TGCAAGGCAT AAATTTGTTT 300

ATGTCTTGCT TACTTTGCTT TTTCTTGCTT TGGTTTATCG TGATGTTTTG ATGACTTATT 360

TCTTTTTTGA TATTCATGCG CCCGATCTAG CTAAATTCGA TGGACAAGCA ATTAAAAATG 420

ACTTATTAAA ATCAGCATTA GATTTTCGTA TTCTCCAGTT CAATCTAGGT TTTTATCAAT 480

CATTTATTAT TCCAATCATC ATTGTTTTGC TAGGTTTTCA ATATATTGAG CTGAAAAATA 540

AAGTTTTACG ATTGAGTATT GGAAGAGAAG TGAGTTATCA AGGGTTAAAA AGAAAGTTGA 600

CTTTGCAAGT TGCAAGTATC CCTTGTTTGA TATATTTAGT GACTGTGCTG ATAATTGCAA 660

TTATAACCTA TTTCTTTGGG ACTTTTTCTC CTCTTGGATG GAATTCTCTA TTTTCTGATG 720

GAAGTGGTTT ACAAAGACTC CTAGATGGAG AGATAAAAAG CTATTTGTTC TTTACTTGTG 780

TCCTACTAAT CGGTATTTTC ATCAATGCAA TCTATTTTTT ACAAATAGTT GATTATGTGG 840

GGAATGTGAC TCGTTCGGCA ATCACCTATT TGATGTTTCT TTGGCTTGGT TCTATGCTGC 900

TTTATAGTGC CTTGCCTTAC TATATGGTTC CTATGACGAG TTTGATGCAA GCTAGCTATG 960
GGGATGTAAG TTTGATGAAA CTCTTTACTC CTTATATCCT TTATATTGTC CCTTACATGG 1020

TGCTTGAAAA ATATGAAGAT AATGTTTAAG AATTTTAACA ATATTTTGCT AAATAGAAAG 1080 

ATTGTTTTAC TACTTCGTAT AGTTCTGATG ATGATTTTGA TAAACCATCT ATTGTCAACA 1140 

GCGGTTCAAA AGCAGGATGC TGTTATCTTT TTCAAGAGAG AATTGATTTC AATTTTTTCC 1200 

TATAATGACT ATTCTGAAGC GAATTTAGAA ATCCCCAAAC TATTGTTAAA CCTTTCGCTT 1260 

TTCATGGTAG GATGGCTCTC TGTCATTTTA CTTGAAAGTG ATTTGGCAGA CCATTACCAT 1320 

CACTTGATTC GCTATCAATC AAGCTCCTTT TTCGATTATA CAAGGAAACG ATTGGTTGTC 1380 

ATTTCTAAAT TTTTTACTCA AGATTTGTTT GTCTGGTTTC TTGGTTTACT TCCTCTAGGA 1440 

ATTCATTTCA AAACAGTCGC ACTTTTCTTT TTACTTGCTC AGTTAATGAT GTTGTACTTA 1500 

CTACTGTCTT ATCTGATAGC ACTGATTAGT GCGGGCGCTG GTTTTTCCTT TTTTCTCTAT 1560 

TTTTTAGCAT TTGTGGGACA AGAATGGATG ATGGATCATA TTGTAACAGT GTATTTAGTA 1620 

CTCTTAAGTT TATTAGTTAT GTTGATTGTT AGTCGCTTGG AAGAGAAATT TAAGAAAGGA 1680 

TAAACGATGA GACTTGAAAT TATAAATGGA CAGAAAATTT ATGGGAAAAG ACCTATTTTA 1740 

AATCAGTTGA ATTTGGTGTT TCAATCAGGA AAAATTTATG GACTTAAAGG TGATAATGGA 1800 

TCTGGCAAGA CGGTTCTTTT AAAGATACTT GCTGGTTATA TTAAGCTTGA CAAAGGAAAA 1860 

GTTCTTCAAG ATGGTAAAGT TTACGGGGTA AAAAATCATT ATATTCAGGA TGCAGGAATT 1920

TTAATTGAAA AAGTCGAGTT TTTATCTCAT TTATCCCTGA GAGAAAATTT GGAACTGTTA 1980 

AGGTATTTTT CATCTAAAGT TACGGAAAAA AGAATTGCCT ATTGGATTCA ATACTATGAT 2040 

TTACAGGAAT TTGAAGACAT TGAATACCGT CATTTATCCT TAGGAACAAA GCAAAAAATG 2100 

GCCTTGATTC AAGCCTTTAT TTCCTCTCCT TCTATACTCT TTCTCGATGA ACCTATGAAT 2160 

GCTTTGGATG AGAAGAGTGT GAGGTTAACC AAACAGGTCA TTTTATCTTA CCTGAAAAAA 2220 

GAAAATGGTC TGGTTATCCT GACGTCGCAC ATATCGGAAG ATATTTCAGA CCTTTGTACA 2280 

GATGTATTAG TTGTCGAAAA TGGACATATA CAAATGTAAA GGATATACAA TCCTAGGAGA 2340 

TGGCTTATGG CACATCTAAA ATCATTTATT ACACGATATT CCAAGGTTTA TATTGGTTTA 2400 

GTTCTGCTGA TCTGGCTGTC TTTCTTCTTT ATCCCTTGGG ATAAACCACT TCTGGGGATA 2460

AGGATTGACA TCTTCATCAT ACAGAAAATC TTGCTAGCTT TTGGAATTCT GTCCATTCTC 2520 

ATGGCCTTGC TGTCCAAGAA AGTCAGTCTC TTTGTTTTTG GACTGATTTG CTGTCTTTCT 2580 

CTTTGGATTA ACTTATTTAT CACATTTGCC ATTTTGCCGA TTTTTGGCAA TTAAACAGTC 2640 

ATAAAAGTCG GAGAGGTTAG CTTGAAAACT AACCTCTTTT TCCTTTTCAA AATGGGGATT 2700 
CTTCCTTGAA AATAATCAGT AATTGTGCTA AAATTAAAGG AACATTCTAA AATATTCGGA 2760

ATTTAAAGTA AGGAAAAACA TGGCTAATAT TTTAAAAACA ATTATCGAAA ATGATAAAGG 2820 

AGAAATCCGT CGTCTGGAAA AGATGGCTGA CAAGGTTTTC AAATACGAAG ACCAAATGGC 2880 

TGCTTTGACT GACGACCAAC TAAAAGCAAA AACAGTTGAA TTTAAGGAAC GTTATCAAAA 2940 

TGGAGAATCA CTGGATTCAT TGCTTTACGA AGCATTTGCG GTTGTCCGTG AAGGTGCCAA 3000 

ACGTGTCCTA GGTCTCTTCC CTTATAAGGT TCAGGTCATG GGGGGGATTG TTCTTCACCA 3060 

TGGTGACGTG CCAGAGATGC GTACAGGGGA AGGGAAAACC TTGACTGCGA CCATGCCGGT 3120 

ATACCTCAAT GCCCTTTCAG GTAAAGGGGT TCACGTAGTT ACGGTTAATG AATACCTGTC 3180 

AGAACGTGAC GCGACTGAGA TGGGTGAATT GTACTCTTGG CTTGGTTTGT CAGTAGGGAT 3240 

TAACTTGGCT ACCAAATCTC CAATGGAGAA AAAAGAAGCC TATGAGTGTG ATATTACTTA 3300 

CTCAACTAAC TCAGAAATCG GATTTGACTA CCTTCGTGAC AACATGGTCG TTCGCGCCGA 3360

AAACATGGTA CAACGTCCGC TTAACTATGC CTTGGTCGAT GAGGTTGACT CTATCTTGAT 3420 

TGACGAGGCT CGTACACCTT TGATTGTATC AGGTGCCAAT GCGGTTGAAA CCAGTCAGTT 3480 

GTATCACATG GCAGACCACT ATGTAAAATC TTTGAACAAA GATGACTACA TCATCGATGT 3540 

GCAGTCTAAG ACTATTGGTT TGTCTGATTC AGGGATTGAC AGGGCTGAAA GCTACTTCAA 3600 

ACTTGAAAAC CTCTATGACA TCGAAAACGT GGCTTTGACT CACTTTATCG ATAACGCCCT 3660 

TCGTGCCAAC TACATCATGC TtCTCGATAT TGACTATGTG GTGAGCGAAG AGCAAGAAAT 3720 

CTTGATTGTC GACCAATTTA CAGGTCGTAC CATGGAAGGT CGTCGTTATT CTGATGGATT 3780 

GCACCAAGCT ATTGAAGCCA AAGAAGGTGT GCCAATCCAG GATGAAACCA AGACATCTGC 3840 

CTCAATCACG TACCAAAACC TCTTCCGTAT GTACAAGAAA TTGTCTGGTA TGACGGGTAC 3900

AGGTAAGACT GAGGAAGAAG AATTCCGTGA AATCTACAAC ATTCGTGTTA TTCCAATCCC 3960

AACAAACCGT CCTGTTCAAC GTATTGACCA CTCAGACCTT CTTTATGCAA GTATCGAATC 4020 

TAAGTTTAAA GCGGTTGTCG AAGACGTTAA GGCTCGTTAC CAAAAGGGTC AACCTGTCTT 4080 

GGTTGGTACA GTAGCGGTTG AAACTAGTGA CTACATTTCT AAGAAATTGG TTGCAGCTGG 4140 

TGTTCCTCAC GAAGTCTTGA ATGCCAAAAA CCACTATAGA GAAGCCCAAA TCATCATGAA 4200 

TGCTGGTCAA CGTGGTGCCG TTACCATCGC AACCAACATG GCGGGTCGTG GTACCGACAT 4260 

CAAGCTTGGT GAAGGTGTTC GTGAACTTGG AGGACTTTGT GTTATTGGTA CAGAACGTCA 4320 

TGAAAGTCGT CGTATCGATA ACCAGCTTCG TGGACGTTCA GGTCGTCAAG GAGATCCAGG 4380 

TGAGTCACAA TTCTACCTAT CTCTTGAAGA TGATTTGATG AAACGTTTTG GTTCTGAACG 4440 

CTTGAAGGGA ATCTTTGAAC GCTTGAACAT GTCTGAAGAG GCCATTGAGT CTCGCATGTT 4500 
GACGCGTCAG GTTGAAGCAG CTCAGAAACG TGTCGAAGGA AATAACTACG ATACCCGTAA 4560

ACAAGTCCTT CAATACGATG ATGTCATGCG TGAACAACGT GAGATTATCT ATGCTCAACG 4620

TTACGATGTC ATCACTGCAG ATCGTGACTT GGCACCTGAA ATTCAGTCTA TGATCAAACG 4680

CACGATTGAA CGTGTCGTTG ATGGTCATGC GCGTGCCAAA CAAGATGAAA AACTAGAGGC 4740 

AATTTTGAAC TTTGCTAAGT ACAACTTGCT TCCTGAAGAT TCTATTACGA TGGAAGACTT 4800 

GTCAGGCTTG TCTGATAAGG CCATCAAGGA AGAGCTTTTC CAACGTTCCT TGAAGGTTTA 4860

CGATAGTCAG GTTTCAAAAC TACGCGATGA AGAAGCAGTT AAAGAATTCC AAAAAGTTTT 4920 

GATTCTACGA GTGGTGGATA ACAAGTGGAC AGATCATATC GATGCCCTTG ATCAATTGCG 4980

TAACGCGGTT GGACTTCGTG GCTATGCTCA GAACAACCCT GTTGTTGAGT ATCAGGCAGA 5040 

AGGTTTCCGT ATGTTTAATG ATATGATTGG TTCGATTGAG TTTGATGTGA CACGCTTGAT 5100 

GATGAAAGCA CAAATTCATG AACAAGAAAG ACCACAGGCA GAACGTCATA TCAGTACAAC 5160 

AGCGACTCGC AATATCGCTG CTCACCAAGC AAGTATGCCA GAAGATTTGG ATTTGAGCCA 5220 

GATTGGACGC AATGAACTTT GCCCATGTGG TTCTGGTAAG AAATTTAAAA ACTGTCACGG 5280 

TAAAAGACAA TAAAATGAGA TAGTTTAGAG GCGGATATCT TGTGAAAAGT AAATTTTTAC 5340 

TGGGTATCCG TTTGCTTTAT AAGGAGATGA GTTATGGTAT TTACAGCAAA AAGCTCTAAA 5400 

ATAAATATAG AAGAAGTTCG TGCCTTGTCA AAATTAGAAG GTCAGGCTTT G<?AGAGGAAA 5460

TCACAGCGAG ATCAAGAGCT AGAAGCCATT ATACGTGGAG AAGACCAGCG AATTCTCTTG 5520 

GTAATCGGGC CATGCTCATC TGACAACGAA GAAGCTGTCC TTGAATACGC TAAGCGTTTG 5580 

GCAGTCCTAC AAGAAGAAGT GGCAGATCGT ATCTTTATGG TTATGCGTGT TTATACTGCC 5640 

AAACCCCGTA CCAACGGAGA TGGCTATAAG GGCTTGATTC ACCAGCCTAA CGCGACAGAA 5700 

GCGCCTAGTC TTATCAATGG AATCAAAGCC GTTCGCCATC TTCACTATCG TGTCATCACA 5760 

GAAACAGGGA TGACAACTGC TGATGAAATG CTTTATCCTG AAAACCTTCC GCTTGTAGAT 5820 

GATTTGATTT CTTACATGGC AGTTGGTGCC CGTTCAGTTG AAGACCAGCA ACACCGCTTT 5880 

GTGGCAAGTG GGGCAGGATT TTCTACTGGT TTTAAAAATC CAACCTCTGG AAATCTCAAT 5940 

GTCATGTTTA ATGGGATTTA TGCTGCTCAA AACAAACAAA GTTTCCTTTT CTTAGGAAAA 6000 

GAAGTAGAAA CAACTGGGAA CCCGCTTTCA CACGCTATTC TTCGTGGTGC TCTTAATGAG 6060 

TATGGAAAAA ATATTCCCAA CTACTATTAT GACAATTTAA TTGATACCAT TGCCCAGTAT 6120 

GAGAAAATGG GCTTGGAAAA TCCTTTTATC ATCATTGATA CCAATCATGA CAATTCTGGT 6180 

AAGCAGTATA TTGAACAGAT CCGAATTGTC CGCCAGACCT TGATTAACCG TGCTTGGAAT 6240 
GAAAAAATTA AGCAGTTCGT TCGTGGTTTT ATGATTGAGT CTTATCTGGA AGATGGTCGA 6300

CAAAATGAGC CAGAAGTATT TGGTAAGTCT ATCACAGACC CTTGCCTGGG TTGGGATAAC 6360 

ACAGAAGCTC TTGTCAGAGA AATTTACAAA ACGTTAGGAG AATAAGATGG CATTTATTGA 6420 

AAAAGGTCAA GAAATCGATA TGGAAGTCAT CAAGGCTGAA ACCCAATTGT CTGCGGAAGC 6480 

CTTGAGACTC AAGGAAAGCC GTGACAGGGA ATTGGCAGAT ATTATTTCAG GGGAAGATGA 6540 

CCGTATTCTC TTGGTGATTG GTCCTTGCTC TTCTGATAAT GAAGAGGCGG TCTTGGAATA 6600 

TGCTCGCCGT TTATCTGCCT TGCAAAAGAA GGTAGCGGAT AAGATTTTCA TGGTCATGCG 6660 

CGTGTATACT GCTAAGCCTC GTACCAATGG AGACGGCTAT AAAGGATTAG TTCACCAGCC 6720 

AGATACTTCT AAGGCTCCAA GCCTGATTAA TGGCTTGCAG GCTGTGCGCC AGTTGCACTA 6780

CCGCGTGATT ACAGAGACTG GTTTGACAAC GGCAGATGAG ATGCTTTATC CGTCAAATCT 6840 

GATCTTGGTG GATGACTTGG TCAGCTACCA TGCCGTTGGA GCTCGTTCTG TGGAAGACCA 6900 

AGAGCACCGC TTTGTGGCTT CTGGGATTGA TGCACCAGTA GGGATGAAAA ATCCAACCTC 6960

AGGAAATTTG GGTGTTATGT TTAACGCCAT CTATGCTGCT CAAAACAAGC AAACCTTCCT 7020 

TTATCATGGG CAGGAAGTTG AGACATCAGG TAATCCTTTG GCCCATGTTA TCCTCCGTGG 7080 

AGCAGTCAAC GAGTATGGCA ATTATATGCC GAATTACTAC TATGAAAATC TACTCCAAGC 7140 

CATTGAACGC TATGAAACCA TGGGACTTGA AAATCCTTTT ATCCTCATTG ACACCAACCA 7200 

TGATAACTCA GGCAAGCAAT ATATGGAGCA GATTCGAATT GTTCGCCAGA CCTTGCAGAA 7260

TCGTGATTGG AATGAGAAAA TTAAAAAGAC GGTTCGAGGA TTTATGATTG AATCTTACCT 7320 

AGCAGATGGT CGTCAAAACC AACCAGAGAT CTTTGGTTGC TCTATTACTG ACCCTTGCCT 7380 

AGGTTGGGAA AATACAGAGG CCTTGGTAGA AGAGATTTAT GTTACCTTGA CAAAATAAGT 7440 

GAAAAGGATG GAGTTGGGGA ATCTCAACTC CTTTTGATGA GAATGATAGT TGGACACGGA 7500 

ATTGACATCG AAGAATTGGC TTCGATAGAA AGCGCAGTTA CACGACATGA AGGATTTGCT 7560 

AAGCGTGTAC TGACCGCTCA GGAAATGGAG CGCTTCACCA GTCTCAAAGG ACGCAGGCAA 7620 

ATAGAATATT TAGCTGGTCG CTGGTCGGCT AAGGAGGCCT TTTCCAAGGC TATGGGAACG 7680 

GGCATTAGCA AGCTCGGTTT TCAGGATTTG GAAGTCTTGA ACAATGAACG TGGGGCGCCT 7740 

TATTTTAGTC AGGCACCATT TTCAGGAAAG ATTTGGCTGT CTATCAGCCA CACCGATCAG 7800 

TTTGTGACAG CCAGTGTCAT TTTGGAGGAA AATCATGAAA GCTAGTCCAC ATAGACCAAC 7860 

CAAGGCTCTG ATTCATCTGG GAGCTATTCG ACAAAATATT CAGCAAATGG GGGCTCATAT 7920 

CCCTCAAGGA ACGCTCAAGT TGGCTGTGGT TAAGGCCAAT GCTTATGGTC ATGGAGCTGT 7980 

TGCCGTTGCC AAGGCAATTC AAGATGATGT TGATGGCTTT TGCGTTTCCA ATATCGATGA 8040 
AGCCATTGAA CTCAGACAAG CTGGACTCAG CAAGCCAATC CTCATTTTAG GAGTTTCTGA 8100

AATCGAAGCT GTTGCTCTAG CTAAAGAATA TGACTTCACC TTGACAGTGG CTGGACTGGA 8160

GTGGATTCAA GCACTCTTAG ATAAGGAAGT GGACCTAACT GGATTGACAG TCCACCTCAA 8220

GATTGATTCA GGGATGGGAC GGATTGGTTT TAGAGAGGCA AGTGAGGTTG AGCAGGCTCA 8280

AGATTTGCTC CAACAACACG GTGTTTGTGT TGAAGGAATC TTTACCCACT TTGCTACTGC 8340

TGATGAGGAA TCAGATGACT ATTTTAATGC CCAGTTAGAA CGGTTTAAAA CTATTTTAGC 8400

TAGTATGAAG GAAGTTCCAG AGCTGGTTCA TGCTAGCAAT TCTGCAACGA CTCTTTGGCA 8460

TGTAGAGACT ATTTTCAATG CGGTTCGTAT GGGAGATGCC ATGTATGGCC TCAATCCAAG 8520

TGGAGCGGTC TTGGATTTGC CTTATGATTT GATACCGGCC TTGACCTTGG AGTCTGCTCT 8580

GGTTCATGTC AAGACAGTTC CAGCTGGAGC TTGCATGGGC TATGGAGCAA CTTATCAAGC 8640

GGATAGCGAG CAAGTCATCG CGACCGTGCC AATCGGGTAT GCAGATGGAT GGACAAGAGA 8700

CATGCAAAAT TTCTCTGTCT TGGTAGATGG CCAAGCTTGC CCAATTGTCG GCAGGGTTTC 8760

GATGGACCAA ATCACTATTC GATTGCCTAA GCTTTATCCG CTAGGAACCA AGGTAACCTT 8820

GATTGGCTCC AATGGGGATA AGGAAATCAC TGCAACTCAG GTAGCGACCT ACCGCGTAAC 8880

CATTAACTAT GAGGTGGTTT GCCTCCTCAG CGACCGTATT CCGAGAGAAT ATTATTAGAA 8940

AAGAAAGGAG TGGAGCATGA ATCTACATCA ACCCTTGCAT GTCTTGCCTG GT<-TGGGACC 9000

AAAGTCAGCA GAAAAATACG CCAAACTAGG AATTGAAAAC TTGCAAGATC TCTTGCTCTA 9060

CTTTCCTTTC CGTTATGAAG ACTTCAAAAC CAAGCAGGTG CTGGAGCTGG AAGACGGTGA 9120

GAAGGCAGTT CTTTCTGGTC AGGTAGTGAC TCCTGCTAGT GTCCAGTATT ATGGTTTCAA 9180

GCGCAATCGC CTGCGTTTTA GTCTCAAGCA GGGAGAGGTC GTTTTTGCGG TGAATTTCTT 9240

TAACCAGCCC TATCTGGCTG ATAAAATAGA GTTGGGAGCA ACCCTTGCTG TCTTTGGAAA 9300

ATGGGACCGC GCTAAGGCTA GTCTGACTGG GATGAAGGTT CTGGCTCAGG TAGAAGATGA 9360

CCTCCAGCCT GTCTATCGTC TGGCTCAGGG AATCAGTCAG GCCAGTCTGG TCAAGGTCAT 9420

CAAGACGGCT TTTGATCAGG GACTGGACCT CTTGATAGAA GAAAATCTGC CCCAGTCTTT 9480

ACTAGACAAA TACAAACTCA TGTCCCGTTG TCAGGCAGTC CGTGCTATGC ATTTTCCAAA 9540

GTATTTGGCA GAATACAAGC AGGCTCTTCG CCGTATAAAG TTTGAGGAAC TCTTTTATTT 9600

CCAAATGCAG CTGCAGATGC TCAAGTCTGA AAATAGAGTT CAGGGAAGTG GTCTGGTTCT 9660

GAATTGGTCT CAGGAAAAAG TGACAGCAGT TAAAGTAAGT CTTCCTTTTG CCCTGACCCA 9720

AGCTCAGGAA AAGAGTTTGC AGGAAATTTT AACTGATATG AAGTCCGACC ACCACATGAA 9780



WO 98/18931 PCT/US97/19588






TCGTCTCCTA CAAGGGGATG TGGGGAGTGG AAAAACGGTA GTCGCTGGCT TGGCCATGTT 9840

TGCGGCAGTG ACAGCAGGTT ATCAGGCTGC CCTAATGGTA CCAACAGAAA TCCTCGCAGA 9900

GCAACACTTT GAGAGTTTAC AGAACCTTTT TCCCAATTTG AAACTGGCTC TCTTGACAGG 9960

TTCCTTGAAA GCTGCAGAAA AGAGAGAAGT CTTGGAGACC ATTGCCAAGG GTGAGGCTGA 10020

TTTGATTATA GGAACTCACG CTCTGATACA AGATGGGGTG GAGTATGCTC GTCTTGGTTT 10080

GATTATTATG GATGAGCAGC ACCGTTTTGG TGTAGGGCAA AGGCGTATTT TACGGGAAAA 10140

AGGTGACAAT CCAGATGTCC TCATGATGAC GGCGACTCCC ATTCCACGGA CGCTTGCCAT 10200

CACAGCCTTT GGAGATATGG ATGTTTCCAT TATCGACCAG ATGCCAGCAG GTCGGAAGCC 10260

TATTGTGACG CGCTGGATCA AACATGAGCA ACTACCTCAG GTCTTGACTT GGTTAGAGGG 10320

GGAAATTCAA AAAGGTTCCC AAGTCTATGT CATCTCTCCT TTGATTGAAG AATCAGAAGC 10380

TCTAGATTTG AAAAATGCCA TTGCCTTATC AGAGGAGTTG ACGACTCATT TTGCAGGCAA 10440

GGCAGAGGTG GCTCTTCTAC ATGGTAGGAT GAAGAGTGAC GAAAAAGACC AGATCATGCA 10500

GGATTTCAAG GAGAGAAAGA CGGATATTCT GGTTTCGACG ACGGTTATTG AGGTTGGGGT 10560

CAACGTTCCC AATGCGACTG TCATGATTAT CATGGATGCC GATCGCTTCG GTCTCAGTCA 10620

ACTTCACCAG CTTAGAGGTC GTGTCGGTCG GGGGGACAAG CAGTCCTACG CTGTTCTCGT 10680

TGCTAATCCC AAGACGGATT CTGGGAAAGA CCGCATGCGT ATCATGACAG AAACGACCAA 10740

TGGATTTGtC CTTGCGGAGG AAGATTTGAA AATGCGTGGT TCTGGTGAGA TTTTTGGAAC 10800

CAGACAGTCA GGACTTCCAG AGTTCCAAGT GGCTGATATT ATCGAAGATT TTCCGATTTT 10860

AGAAGAAGCA AGAAAGGTTG CTAGCTACAT TAGTTCTATA GAAGCTTGGC AAGAAGATCC 10920

AGAGTGGCGC ATGATTGCCC TTCATCTGGA AAAGAAAGAA CATCTGGATT AAGCTTTCTC 10980

TAAGGAAAAC TTATACTCAA TGAAAATCAA AGAGCAAACT AGGAAGCTAA CCGCAGGTTG 11040

CTCAAAACAC TGTTTTGAGG TTGTGGATGA AACTGACGAA GTCAGCTCAA AACACCGTTT 11100

TGAGGTGGCA GATAGAACTG ACGAAGTCAG TAACATATAT ATACGGTAAG GCGACGCTGA 11160

CGTGGTTTGA AGAGATTTTC GAAGAGTATT AAGCTAGTTT TTAGGTTTGG CTCTTATACT 11220

AGAGTCATCA AAAAGAAACG AGGACTCTCA TATGACAGTA ACGATTAAAG TAAATTACCA 11280

AACCACTTTC CAAAAGAAGG AAGCAAAAAA CTAGTATAAA CAGAAGAGAG AGCGAAATGC 11340

TCTTTTTTCG TTTCTAAAAC TACTTTCAGC CCATCATCCT AAAAGTAAAG AATCTAAATT 11400

CACTTTCTAT TTACCCTTCT TTCTTGCATT GATTACATAG ATATGCTACA GTTGTGGTAA 11460

CGATTACAAA ATAAAAGGAG CATGCTATGA AAAATCCAGC TTTGCTAGAA GAAATTAAGA 11520

CCTATAGAGG AAGGGATGAG GTTCCGGAAG ACTTTGATGA TTTCTGGGAT GGGGAAGTGA 11580
AAAATGTTTC CACGCTTCCA TCCTACCACT TGGAGGAAAG AGATTTCCAC ATTCCTCAAG 11640

TCAAGTGCTA TGAGTTAACA TTTGAAGGAA GCAAGGAAGG AAAGGTCTAT GCACGCATTG 11700 

TTCTTCCAAA GAGTGAGGAG AAGGTCCCAT TAATCTTCCA TTTTCATGGT TATATGGGAC 11760 

GTGGCTGGGA CTGGGCCGAC ATGCTGGGCT TCACCGTAGC TGGTTACGGT GTTGTTTCCA 11820 

TGGATGTGCG GGGCCAGTCA GGTTACTCAC AAGACGGCTT GCGTTCTCCT TTAGGAAATA 11880 

CCGTGAAGGG GCATATTATC CGTGGTGCTG TGGAAGGTCG GGACCACCTC TTTTATAAGG 11940 

ATGTTTATCT GGATATTTAC CAGTTGGTCG AAATTGTTGC TAGTCTGTCT CAGGTTGATG 12000 

AGAAGCGTCT TTCTAGCTAT GGTGCCTCAC AAGGAGGGGC TCTAGCTCTA GTTGCAGCAG 12060 

CGCTCAATCC TCGAATTCAG AAAACAGTTG CCATTTATCC CTTCTTGTCA GACTTCAGAC 12120 

GGGTGATTGA GATTGGTAAT ACTAGCGAGG CTTACGACGA ACTTTTCCGT TATTTCAAGT 12180 

TTCACGACCC CTTCCATGAA ACAGAGGAGG AAATCATGGC GACCCTTGCC TATATCGATG 12240 

TCAAAAATCT TGCCCATCGT ATCCAAGGTG AGGTTAAGAT GATTACGGGC TTGGACGACG 12300 

ATGTTTGCTA TCCCATTACC CAGTTTGCGA TTTATAATCG TCTGACCTGC GATAAAACCT 12360

ATCGCATCAT GCCTGAGTAT GCTCACGAAG CCATGAATGT ATTTGTCAAT GACCAAGTCT 12420 

ACAACTGGCT CTGTGGAAGT GAGATTCCTT TTAAATATCT AAAATAAGGA GTCGACTCTA 12480

AGCACAAAAT CTTAAAAATT ACAAACACGC ATAGTATCAG GGGATTAAGA AAACTTTATA 12540

CTATGCGTTT TATCATGGAA ATATAGTAAA ATGAAATAAG AACAGGACAA ATCGATCAGG 12600

ACAGTCAAAT CGATTTCTAA CAATGTPTTA GAAACAAATG TGTACTATTC TAGTGTCAAT 12660

CTATTATATT TATAGAATTT TTTGTTGCTA GATTTGTCAA ATTGCTTAAA ATAATTTTTT 12720 

TCAGAAAGCA AAAGCCGATA CCTATCGAGT AGGGTAGTTC TTGCTATCGT CAGGCTTGTC 12780 

TGTAGGTGTT AATACTTTTC AAAAATCTCT TCAAACCACG TCAGCTTCGC CTTGC 12835 

(2) INFORMATION FOR SEQ ID NO: 142: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5020 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142: 

GGGGATATGA AGAACAAAAG AATATTTAAA GACTTCCAAG CTTCAAAAAT GAGTTTAAAC 60 

ATTTACACAA GCCCCTTGTT AGCCTTTGTT TTTGTCTTCA TAGGAGAGTT TGTGGCTTTT 120 
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ACTTTGTATG GTATTGGCTT GTTAGCTCTC ATCGGACTTG CTAGAAATTT TGGAGAGGCT 180 
GGTCAAAATC TTGCAAGCTA CTTGCAGACC TTGCATCAGA GCTTGACGGA TAAAACAAGT 240 
GACTTTCGTT TAATTTTAGG ATTACTGGCC TTTGGTTATT CTTAACACTG TGTTCAGATG 300 
GACAAGAAAA GTTGAGAAAA GACCTATTCG AACCTTGGGA TTTTATAGAG AGAATTTCCT 360 
CAGCAATCTT CTGAAAGGAT TTAGTCTAGG CCTGGCACTT TTTCTTCTGA CCTTGTTAGG 420 
TTTAGTGGTC TTAGGTCAAT ATCGTTTGGA ATCCATTCAC TTGAATCCTT ATTCTCTTGC 480

CTTTGTCGTC TTTACTATCC CATTTTGGAT TTTACAGGGG ACAGCAGAAG AAGTGGTGGC 540 
CCGTGCTTGG CTACTTCCTC AATTGGCCTC AAGAACCAAT CTAAAACTAG CTATTCTTAT 600 
ATCTAGCCTG TTCTTTACCC TGCTTCATAT GGGCAATTCT GGTCTCACCC CTCTATCTCT 660 
AGTAAATCTC TTTTTATTCG GAGTTGCCAT GGCTCTTTAC CTTCTCAAAA CTGATACAGT 720 
TTGGGGTGTT GCAGGTATTC ATGGTGCTTG GAATTTTGCT CAGGGTAATC TCTTTGGGAT 780 
TTTAGTTAGT GGTCAACCGT CAGaACGTCT CTGATGACCT TTTTACCACA AGGCAATC7U\ 840

GATTGGCTAT CAGGTGGTTC TTTTGGCATA GAAGGTTCCA TTATGACAAG TCTGGTATTA 900 
CTACTGCTGA TTGTCTATCT TGCTAATAAA TTAAAGAAAG AAAATGAAAG GATGTGACTT 960 

CGGTCCGTCC TTTTCTTCGT GAAAATACTA TAAGTATGCT AAAATAGGAA TAGCACATGG 1020 

AGAGAGGATT CTTATGATCA ATCACATTAC AGATAATCAA TTTAAACTAG TATCAAAATA 1080 

TCAACCATCA GGAGATCAAC CCCAAtiCTAT CGAGCAGTTG GTGGATAACA TTGAGGGGGG 1140 

AGAAAAAGCT CAGATTCTGA TGGGGGCGAC TGGAACAGGG AAGACCTATA CTATGAGTCA 1200 

GGTCATTTCT AAAGTCAATA AACCAACTCT GGTTATTGCC CACAATAAAA CTCTGGCTGG 1260

TCAGCTCTAT GGGGAGTTTA AGGAATTTTT CCCTGAAAAT GCAGTTGAGT ATTTCGTATC 1320 

CTACTATGAT TATTACCAGC CAGAGGCCTA TGTCCCOTCT AGCGATACCT ATATTGAGAA 1380

GGATAGTTCT GTCAATGACG AGATTGACAA GCTTCGCCAC TCAGCTACCT CAGCCCTTTT 1440 

GGAGCGTAAT GATGTTATTG TCGTGGCCTC AGTCTCTTGT ATCTATGGTT TGGGTTCGCC 1500 

CAAGGAATAC GCTGATAGTG TCGTTAGTCT CCGTCCTGGT CTAGAGATTT CTCGTGATAA 1560 

ACTCTTGAAT GACTTGGTCG ATATTCAGTT TGAACGTAAT GATATTGATT TCCAACGCGG 1620 

AAGATTTCGC GTTCGTGGGG ATGTGGTAGA GATTTTCCCA GCTTCCCGAG ATGAACATGC 1680 

CTTTCGAGTA GAATTTTTTG GAGACGAAAT TGACCGTATT CGTGAAGTTG AGGCTCTGAC 1740 

AGGTCAGGTG TTGGGAGAAG TGGATCATTT AGCGATTTTC CCAGCGACAC ACTTTGTGAC 1800

CAATGACGAC CACATGGAAG TTGCCATTGC AAAGATTCAG GCCGAGTTGG AAGAACAATT 1860 

AGCTGTCTTT GAAAAGGAAG GTAAACTGCT TGAAGCCCAG CGTTTGAAAC AGCGGACAGA 1920 
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GTATGATATC GAAATGTTGC GTGAGATGGG CTATACCAAT GGGGTTGAAA ATTATTCTCG 1980 

CCACATGGAT GGACGGAGCG AAGGAGAGCC TCCTTATACG CTTCTCGACT TCTTCCCAGA 2040 

TGATTTCTTG ATTATGATTG ACGAGAGTCA TATGACCATA GGGCAAATCA AGGGCATGTA 2100 

CAATGGAGAC CGTTCGCGTA AAGAAATGCT GGTTAATTAT GGTTTCCGTT TGCCGTCTGC 2160 

TTTGGACAAT CGTCCTCTCC GTCGGGAGGA GTTTGAGAGT CACGTTCATC AGATTGTTTA 2220 

CGTTTCAGCG ACACCTGGTG ACTATGAAAA TGAACAGACC GAGACAGTGA TTGAGCAAAT 2280 

CATTCGTCCA ACGGGACTCT TGGATCCAGA GGTGGAAGTC CGTCCGACTA TGGGACAGAT 2340 

TGATGACCTC TTGGGTGAAA TCAATGCCCG CGTTGAAAAA AATGAGCGTA CCTTTATCAC 2400 

AACTTTGACC AAGAAAATGG CAGAGGATTT GACCGACTAC TTCAAGGAAA TGGGTATCAA 2460 

GGTCAAGTAC ATGCACTCGG ATATCAAGAC CTTGGAACGG ACGGAGATTA TCCGTGACCT 2520 

GCGCTTGGGT GTCTTTGATG TCTTGGTCGG AATTAACCTG CTCCGTGAAG GAATTGACGT 2580 

TCCTGAAGTG AGCCTCGTAG CTATTCTCGA TGCTGACAAG GAAGGTTTCC TTCGCAACGA 2640

ACGTGGACTC ATCCAGACCA TTGGACGTGC TGCACGTAAT AGCGAAGGTC ATGTTATCAT 2700

GTATGCGGAC ACGGTTACCC AGTCTATGCA ACGTGCTATC GATGAAACTG CCCGCCGTCG 2760 

CAAAATCCAG ATGGCCTATA ATGAAGAACA TGGTATCGTT CCACAAACCA TCAAGAAAGA 2820 

AATCCGTGAC TTGATTGCTG TGACCAAGGC AGTTGCTAAG GAAGAAGACA AGGAAGTCGA 2880

TATCAATAGC CTCAACAAAC AAGAGCGCAA AGAACTAGTC AAAAAGCTTG AGAAACAAAT 2940 

GCAAGAAGCA GTTGAAGTGC TTGACTTTGA ACTAGCAGCT CAGATTCGTG ATATGATGCT 3000 

GGAAGTCAAG GCCTTGGATT AGGGGAATAG TATGATTTAT TTAAGAAAGT TAAAGAAAGA 3060 

AGATTTGATG TCTTTATGGG AAATGGCTTA TTCACAGCTT AATCCAGTTT GGAAACAGTA 3120 

TGATGCTCCC TATTATGATG ATTATCAGTA TTTTTCAAAT TTTAAAGAAT TCGAACTACA 3180 

AAAATCAGAA TCCATTTTAA GCAACTCAAA TCGCCTTGGT ATTTTTGTTG ATGATAAACT 3240

AGTTGGGACT GTTTCGCGTT ATTGG6TATG TAAAGAAAGA AGATGGATGG AATTGGGAAT 3300 

TGGTATTTAT GATAAAAAAT TCTGGAACAC TGGTATTGGG AAAGTTGCTA TGTTGCAGTG 3360 

GATAGATAGG ACGTTTCAGG ATTACTTGGA GTTGGAGCAT CTGGGTTTGA CAACTTGGTC 3420 

AGGAAATATT GGTATGATGA AACTTGCTGA AAAATTAAGA ATGAAAAAAG AAGCTCATAT 3480 

TCCAAAAGTT CGTTATTATC AAGGTAAATA TTTTGACAGT ATTAAATATG GTATTTTGAG 3540 

AGAAGACTGG GAGAAAATAA ATGACGGTTA TTATCAAATC AATGGAAACT CCTGAAGAGA 3600 

TAGAAGGTAA ATCCTTCGTT CACTGGCAAA CGTGGAGAGA GGCTTATGAT GACCTTTTGC 3660 
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CTGCGGAATT TCAGGAGACA ATGACATTAG AAAGATGTCG ACTCTTTAGT CAAAAGTATC 3720 

CAGAAAATAC ATTGATTGCG ATGGATGGTG TGAAGATAGT TGGTTTTATA AGTTATGGCA 3780 

ACTGTCGTGA TGAGACTATT CAAGCTGGTG AAATTATTGC TTTATATGTT TTAAAAGACT 3840 

ATTATGGAAA AGGAATCGCA CAAAAGTTAG TGAAAGCAGC TTTGACTGAT CTTAATCATT 3900 

TTTCTGAAAT TTTCTTATGG GTATTGAAAG ATAACAAGCG CGCCATTGCT TTCTATCAAA 3960 

AAATGGGTTT TACTTTTGAT GGACAAGAAA AAATACTTGA ACTTGGAAAG CCTATAAAGG 4020 

AAAAACGGAT GGTATTCTAT TCTAAATAAT TCTCAAAAGT AAAAGCTAAT ATGGTACCAA 4080 

GTCTGAAAAT TTAATAAATT AGAAAGCGAG TAAATTTATG TCCCGTTCCC AATTAACAAT 4140 

TTTAACAAAT ATCTGTCTGA TTGAAGACCT CGAAACTCAG CGCGTGGTGA TGCAGTATCG 4200 

CGCCCCTGAA AACAATCGCT GGTCTGGTTA TGCCTTTCCT GGAGGTCATG TAGAAAATGA 4260 

TGAGGCTTTT GCGGAGTCTG TCATTCGTGA AATCTACGAA GAAACAGGGT TGACTATCCA 4320 

AAATCCTCAA CTTGTCGGCA TTAAAAATTG GCCACTAGAT ACAGGTGGGC GCTATATTGT 4380 

CATTTGTTAT AAGGCGACTG AGTTCTCTGG TACCCTTCAA TCTTCAGAAG AGGGAGAAGT 4440 

TTCTTGGGTG CAAAAAGACC AGATTCCAAA CTTAAATCTG GCCTATGATA TGCTACCATT 4500 

GATGGAAATG ATGGAAGCTC CCGACAAGTC AGAGTTTTTC TACCCTCGCC GTACAGAAGA 4560 

CGATTGGGAA AAGAAAATCT TCTAGTCTTT TACTAAATAA CCTAGCTGAT CCAAGGCCTC 4620 

CTCGATATAG xGGAGGTL"iT GTTGTGTCTC GGCTTCAACT AGGTGATAAT GAATACCATC 4680

TGTTAACTCA GAAATTGGCT TAAAGTCAGA ACGTTCAACT TGTTCTAGAA AATGTTGCAC 4740 

GTCGCGGCGA CAGGTCAGTT TTAGTAAGGT TTCAATCTCT CCATAAACAG GATGATCAAT 4800

CAAGATATTT TGAACGCGAC CACCATTATC TACGATAGCA AGTAATTCTC GTCCAATTTC 4860 

TTCAACTTCA TGCTTGACCT TAAATAATTT GTGATGATAA GTATTTGCAT TAGCATCTTT 4920 

ATAGATATAA CCACGATTGG TAGATAGAAT TGGAGATCCA TCAGCTCTTA AAATTGCAAT 4980

ATCTTGAACA ATAACTTGTC GAGTGACATG AAAGTGCTCA 5020 
(2) INFORMATION FOR SEQ ID NO: 143: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4965 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143: 
AAAAAGTGGC AATCCATTGA TTGGCCACTT CATTTAGAGA ATTATCGTCT CGCCCTTGAA 60 
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GAAGAAGGTC 


flTf^ T a f^T a r* T 
u i u i 1 AL 1 


TvaAGTTACTG 


CTATCGCTAG 


AACTACTACT 


' TGAACTGCTG 


120 


GAGCTGGATG 




ACTCCCCACA 


ATACTAGACC 


AAGCATTCTG 


ATAATCCGCA 


180 




U AA 1 AtjCAAA 


GCGATAACTT 


GTCGCTGGCG 


CTCCTGACTT 


ATTAGCCCAA 


240 




^A(j 1 LuAAt-L. 


TGTGACCTCT 


ACTTCTTTTC 


CTTCAACAGA 


AACCTTCTCT 


300 


i 1 i 1 1 AL. 


CTGTTGATTT 


CAAGACTTCC 


GATTTCACTA 


CACTAGGATC 


TAAAGCAAAG 


360 




CCCAAATGCT 


TGGGGAAGCT 


TGCTGAATCG 


CATTTACCAG 


ATGAGCCATG 


420 




TATTAGAATA 


ACCTGCTCTA 


CGTGACAATG 


AATGATTATC 


ATCATGCCCA 


480 


ATCCAGCCAC CTAGGGTTAA TCTAGGTGTC GAAAGCATGA GCCACATATT TTCGTCTTGG 540

TTGGTTGTAC CAGTCTTCCC AATCCAATCT GCATTAGCCA GAGTAGGATT TAAAGAAGTC 600

AGGTTAGACT TGAAGGTTGT TGTCACACGA GAGGATAGAA CTTCTCGTAG CAATCCCTGC 660

ATAATCGTCG CAGTAGCTTT TGAATAGACT TGAACCGGTT TATCCTGATA CTCATACACC 720

ACTCTACCAT CTGCTGCTTC AATCTTTGAA ATCACATGCT TCTGATGATA AACTCCATTA 780

TTAGCTAAGG TCTGATAGCC ATTGGTATGC TGGGCAACTG TGACTTCAAT ACCACCACCC 840

ATTGGCAAGC TCTCAATACC GTACTCAGGA ATCTCGTAAC CCATCTTTTC CATATAACCC 900

TTGACATCAA CACCCTTTTC ACGGAGCATA CGATAGGTCC AGTAAGCAGG GATATTCCAT 960

GAATAGTTCA GAGCTTCTCC CAAGGTCATC ATTCCTGTTC CCTTGCTATT AGCATACATA 1020

ATCGGATTGC CATTAGCAAA GTTTGTTGGA TAGTTAGATA GAATCGTTTC ACTTCCCATC 1080

AAGCCCTGGT CAATAGCAAT ACCGTAGGCC AGCAAGGGCT TGGTAGTAGA AGCTGGCGAA 1140

CGTTTGGTAT CAAAGGCATG ATTATTTTGA TTTTCTTGAT AATTACGACC ACCTACAAAG 1200

CCTAGAATAG CACCTGTTTG GTTATCCATC AAGACATTCC CTACTTCTAC ACGACCTGTT 1260

CCATCGTCTA AAAGATAGCC ATAATCAGCA ACCGCACTTT GCATGGCAGA ATGAATTTTC 1320

TGATCTATGG TAGTAGTAAT CTTATAACCA CCATTTTCAA TTTCCTTGGC TGCCAAATCT 1380

CGATAAAACT TCTGAGTTGC CTCATTTTTC AACTCCTTAG CGGAGACATT GTCTCTCTGA 1440

GCTAGATAGT CATACATACG TTCTTGAGCT TCTGCCAAAG TTGTAAAGTA TAAATAGTCT 1500

CGTGAAATTC CTGTAACCGT GCCCGATGGT AAAAAGTCCT GTTTAAGGTC ATAATCCTTG 1560

TACTGAGAAT ACTCGTCTTT GCTTAATGCA CCTGTACGAT ACATACTGTA AAGAACTGCC 1620

TTAGCCCGTC TTAAGCCAAT TTCTAGGTCT TCATCACTCT TCAACTCCCC AGTATTTTCA 1680

TAAGGAGAGT AAGTAATGGG ACTCTGTGGA AGTCCTGCTA AAAATGCTGC TTGAGGAACA 1740

GTCAACTGAC TGGCATCTAC ACCGAAAATT CCCTCAGCTG CTTGCCGAGC CCCTGCAATA 1800

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TTCTGTCCCT TATTATTTCG GCCAAAGGGA GCCACATTGA GATAGGTCGT TAAAATCTCA 1860 

TCTTTATTCA TGGCGCGTTC CAAGGCAAGA GCATCCACAA TCTCTGCCGC CTTACGAGCC 1920 

AAGGTCGGCG CATCCCCAAC CACCTGCTGT TTAATTAGTT GCTGGGTCAA GGTTGAACCC 1980 

CCACTAGAGG AACCCAAACC TACAAATTTC CCCAAGGTCG CACGAATCAC CGCCTTGGGT 2040 

ACTACACCCT TATGTTCTTT AAAGTGTTCA TCTTCTGTCG CAATGATAGC CTTCTTCAGA 2100 

TTTTCCGAAA TTTGCTCAGA TGAGATAGAA GTGCGCAACA AATCACTCTC TATGGAAGCA 2160 

ATCACCGTCC CGTCCGAATA GGTAATCTCT GAAATAGAAG AGATGTCCTT GACCTGATTC 2220

ACCAATTCTT CTGTCTGAGG CACCCGAACC TTGTCAAATA AGGCCACTCC GTATCCCAAA 2280 

GCAATCCCAG CTCCCAACAT TCCTCCTAGA AAACCGAGTA CAAAGAGTAA GTTAAATAAG 2340

GCTTTTATAC TCAGTAAAAT AGCTGGGAAA ATGACTGACT TATCTAAGGT TTTAGATTTT 2400 

TTGGTACTTG AACCTTTCTT GCCAGGTCTA GCTGATTTTT TATTTTTTTG TTTTTGCTGG 2460 

AAAAATTCCA GCATTTTTCG TTTTAATTCA TTTAATTGAT TTTGCATGGA TTTCCTCACT 2520 

TTATCTATTA TACCACAAAA GGGAAATTTT CAATAAAATA GCCACTTTCT TCCCTATTCT 2580 

GCTAGGCTAT TGCCCAAGTT TGTGATACAA TAGGTAGAAA CAATAATTTT AAAAAGGAGA 2640 

AAAAACACAT GCACATTTTT GATGAGCTAA AAGAGCGTGG TTTGATATTT CAAACGACTG 2700 

ATGAAGAAGC TTTGCGTAAA GCCCTAGAAG AAGGTCAAGT TTCTTATTAT ACTGGCTACG 2760 

ATCCAACTGC 'I-tiACAGCCTT CACCTAGGCC ACCTTGTCGC AATCTTGACA AGTCGTCGCT 2820 

TGCAACTAGC AGGTCACAAA CCTTATGCGC TCGTTGGCGG TGCTACAGGT CTCATCGGAG 2880 

ATCCGTCCTT CAAAGATGCT GAACGTAGTC TCCAAACAAA AGACACAGTA GATGGCTGGG 2940 

TCAAGTCTAT CCAAGGACAA CTTTCTCGTT TTCTTGACTT TGAAAATGGC GAAAACAAGG 3000 

CTGTCATGGT CAACAACTAC GACTGGTTTG GCAGCATCAG CTTCATTGAC TTCCTCCGTG 3060 

ATATTGGAAA ATACTTCACG GTCAACTACA TGATGAGTAA GGAATCTGTT AAAAAACGGA 3120 

TCGAAACAGG AATTTCTTAC ACTGAGTTCG CTTACCAAAT CATGCAAGGG TATGACTTCT 3180 

TCGTCCTTAA CCAAGACCAT AATGTCACTC TTCAAATCGG TGGTTCTGAC CAGTGGGGAA 3240 

ATATGACAGC TGGTACCGAA TTGCTTCGTC GTAAGGCGGA CAAGACTGGT CACGTTATCA 3300 

CTGTTCCACT AATCACAGAT GCAACTGGTA AGAAATTTGG TAAATCAGAA GGAAATGCCG 3360 

TCTGGCTCAA TCCCGAAAAG ACTTCTCCAT ACGAAATGTA CCAATTCTGG ATGAACGTGA 3420 

TGGACGCTGA CGCTGTTCGC TTCTTGAAAA TCTTTACTTT CTTGTCACTT GATGAGATTG 3480 

AAGATATTCG TAAACAATTT GAAGCAGCGC CACACGAACG CTTGGCTCAA AAAGTCTTGG 3540 

CTCGTGAAGT TGTTACACTT GTTCACGGAG AAGAAGCCTA CAAAGAAGCA CTTAACATCA 3600 
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CTGAGCAACT CTTTGCAGGA AACATCAAAA ACCTTTCTGT CAAAGAGCTC AAACAAGGAC 3660 

TTCGTGGTGT GCCCAACTAC CAAGTACAGG CAGACGAAAA CAACAATATC GTGGAACTGC 3720 

TCGTCTCATC TGGTATAGTT AACTCAAAAC GCCAAGCCCG TGAAGACGTC CAAAACGGAG 3780 

CCATCTACGT AAACGGCGAC CGCATCCAAG AGCTTGACTA TGTCTTGAGT GACGCTGATA 3840 

AGTTAGAGAA TGAACTGACT GTTATCCGTC GTGGGAAGAA AAAATACTTT GTATTGACTT 3900 

ACTAAACTAT TCAACATTTA TCTATAAACA AAGGAGTTAA CCTCGAGAAA GGTAACTCCT 3960 

TTTTGCTGTT AATAACTCTC ATCTATCTAT TTTTAATAGA CAGGCTACGC AGGACAATGC 4020 

GCAAGGTTGT TAGATTATGT AAGATAGAGA GATTTGAAGG ACTGAACCAA TTAAATAAGC 4080 

CAAAGCCAAT CAAACTACTA TTTACGACAA CGGTATCCTG AATATTTTTC TTGATGAGTG 4140 

TTTGCAAAGA TGATGATAAC GAATCCAACT CTTGGAAGAA ATCCAAACGA TTATCTAACA 4200

ATAAGATATC ACTCATCTGC TTAGAAATAT CTGCACTCTC ATTCATCACC ACACCGATAT 4260

CTGATAGAGT TAAAGCCGCT GAGTCATTCA ATCCATCTCC AACCATCAAA ATAGTGTGAC 4320

CTGCTTTCTG CAGTTTCTCT ACTAACTCAA ATTTCCCATC AGGTTTCAAG TCTGTATAGA 4380

CCTGATCAAA GGGCAAATCT TTGACTAATT CCTCTGTCCT AATCAAGGTG TCTCCTGTTG 4440 

CCAGAATCAA TTTTTTCCCC TGTGCCTTAA GTTTATCCAA GGCTGTTTTT GCTTCTTTTC 4500 

TCAAAGGAGT ATGAATGCAG AACATTCCAA TCAATTCATT TTGATAAGCC /J^GAATAAGA 4560

GATTGTAGTG ACTCTTGTAC TCTTCAATTA AAGCATTTTG TTCTGAACTG ATATGAATCT 4620

GCTCATCCTG CATCAAGACA TAATTCCCAA TAAGAACTGG TTGGCCATCT ATATGAGATT 4680 

TGATCCCCTT GCTTGCGATA TATTGGAGTT TCCCATGCAT TTCCTCATGT TCAATTCCCT 4740 

CTATCTCAGC TTGCTTGACG ATGGCATTAG CAATAGGATG ATAAATGTGT TCCTCAAGAC 4800

AGGCACTGAT TCTGAGAATA TCTTCCTCAC TATAGTCTCC AAAAGGTAAC ACCTTTTCAA 4860 

CTATAGGATA ACTAGTTGTG ATTGTTCCTG TCTTATCAAA CAAGAAAGTA TCAACTTCCA 4920 

GATATTTCTC CCTGTTGTGG CCTCTGGCTG TCATCTCTGT GCTGG 4965 
(2) INFORMATION FOR SEQ ID NO: 144: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3232 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144: 
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CAGGGGCGTA 


TTACGTGACA 


ATTCAATGTA 


GGCTGTCGCT 


ACTTGCGCCA 


AAACAAGGAT 


60 


TCGATAATGT CGGATGATAC TAACGATTAA ACCGAGCAGA AAGGATCCCA AAATTCCCCA 120

AACTGCAATA TGCAAGGTCA GAAAGAATGC CTTTTGATAT AGTGGTAGAT ATTGTTCAAC 180

AATGGATCAA TCCAAAAATA GAACCTCCCA TCTAGAAATA ATACAGTTAT TGTAGCACTT 240

AAAATCTTCT TTGGATAATA TCTATTTTTT ATTGCCGTTA TAAGGATTTT TATCATAGAC 300

ATAAAATTTC TGAAATTTCC AAACAAAATA TTTTAAAAGT TTTGAAAAAG AGTTAAGATA 360

TTTTTGTAAT ACACAAAGTA AACGCTTACT TATTAAGGAG GACATTTTAT GTCATACAAA 420

ACAAGCAATG CAGAAGGTCA TGTAGATTTC ATCAATACCT ATGATTTGGA GCCAATGGCG 480

CAACAAGTTA TTCCTAAAGC AGCATTTGGC TATATCGCTA GTGGGGCGGG AGATACTTTC 540

ACTTCTTTCC AGTGATTTTA GCGTCAGGTT CTTTTTAGTT TTTAAAGATT ATCCGTGAAT 600

TTCTTGCTTA TTTATGATAA AATGGGAGTG TCGCAAAAAA TGACTCATCG TATTCAATTT 660

TGAGT7LAAAC TAGGAGGATC CCATGTCTAC AGAACATATG GAAGAACTAA ATGACCAGCA 720

GATCGTTCGC CGTGAAAAAA TGGCTGCGCT CCGCGAACAA GGAATCGATC CTTTCGGAAA 780

ACGTTTTGAA CGTACTGCAA ATTCACAAGA ATTAAAAGAT AAATATGCCA ACCTCGATAA 840

AGAACAATTA CACGATAAAA ACGAAACAGC TACTATCGCA GGACGCTTGA TAACCAAACG 900

TGGTAAAGGA AAAGTTGGTT TTGCCCACCT TCAAGACCGC GAAGGCCAGA TTCAGATCTA 960

CGTTCGTAAG GATGCrGTCG GTGAAGTkAi'iA CTACGAAATC T-rCAAAAAAG CAGACCTTGG 1020

TGACTTCCTT GGTGTCGAAG GTGAAGTGAT GCGTACGGAT ATGGGAGAAC TCTCTATCAA 1080

GGCAACCCAC ATCACACACT TGTCTAAGGC TCTTCGTCCT CTTCCTGAGA AATTCCATGG 1140

TTTGACAGAC GTTGAAACAA TTTACCGTAA ACGTTACCTT GACTTGATTT CTAATCGTGA 1200

AAGCTTTGAA CGCTTTGTCA CTCGTTCAAA AATCATCTCT GAAATCCGTC GTTACCTTGA 1260

CCAAAAAGGA TTCCTTGAAG TGGAAACACC TGTTCTTCAT AATGAAGCCG GTGGTGCTGC 1320

TGCCCGTCCA TTTATCACCC ACCACAATGC CCAAAACATT GACATGGTGC TTCGTATCGC 1380

GACTGAGCTT CACTTAAAAC GCCTTATCGT GGGTGGTATG GAACGTGTCT ATGAAATTGG 1440

CCGTATCTTC CGTAACGAAG GAATGGACGC TACTCATAAC CCTGAGTTCA CTTCTATCGA 1500

AGTTTACCAA GCTTATGCAG ACTTCCAAGA CATCATGGAC TTGACTGAAG GCATTATCCA 1560

ACACGCTGCT AAATCAGTCA AAGGTGATGG CCCAGTCAAC TACCAAGGTA CTGAAATCAA 1620

GATTAACGAA CCATTTAAGC GTGTTCATAT GGTGGATGCT ATCAGAGAAA TTACTGGTGT 1680

CGATTTCTGG CAAGACATGA CTTTGGAAGA AGCTAAAGCT ATCGCTGCTG AGAAGAAAGT 1740

TCCAGTTGAG AAACACTACA CTGAGGTTGG TCACATCATC AATGCCTTCT TTGAAGAGTT 1800

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1 OTTGAAGAA 


ACTTTAATCC 


AACCAACCTT 


TGTCTATGGA 


CATCCAGTAG 


CTGTATCTCC 


1860 


AC-TCG CT AAG 


AAAAATCCTG 


AAGACCAACG 


CTTTACTGAC 


CGTTTCGAGC 


TCTTTATCAT 


1920 


GACTAAGGAG TACGGTAATG CCTTTACTGA GTTGAACGAC CCAATCGACC AACTTAGCCG 1980

TTTTGAAGCC CAAGCTAAAG CCAAAGAACT TGGTGATGAT GAAGCGACAG GAATCGACTA 2040

TGACTACATT GAAGCTCTTG AATACGGTAT GCCACCAACA GGTGGTTTGG GAATCGGTAT 2100

CGACCGTCTC TGCATGCTCC TCACTGATAC AACAACTATC CGTGATGTAT TGCTCTTCCC 2160

AACAATGAAA TAAATTCTTA TCCTCTGGGT CTTATCAGAG GATTTTTTGA TTCAAAAAGA 2220

GACTGAATTT AAGGAGAAAA TGAAGTGTAG TATATTGAAA TTGAAATAGT ACACTTTGAT 2280

TTCTAAGACA TTGTTAGAAA TTGGTTTAAA TTCCCTAAGC AATTTGTGCA TGTTTTATTT 2340

CATTTTACGA TAGTACGCTG AAACTTTTCA AAAAGTACTA GAAATTGACT TGGATTCCCC 2400

AATTGATTTG TTCAGATTCA CTATAAATAA AAAATTAATA AGTGGGATAG GAAGTTAGCG 2460

TCAACTAGGA TAGTATCTTG CTTAAACAGT ATATATGGGA TTGATATAAG TCCATAGGTC 2520

CTATTAGAGG ATGTTCTGGT GTCTTATTCA CTTGTTTTTT ATAGTATTAG TAGATAGAAT 2580

CAGCAAATAA AAACCCAAAT CATTCATACC TCTCTCAACT AGATGTAACT TACAAAACCC 2640

CTGACCTCAT GAGCCACTTT CTTCCTCCTC ATGAGGTCAG TTTTACTTTC TGCTGTTCCA 2700

GTATCGTTTT TCCTCGCTAG ATTTCCTCAA AAGGGCAGAC TCCTCCCTTG GTGCGTCACA 2760

CGATTTTTTC ATCTCGACTG TTCTTTAATG CATCATTAAC GACGCTTTTC TTCTAGGTCG 2820

TTCATAAGGA ACAGGAAGAT TCAGGTTGAC TTTTCTAATC CTAGAATAAA GTGCTGAAAA 2880

CAATTCGGAA TAGGCATAGA GACTAGACAA TTTGAGGAGC TGCTTGCGTC CTGTTCGAAC 2940

ACATTTTCCC ACCACGTGAA GAAAAAGATG GCGGAAGCGT TTGATTGTTA AAGTTTGGAA 3000

GTCACCTCCA GCTAGATGTT TGAGAAAAAG ATAGAGATTG TAGGCGATAC AGCTCATCAT 3060

CATACGAACT TCGTTTTTGA TTAAGGTTGA ACTATCCGTT TTATCGCCAA AAAATCCCTC 3120

CTTCATCTCC TTGATGAAAT TCTCGGCTTG ACCACGTCCA CGATAAAGCT GAAACTGGTC 3180

TTGGCTTGTT CCACTCGTCA TATTTGTAAC GAGAGAAATA ACATCGTAGA AC 3232



(2) INFORMATION FOR SEQ ID NO: 145: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10711 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 




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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 14 5: 
CCGGAGAAAA TGATGAAAAG TTCAAAACTA TTTGCCCTTG CGGGCGTGAC ATTATTGGCG 60
GCGACTACTT TAGCTGCATG CTCTGGATCA GGTTCAAGCA CTAAAGGTGA GAAGACATTC 120
TCATACATTT ATGAGACAGA CCCTGATAAC CTC/^CTATT TGACAACTGC TAAGGCTGCG 180
ACACAAATAT TACCAGTAAC GTGGTTGATG GTTTGCTAGA AAATGATCGC TACGGGAACT 240
TTGTGCCGTC TATGGCTGAG GATTGGTCTG TATCCAAGGA TGGATTGACT TACACTTATA 300
CTATCCGTAA GGATGCAAAA TGGTATACTT CTGAAGGTGA AGAATACGGG GCAGTCAAAG 360
CTCAAGACTT TGTAACAGGA TTAAAATATG CTGCTGATAA AAAATCAGAT GCTCTTTACC 420
TTGTTCAAGA ATCAATCAAA GGGTTGGATG CCTATGTAAA AGGGGAAATC AAAGATTTCT 480
CACAAGTAGG AATTAAGGCT CTGGATGAAC AGACAGTTCA GTACACTTTG AACAAACCAG 540
AAAGCTTCTG GAATTCTAAG ACAACCATGG GTGTGCTTGC GCCAGTTAAT GAAGAGTTTT 600
TGAATTCAAA AGGAGATGAT TTTGCCAAAG CTACGGATCC AAGTAGTCTC TTGTATAACG 660
GTCCTTATTT GTTGAAATCC ATTGTGACCA AATCCTCTGT TGAATTTGCG AAAAATCCGA 720
ACTACTGGGA TAAGGACAAT GTGCATGTTG ACAAAGTTAA ATTGTCATTC TGGGATGGTC 780
AAGATACCAG CAAACCTGCA GAAAACTTTA AAGATGGTAG CCTTACAGCA GCTCGTCTCT 840
atccaacaag tgcaagtttc gcagaacttg agaagagtat gaaggacaat attgtctata 900
ctcaacaaga ctctattacg tatctagttg gtacaaatat tgaccgtcag tcctataaat 960
acacatctaa gaccagcgac gaacaaaagg catcgactaa aaaggctctc ttaaacaagg 1020
ATTTCCGTCA GGCTATTGCC TTTGGATTTG ACCGTACAGC CTATGCCTCT CAGTTGAATG 1080
GACAAACTGG AGCAAGTAAA ATCTTGCGTA ATCTCTTTGT GCCACCAACA TTTGTTCAAG 1140
CAGATGGTAA AAACTTTGGC GATATGGTCA AAGAGAAATT GGTCACTTAT GGGGATGAAT 1200
GGAAGGATGT TAATCTTGCA GATTCTCAGG ATGGTCTTTA CAATCCAGAA AAAGCCAAGG 1260
CTGAATTTGC TAAAGCTAAA TCAGCCTTAC AAGCAGAAGG AGTCCAATTC CCAATTCATT 1320
TGGATATGCC AGTTGACCAA ACAGCAACTA CAAAAGTTCA GCGCGTCCAA TCTATGAAAC 1380
AATCCTTGGA AGCAACTTTA GGAGCTGATA ATGTCATTAT TGATATTCAA CAACTACAAA 1440
AAGACGAAGT AAACAATATT ACATATTTTG CTGAAAATGC TGCTGGCGAA GACTGGGATT 1500
TATCAGATAA TGTCGGTTGG GGTCCAGACT TTGCCGATCC ATCAACCTAC CTTGATATTA 1560
TCAAACCTTC TGTAGGAGAA AGTACTAAAA CATATTTAGG GTTTGACTCA GGGGAAGATA 1620
ATGTAGCTGC TAAAAAAGTA GGTCTATATG ACTACGAAAA ATTGGTTACT GAGGCTGGTG 1680
ATGAGACTAC AGATGTTGCT AAACGCTATG ATAAATACGC TGCAGCCCAA GCTTGGTTGA 1740
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CAGATAGTGC TTTGATTATT CCAACTACAT CTCGTACAGG GCGTCCAATC TTGTCTAAGA 1800 

TGGTACCATT TACAATACCA TTTGCATTGT CAGGAAATAA AGGTACAAGT GAACCAGTCT 1860 

TGTATAAATA CTTGGAACTT CAAGACAAGG CAGTCACTGT AGATGAATAC CAAAAAGCTC 1920 

AGGAAAAATG GATGAAAGAA AAAGAAGAGT CTAATAAAAA GGCTCAAGAA GATCTCGCAA 1980 

AACATGTGAA ATAACTGTTG CAAAATATAA GAAAGGATTT AGTATTTCCC TTGAATGCTG 2040 

AATCCTTTTT TACATTTGTA AAGAAAGATT CTAAAATGTA CGGACCCCCA AAAGTTGGAG 2100 

CCTCTTTTTG TCAGAATAGA GAAAATTTTT GTTAATTTTA CTTGTTTCCT ATTGCTTTCT 2160 

CAGCTATTAT TTGTTATATT AAAAGTATAA TTATTTTTTA TTTATCAGAG TTAAGCATTG 2220 

CACTTTCAGA GGAAGGAGTA TTTTTTAAAA AGAAAATGTA AACGTTTGCT CAAAAATGAA 2280

AGGATTTAGA AGTTTATGAA TAAAGGATTA TTTGAAAAAC GTTGTAAATA TAGTATTCGG 2340 

AAATTTTCAT TAGGTGTTGC TTCTGTTATG ATTGGAGCTG CATTCTTTGG GACAAGTCCG 2400 

GTTCTTGCAG ATAGCGTGCA GTCTGGTTCC ACGGCGAACT TACCAGCTGA TTTAGCTACT 2460

GCTCTTGCAA CAGCAAAAGA GAATGATGGG CGTGATTTTG AAGCGCCTAA GGTGGGAGAA 2520 

GACCAAGGTT CTCCAGAAGT TACAGATGGA CCTAAGACAG AAGAAGAACT ATTAGCACTT 2580 

GAAAAAGAAA AACCGGCTGA AGAAAAACCA AAAGAGGATA AACCTGCAGC TGCTAAACCT 2640 

GAAACACCTA AGACGGTAAC CCCTGAATGG CAAACGGTAG CGAATAAAGA RCAACAGGGA 2700

ACAGTCACTA TCCGAGAAGA AAAAGGTGTC CGCTACAACC AACTATCCTC AACTGCTCAA 2760

AATGATAACG CAGGCAAACC AGCCCTGTTT GAAAAGAAGG GCTTGACCGT TGATGCCAAT 2820 

GGAAATGCAA CTGTTGATTT AACCTTCAAA GATGATTCTG AAAAGGGCAA ATCACGCTTT 2880 

GGTGTCTTTT TGAAATTTAA AGATACCAAG AATAATGTTT TTGTCGGTTA TGACAAGGAT 2940 

GGCTGGTTCT GGGAGTATAA ATCTCCAACA ACTAGCACTT GGTATAGAGG TAGTCGTGTT 3000 

GCTGCTCCTG AAACAGGATC AACAAACCGT CTCTCTATCA CTCTCAAGTC AGACGGTCAG 3060 

CTAAATGCCA GCAATAATGA TGTCAATCTC TTTGACACAG TGACTCTACC AGCTGCGGTC 3120 

AATGACCATC TTAAAAATGA GAAGAAGATT CTTCTCAAGG CGGGCTCTTA TGACGATGAG 3180 

CGAACAGTTG TTAGCGTTAA AACGGATAAC CAAGAGGGGG TAAAAACAGA GGATACCCCT 3240 

GCTGAAAAAG AAACAGGTCC TGAAGTTGAT GATAGCAAGG TGACTTATGA CACGATTCAG 3300 

TCTAAGGTCC TCAAAGCAGT GATTGACCAA GCCTTCCCTC GTGTCAAGGA ATACAGCTTG 3360 

AACGGGCATA CTTTGCCAGG ACAGGTGCAA CAGTTCAACC AAGTCTTTAT CAATAACCAC 3420 

CGAATCACCC CTGAAGTCAC TTATAAGAAA ATCAATGAGA CAACAGCAGA GTACTTGATG 3480 
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AAGCTTCGCG ATGATGCTCA CTTAATCAAT GCGGAAATGA CAGTACGCTT GCAAGTTGTA 3540 

GACAATCAAT TGCACTTTGA TGTGACTAAG ATTGTCAACC ACAATCAAGT CACTCCAGGT 3600 

CAAAAGATTG ATGACGAAAG CAAACTACTT TCTTCTATTA GTTTCCTCGG CAATGCTTTA 3660 

GTCTCTGTTT CTAGTAATCA AACTGGTGCT AAGTTTGATG GGGCAACCAT GTCAAACAAT 3720 

ACGCATGTCA GCGGAGATGA TCATATCGAT GTAACCAATC CAATGAAGGA TTTGGCTAAG 3780

GGTTACATGT ATGGATTTGT TTCTACAGAT AAGCTTGCTG CTGGTGTTTG GAGTAACTCT 3840 

CAAAACAGCT ATGGTGGTGG TTCGAATGAC TGGACTCGTT TGACAGCTTA TAAAGAAACA 3900

GTCGGAAATG CCAACTATGT AGGAATCCAC AGCTCTGAAT GGCAATGGGA AAAAGCTTAT 3960 

AAGGGCATTG TTTTCCCAGA ATACACGAAG GAACTTCCAA GTGCTAAGGT TGTTATCACT 4020 

GAAGATGCCA ATGCAGACAA GAACGTTGAT TGGCAAGATG GTGCCATTGC TTATCGTAGC 4080

ATTATGAACA ATCCTCAAGG TTGGGAAAAA GTTAAGGATA TGACAGCTTA CCGTATCGCG 4140 

ATGAACTTTG GTTCTCAAGC ACAAAACCCA TTCCTTATGA CCTTGGATGG TATCAAGAAA 4200 

ATCAATCTCC ATACAGATGG TCTTGGGCAA GGTGTTCTCC TTAAAGGATA TGGTAGCGAA 4260

GGCCATGACT CTGGTCACTT GAACTATGCT GATATTGGTA AGCGTATCGG TGGTGTCGAA 4320 

GACTTCAAGA CCCTAATTGA GAAGGCTAAG AAATATGGAG CTCATCTAGG TATCCACGTT 4380 

AACGCTTCAG AAACTTATCC TGAGTCTAAA TACTTCAATG AAAAAATTCT CCGTAAGAAT 4440 

CCAGAtGGaA GCTATAGCTA TGGTTGGAAC TGGCTAGATC AAGGTATCAA CATTGATGCT 4500 

GCCTATGACC TAGCTCATGG TCGTTTGGCA CGTTGGGAAG ATTTGAAGAA AAAACTTGCT 4560 

GACGGTCTCG ACTTTATCTA TGTGGACGTT TGGGGTAATG GTCAATCAGG TGATAACGGT 4620 

GCCTGGGCTA CCCACGTTCT TGCTAAAGAA ATTAACAAAC AAGGCTGGCG CTTTGCGATC 4680 

GAGTGGGGCC ATGGTGGTGA GTACGACTCT ACCTTCCATC ACTGGGCAGC TGACTTGACC 4740 

TACGGTGGCT ACACCAATAA AGGTATCAAC AGTGCCATCA CCCGCTTTAT CCGTAACCAC 4800 

CAAAAAGATG CTTGGGTAGG GGACTACAGA AGTTATGGTG GTGCAGCCAA CTATCCACTG 4860 

CTAGGTGGCT ACAGCATGAA AGACTTTGAA GGCTGGCAGG GAAGAAGTGA CTACAATGGC 4920 

TATGTAACCA ACTTATTTGC CCATGACGTC ATGACTAAGT ACTTCCAACA CTTCACTGTA 4980 

AGTAAATGGG AAAATGGTAC ACCGGTGACT ATGACCGATA ACGGTAGCAC CTATAAATGG 5040 

ACTCCAGAAA TGCGAGTGGA ATTGGTAGAT GCTGACAATA ATAAAGTAGT TGTAACTCGT 5100 

AAGTCAAATG ATGTCAATAG TCCACAATAT CGCGAACGTA CAGTAACGCT CAACGGACGT 5160 

GTCATCCAAG ATGGTTCAGC TTACTTGACT CCTTGGAACT GGGATGCAAA TGGTAAGAAA 5220 

CTTTCTACTG ATAAGGAAAA GATGTACTAC TTCAATACGC AGGCCGGTGC AACAACTTGG 5280
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ACCCTTCCAA GCGATTGGGC AAAGAGCAAG GTTTACCTTT ACAAGCTAAC TGACCAAGGT 5340 

AAGACAGAAG AGCAAGAACT AACTGTAAAA GATGGTAAAA TTACCCTAGA TCTTCTAGCA 5400

AATCAACCAT ACGTTCTCTA TCGTTCGAAA CAAACTAATC CTGAAATGTC ATGGAGTGAA 5460

GGCATGCACA TCTATGACCA AGGATTTAAT AGCGGTACCT TGAAACATTG GACCATTTCA 5520 

GGCGATGCTT CTAAGGCAGA AATTGTCAAG TCTCAAGGGG CAAACGATAT GCTTCGTATT 5580 

CAAGGAAACA AAGAAAAAGT TAGTCTCACT CAGAAATTAA CTGGCTTGAA ACCAAATACC 5640 

AAGTATGCCG TTTATGTTGG TGTAGATAAC CGTAGTAATG CCAAGGCAAG TATCACTGTG 5700 

AATACTGGTG AAAAAGAAGT GACTACTTAT ACCAATAAGT CTCTCGCGCT CAACTATGTT 5760 

AAGGCCTACG CCCACAATAC ACGTCGTGAC AATGCTACAG TTGACGATAC AAGTTACTTC 5820 

CAAAACATGT ACGCCTTCTT TACAACTGGA GCGGACGTCT CAAATGTTAC TCTGACATTG 5880 

AGTCGTGAAG CTGGTGATCA AGCAACTTAC TTTGATGAAA TTCGTACCTT TGAAAACAAT 5940 

TCAAGCATGT ACGGAGACAA GCATGATACA GGTAAAGGCA CCTTCAAGCA AGACTTTGAA 6000 

AATGTTGCTC AGGGTATCTT CCCATTTGTA GTGGGTGGTG TCGAAGGTGT TGAAGATAAC 6060 

CGCACTCACT TGTCTGAAAA ACACAATCCA TATACACAAC GTGGTTGGAA TGGTAAGAAA 6120 

GTCGATGATG TTATCGAAGG AAATTGGTCA CTCAAGACAA ATGGACTAGT GAGCCGTCGT 6180 

AACTTGGTTT ACCAAACCAT CCCACAAAAC TTCCGTTTTG AAGCAGGTAA GACCTACCGT 6240

GTAACCTTTG AATACGAAGC AGGATCAGAC AATACCTATG CTTTTGTAGT CGGTAAGGGA 6300 

GAATTCCAGT CAGGTCGTCG TGGTACTCAA GCAAGCAACT TGGAAATGCA TGAATTGCCA 6360 

AATACTTGGA CAGATTCTAA GAAAGCCAAG AAGGCAACCT TCCTTGTGAC AGGTGCAGAA 6420 

ACAGGCGATA CTTGGGTAGG TATCTACTCA ACTGGAAATG CAAGTAATAC TCGTGGTGAT 6480 

TCTGGTGGAA ATGCCAACTT CCGTGGTTAT AACGACTTCA TGATGGATAA TCTTCAAATC 6540 

GAAGAAATTA CCCTAACAGG TAAGATGTTG ACAGAAAATG CTCTGAAGAA CTACTTGCCA 6600 

ACGGTTGCCA TGACTAACTA CACCAAAGAG TCTATGGATG CTTTGAAAGA GGCGGTCTTT 6660 

AACCTCAGTC AGGCCGATGA TGATATCAGT GTGGAAGAAG CGCGTGCAGA GATTGCCAAG 6720 

ATTGAAGCTT TGAAGAATGC TTTGGTTCAG AAGAAGACGG CTTTGGTAGC AGATGACTTT 6780 

GCAAGTCTTA CAGCTCCTGC TCAGGCTCAA GAAGGTCTTG CAAATGCCTT TGATGGCAAT 6840 

GTGTCTAGTC TATGGCATAC ATCTTGGAAT GGTGGAGATG TAGGCAAGCC TGCAACTATG 6900 

GTCTTGAAAG AACCAACTGA AATCACAGGA CTTCGCTATG TTCCGCGTGG ATCAGGTTCA 6960 

AATGGTAACT TGCGAGATGT GAAACTTGTT GTGACAGATG AGTCTGGCAA GGAGCATACC 7020 
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TTTACTGCAA 


CTGATTGGCC 


AAATAACAAC 


AAACCAAAAG 


ATATTGACTT 


TGGTAAGACA 


7080 


ATCAAGGCTA AGAAAATTGT CCTTACTGGT ACCAAGACAT ACGGAGATGG TGGAGATAAA 7140

TACCAATCTG CAGCGGAACT TATCTTTACT CGTCCACAGG TAGCAGAAAC ACCTCTTGAC 7200

TTGTCAGGCT ATGAAGCAGC TTTGGTTAAG GCTCAGAAAT TAACAGACAA AGACAATCAA 7260

GAGGAAGTAG CTAGCGTTCA GGCAAGCATG AAATATGCGA CGGATAACCA TCTCTTGACG 7320

GAAAGAATGG TGGAATACTT TGCAGATTAT CTCAACCAAT TAAAAGATTC TGCTACGAAA 7380

CCAGATGCTC CAACTGTAGA GAAACCTGAG TTTAAACTTA GATCTTTAGC TTCCGAGCAA 7440

GGTAAGACGC CAGATTATAA GCAAGAAATA GCTAGACCAG AAACACCTGA ACAAATCTTG 7500

CCAGCAACAG GTGAGAGTCA ATCTGACACA GCCCTCATCC TAGCAAGTGT TAGTCTAGCC 7560

CTATCTGCTC TCTTTGTAGT AAAAACGAAG AAAGACTAGT ATTTAGTAAA ACCTCTTAAC 7620

AAGATTACGG AAGCAGTCTC TATCTTTTCC AATGAGGTTT ATAGTACAGA AAAAGCCTGA 7680

GAAGATGTCT TCTCAGGCTT TTGTTAAGCA CATAAATACA ATAGTGCTAT GACAAAATCA 7740

CCCAGAAAAA TCTGGGTGAT AAATGTTATG GTTGTGCTGG TTGAGGATTC TGATTTTGTT 7800

GATCAGGGGT TGTATTTGAT TGTTGCGTAT TATTGTTAGG ATTGGTAGTC GTACTATTAT 7860

TTGTGCTTGG AGTGGTTGAG CTAGACTGTG AAGTTGAACT ATCTGATGAT GAGCTTGAAC 7920

TTTCAGTTGA TGGGGGTTGT TGTGGAGCAG GTGAGTTCCA CGTAGAACGA GCACCATTTT 7980


TAAATACGAA 


TTCTCCATTT 




CCrCTGGTAT 


ATiLL'AATCT 


TCTGGATTGC 


8040 


TTCCTTCAGA CAGGTAGGTC ATCATAGAGC GGTAAACTTT GGCAGCGACC GTAAGGCCAT 8100

TGCCTACAAG TGGTGTCAGA CGGTTAGAAT AGCCTGTCCA TACAGCCATT GAATATTTAC 8160

GCGTATAGCC AGCAAATAGT TCATCAGGTG CTACAAATTG AGAGGTCTTG ATGTGGTTTT 8220

CAATTTCCTC GTCTGTATAG TTAGAGGTTC CTGTTTTACC AGCCTGAGGG AGCCAAGCAA 8280

GATAGGCATT TCGTCCAGTT CCATAAGTCA AGACTGTTTT CATCATGTCG GTCATCATAT 8340

AGGCTGTCGT TTCCTTCATG GCACGAGTTC CGACATTAGA GAACTCTTTT TCACTCCCAT 8400

CACTAAAGAC GACTTTATGG ATATACATTG GTTTATAGTA AGTTCCACCA TTTGCAAAGG 8460

CAGCGTAAGC AGCAGCCATC TTTTCACTAC TTGCTCCATA TTTPTTGTCT GATTCGGTTG 8520

TGTTACTTGA AATGGCATTT GAGTAGTGAA TACTTGGGTA GTCGATTCCT AGACCATTTA 8580

GGAAAGTCTT GGCGCGGTTG AGTCCGACCT TGTTTAGAGT TTCCACGGCT GGGACGTTTC 8640

GCGATTGTTG CAGGGCGTAT TGCAAGGTGA TGTTGCCAAA GTAGCCCCTA TCCCAGTTAT 8700

AAACAGGAGT ATTTGTCCCA GGGTAGTTAT AGGGCTCATC GTGAACGATA GTAGCAGTTG 8760

AATCGTAGAC ACCGTACTCC AAGGCAGGAG CATAGTCTGT GATCGGTTTC ATAGTTGATC 8820
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CCCAGTCGCG GTTTGTTTCT ACTGCTTGGT TAATTCCGAA GGAAACATTA CTTGACTGAT 
GGCGTGCTCC TAGCTGGGCA ATGACTTTAC CGTTAGAAAC ATCAACAATG GTAGAAGCGA 
CTTGCAATTC ATCGTCTGGA TAGGCAACGT ATTCGTCTGT ATTGTAAATA TCCCACAGAT 
GTTTTTGAGC TTCTTGGTCT ACATTTGTGT AGACATCCAT CCCAGTTGTG AGTAGGTTAT 
AGCCTGTTTC TTCTTCAACT TGATTGATGA CTTCCTTGAG GTAATTATCC ATGTAAGCAG 
GGTAATTACT TGCTGATTTG AGACTTTGTA GTCCATCAGT AATTGGTGTA TTGACTGCTT 
TCTCATACTG TTCAGCAGAG ATGTAGCCTT GATTTTTCAT TTCAGATAAG ACCAAGTTTC 
GGCGGTCTTG GGCTGCTTCT GGATGTGAAT AGGGGTCATA TTGGTTTGGT GCCTGAGGCA 
TTCCAGCCAG CAAGGCTAAC TGAGGTAAAC TTAAATTATT GAGGTCTTTA CCATAGTAGT 
TTTGAGCTGC TGTCTGCATT CCATAGTTCC CATTAGACAT GTAGACCTTA TTTATATAGT 
AGGTCAAGAT TTCTTGCTTG GTTGCTTTTT GTTCTAACTG AATCGCTAAC CAAGCTTCCT 
GAGCCTTACG AGAAATAGTC TGGTCGGAAG TCGAAGTTGA AAAGTAAGTC AACTTAATCA 
ACTGTTGGGT GAGAGTTGAT CCACCTTGGA GGGAATTGCT TTGCAGATTG CGCAAGAAAG 
CTCCCAGGAT ACGGATGGTA TCAATCCCCC TGTGGTCGAA GAAGCGATGG TCTTCGATAG 
AAACGATTGC CTTAACCAAA TCTGTGGGAA TATCATTAGC TTGGGCATTG ACGCGGCGTT 
CAGAACCCAA GTCAGCAATG AGTTGATTTT TATTGTCGTA GATTTTACTA GAAGTTGTTG 
CAACTAGTTT ACTCTCGGAT AGGCTAGGAG CCTTGCTAAC GTAGTAGAAA AAAACTCCTC 
CGCCTAAGAC AATGGCTGCG ATAACCAAGC TTAAGAAGCT AATGCTCAGA TACTTGATTA 
GGCGCAGAAT CGTTGGTTTG TTCATCTTGT TTTACCACCT AATAAATGTT CTTTGATAAC 
ATTGAGATAA GGAATTTGAG GGAAGGCACC AGCCTTGATT TCATATCCAT ATTCTCGAAT 
ATATTCAAGT GGCATTGATT TTTGTCCCTT ATCTTGATGA TAGAAGCGAA TCAAATCGAA 
TGCCGGCAAT AAGTAGGTTT CTTGCTGAGA AGAAAAGTGA AGAAGGACAA AGCAGATTCC 
TTGTTGGGCA AGGACTTGTT CCATATGCTG AATCTGATGT GGATGAAAAT TTTTCATCGG 
AATCGCACGT TTTTGTTTTG TTTCCTTGAC TTCAAAGTCG ATGTAATATC CATTATAAAC 
GCCAGAATAG TCCGTCGTTG AAGCTTGTCG AAAATAGGCT TCAACAATCT TGGCACGACT 
TCGTTGTGGA TAGTCCACTT GTACGATTTG AATAGGAGTT GGTTTCTTAT GTATAACAGC 
CAAGCCCTGA GACAAATAGT AGTCGTTGGT AGCATTGATC ATCTTTTCAA AGGGTACCGA 
GCTCGAATTC GTAATCATGT CATAGCTGTT TCCTGTGTGA AATTGTTATC CGCTCACAAT 
TCCACACAAC ATACGAGCCG GAAGCATAAA GTGTAAAGCC TGGGGTGCCT AATGAGTGAG 



8880 
8940 
9000 
9060 
9120 
9180 
9240 
9300 
9360 
9420 
9480 
9540 
9600 
9660 
9720 
9780 * 
9840 
9900 
9960 
10020 
10080 
10140 
10200 
10260 
10320 
10380. 
10440 
10500 
10560 
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CTAACTCACA TTAATTGCGT TGCGCTCACT GCCCGCTTTC CAGTCGGGAA ACCTGTCGTG 10620 
CCAGCTGCAT TAATGAATCG GCCAACGCGC GGGGAGAGGC GGTTTGCGTA TTGGGCGCTC 10680 
TTCCGCTTCC TCGCTCACTG ACTCGCTGCG C 10711 
(2) INFORMATION FOR SEQ ID NO: 146: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11887 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146:



TACATTCATT CCATCGGCTA CTCCATAATA CTTAGATAAA ACCATAGCTG AAGTCGAATA 60
CGGATACTGT AAAGTATTAT CAATTTTAAT CAAATCATCA TTACCGATAA TACTTCTGAT 120
TGCTTTTGGT AGTATGAACC ATACGTTGGT GAAATCTCAG ATAATGAAGA ATCATTAGAC 180
TCTGGACCTT TTTCTAGTGT CTCACTTACC TCATATTCTT CACCCTTACT 240
CTCAAAGCAG ATACTGTCGA TAACTGGCTA GCCAATAAAG TACTCGCAAT AATTGAAATA 300
CCCAATTTTT TATAAACAGT TTTCTTCATT ATTGTATCCT CCTAATGTAA TTATAGCGTA 360
CTATTCTAAA TTTCTTAATC TACTATAGAA TCAAGAAATC TACCACCTTC TTTAAATACC 420
CTCCATTATC ACATAAACAG GTAAACTTTT CAATTAATGA CTGCGCTTTT CAATCACGCT 480
AGAGGTACTT GCTTGCTTCT TTGATACTAA GTTCAGCCAT TCTTTCCTTG TTTTTCTCAA 540
TAAAGCATGT TACCCAAGTG GGATTCGTTT TGGAGTAGTC TCGCAGAGTC CAGCCAATGG 600
CTTTATTGAT AAAAAATTCT GTTTGGTTCA AGTTATGAAG GAGAATCTTT TCCATTAATT 660
GAGTATTGGT CTTCTCTTTT CTTAACAACT GGTGGTCAAT AGCGACACGT CTCAGCCAGA 720
TATTATCTGA TAGGCTCCAT TTTATACTCA ATGAAAATCA AAGAGCAAAC TAGGAAGCTA 780
GCCGCAGTTG CTCAAAACAC TGTTTTGAGG TTGCAGATAG AGCTGACGTG GTTTGAAGAG 840
ATTTTCGAAG AGTATTAAGA TTATTTCTTC TAGTTCAGGG TGTTCATACA CCAAACTCCC 900
TACTACTpGA TCTAGGATAT CTACCGTGTC CCACAAGGAT TTTGTCACGA CTAACTGCTC 960
TAGCTTAGGC AAATCGGTTT CCTTTAGATA AGACTGCATT GCTTTCAAAT AGTTAGCAGC 1020
CACATATTGG TATTTTCTAG GATCCTTTTC CCAGCAAGTG TCTGCAAAAT CCCAATCGAT 1080
AATCTTTGTT TTTTTCGCTT CTGGAAAATA TTTTATAGAG TTTATTTCTT TCAGGCACCG 1140
CAATACCTAG AAAAGAAAAT TGATGGCGCA TATAGGCTTC CATGGACCTT GCTTTTTTAG 1200
AGTCTTTTGC TGCTTCTAGC TCCTCAAGTA AATCTGCTAA ACTCATCTAA AACTCCTCTT 1260
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GCCCCACCAA ATGGTGCTGA AAGGCATAGA CAGCCGCCTG GGTACGATCG CTGACTTCAA 1320 

GTTTGGCAAG AATATTGGAC ACGTGGGTCT TGACCGTCTT GAGAGAGATA AAGAGGTCAT 1380 

CTGCGATGCG CTGATTTTCG TAGCCCTTGG CGATGAGTTG GAGAACATCT CGCTCACGCG 1440 

CAGTCAATTC TTCATGAAGT TCCATATGAT TGCGGTGGTA TTCAACCTTC TTGCTAACCT 1500 

CTTGCTCAAT GGCCAGCTCG CCAGCAGCTA CCTTACTGAC GGCATGAAGC AATTCATCTG 1560

CACTAGAAGT CTTGAGCATA TAGCCTTTGG CACCAGCATC TAAGACTGGC ATGATTTTTT 1620 

CATTGTCCAA ATAAGAGGTC ACAATCAAAA TCTTGGCTTC AGGCCATTCT TTAAGGATTG 1680 

CTAAGGTCGC GTCAATCCCA TTCATCTCAG GCATGACAAT ATCCATGACA ATGACATCTG 1740 

GACGCAGTTC CAAGGCCAAG TCAATCCCTT GAGACCCGTT GGACGCCTCA CCCACAACTT 1800 

CTACATCGTC TTGGAGGTCA AAGTAGCTTT TCAAGCCCAA TCGGACCATT TCATGGTCAT 1860 

CTACTAGTAA AATTTTCATC TTTACTCCTT TATCATTCCT TATCTAACAG GGGAATACGG 1920 

ATATCAACCG CCAGCCCTTG CTTGGGAGCT GTCAAGAGTT GAACTGTTCC AGCCATATCT 1980 

TCAACCCGCT CCTTGATATT TCGCAGTCCA TAACTCAAGT CGTCTAAGCT CCCTAACTGG 2040

AAACCAATCC CATTGTCCAC CACCTTCAGT TGCAATTCAA CATCTGTCTG ATAGAGGTAG 2100 

ACATCTAGGC AAGATGCCTG GGCATGGCGG AGGGTATTGC TAATCAACTC TTGCAGGATA 2160 

CGGAAGATAT GCTCCTCGAT TTTCTTAGGC AATTTCGTCA TATTCTGCTT GAGACTAACC 2220

CTAAGATCAC TCTTGTCCTC AAGCTCTTTT AAAAGAATTT GAATCCCTTC TATCAAGCTC 2280 

TTCTGCTCCA GTTCAACTGG TCGCAAATGC AAGAGCAAAA CCCGCAAATC CTTCTGGGCT 2340 

GTTTCTAAAA TAGCTGTGAC ACTCTGCAAC TGGGTCTGCA TCTTTTCTCT ATCCAATTTC 2400 

AAAGCCTGCT GACTGATACC CGATAAAATC ATGTGGGCCG CAAACAACTC CTGACTGACT 2460

GTATCGTGCA AATCCCGAGC AATTCGCTTC CGTTCCTTCT CGATGATTTC CTCTTCCTGA 2520

GCAAGGCTCT GATTTTCAGC TTTTTGAAGA GCCTCTGTCA AAAGGTTAAG TTTACCTGAT 2580 

AAGGACTTGA AACTGGCATC CAAATCTGGA TCTGCAACCT GAACCACTTC TTGCCCTGCT 2640 

AATAAACGCT TGAGATTAGC CTGCATTTTT CTTAGAGAAA GCTCTTCGAT CCCTCGCCAA 2700 

AACAGGGCTA AGAGACAGGT CATGGACATG CTGAAAACCA ACAATAAAAA GACAAATTTT 2760 

TCTGTTTTTT CGACATCGTG CAAAAAGATA GACCAGTCAA AATCAAGTAT TTCCAGCAAG 2820 

CTGTGGGAGA AAAAAAAGAC AAATAGGAAG GAGGTGAGAG CAATAATGAC ATAGGCTTGT 2880 

TTTTTCATCC TCTAACCACC TCCACATCAC CAATCATAGT GGTCAAGAAA ATCTTGACAC 2940 

TCTTGTTACT CTTGAGATAG TCTTTTGTTT CTTGATGATA GTGTTCATTG CGGAGGGCTC 3000 




WO 98/18931 PCT/US97/19588




GCTTGGGCTG GTTGAAAAAA ATCAAATCCC CATAGAGACA GTTAACGCTG AGACTGACTT 3060 

CCACATCTAC AGGTACGATG ATTTTGGTCG TTCCTACCAT CTTTCTGAGG ATAATGACAT 3120 

TGTCATGATT GGTTAAGATG ACCCTCTCCA GATGAATAGT GTCCTTGCCC ATGAAGCGAA 3180 

AGAGATTGAT ATCATCGAAT TGGCAAGTCT GGTAGCTTGA AAAATGATGA AGATTTCCAA 3240 

ACCAACGATT TTTCTCCTTC TTAACCGTCA CGACCTCTTC AAAAACCAAA TTGGTCTGCT 3300 

CTTTTTCCTG GTTCATCATC GGGTAAAGAA GAAAGAGGCT ATAGATAACC GCAACAAAAA 3360 

TAGCTAGAAT CACAAAAGGA TTGAGCATAA CGATGAAAAA GAAGAGAATG GTTGCCGCTA 3420 

CTAAAAGAAG ATTATTTCCC TCTTTACCAG TGTAGTAGCG AATCAAAAGC AAAAAGAGGA 3480 

ATAGTATCAG CAGAAAACGC GAAAAATGCT CTGATACCAT CAAAATCAGA GCTCCTGTCA 3540

GAAGACAGGC TTCGATAAAT AAAAAGATTT TAAATTTTCT CATAGGTTCA TCCTCTCCCT 3600 

TCTATTTTAT CACAATTCAA AAAAGTCACC TCAGTCTGAG GATGGAAAAA AGGCGCTGGT 3660 

TACGCCTTTT TCATCTGATC CTTTGCTTCT TTTAATTTTC CATAAAGAAG ATAGTCTACT 3720 

TTTTGTAGAT CTGCTATGGT GGCACAGTTA AGGGAACACA TAATCAAGCG TAGATCTGCT 3780 

TTCCAGCCTT GGACAATGCC AATCACTTCT TCAACTGTGT AGGTTTCAAC CAATTCCAGA 3840 

ACGGTTCGTG ACAATCCCAC AGCCTTAGCA CCAAAAACCA AGCACTTAAT CATATCCAGC 3900 

GGATTCCGAA CCCCTCCACT AACCAAGAGT TCGACCTTAT CTTTCCATTC TTGGGCATTG 3960 

AGAAGGGCCT GCATGGTAGA CTGACCCCAT TGATTGAGGT AATCACGCTG GCCACTACGA 4020

CGGTTTTCGA TATAGGCAAA GCTGGTGCCA CCACGACCCG ATAGGTCCAC TGTACGAACA 4080 

CCGAATTCAT AGGCTCTTTC GATTGTCTTG GCATCCATTC CAAAGCCCAC TTCCTTGAGG 4140 

ACAATAGGAA CGGGAATTTG CTTGCTATAA TCTGCTAGAT GCGATTGCCA GCTTCTAAAC 4200 

TTCCTTTCTC CCTCGGGCAT GAGTAATTCC TGCATGACAT TGACATGCAC TTGCAATAGA 4260 

ACAGGATTCA TCTCTTCTAC AGTCTGAAGT CCTAACTCGA CAGGCTTGTC CAATCCAATA 4320 

TTGGTTCCT^ GGAGGAGATT GGGATGACTA GACTTGACAG AAAAAGAATC ATCCGTTGGA 4380 

TTTTTGAGGG CTGCGCTATA AGAACCCGTT ACAAATAAAA TACCACAGGA TTCCGCCACC 4440 

TGAGCCAGCT TTTGATTGAT TTCTCTTCCC TTATTACTTC CACCAGTCAT GGCATTGATA 4500 

TAAAAAGGAA AGTCCCACTT TCGACCAGCA AACTCTGTCG AAAGATCGAT TTCATCCAGA 4560 

TTGTAAAGAG GCAAGGAAGA ATGAATCAGC TCCACCTCAT CAAAGCTATT ATAGGAACTT 4620 

TTCTGCTCAA GGGCATAGAG GATATGCTCG TCCTTACGAT TTGTCGTCAT GTCCTATCCT 4680

TTCTTGATAT AAGAGCTCAA TCCCCAGATC GGCCCAACGA TTTTTTAAGG TTTTGGTTGA 4740 

TTGCGCATCA AAACTCAGGG CGATGCCACA GTCACCACCA CCAGCACCAC TACTCTTGGC 4800 
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AACGGTCTGC AAATCTTGAC TGGCTTCTTT CAACTGTCTA AGCAAAGGCG TGTAAATATC 4 860 

TGTACTCAAG CCTTCTAAAA GCTTGCTGGC TACTTCTACT TGATCGATAA TCTTTTCTGA 4 920 

TTTCCCCTGT TCCAAGGCTT CTACCAGAGA AGTCACCGTT TCTTTTGAGG AAGTTAAAAA 4 980 

ATTTTGATTG ATATTTTGCT TGATTTGCTG GACCATGTGA CTCGATACAG CCACTTCCTT 5040 

GGTCCATCCC ACTAAGAAAT CACATTCTAA AGTTGGTTTC ACTTGTGAAA TTGAAAAGCC 5100 

CCAATCACGC TCCAGAACTG TCGCCAAGTT TTCTTCTTCT AACCAAGCAG CCACCTTCTG 5160 

GCGATCAAAT GACTGGTAGA GAACCAAATC CTCTGCCACA ATACAGGCAA GGTCGCCCAT 5220 

GGAACCATTG TCTCCTCGCT TAAGCAAGAC AGCGCTAGTC AGCTTGAACA AGAGCTCCTG 5280

ATCAACAGAA ACATCATACA GAGCCAGTAA AGCCTTGACA ACCAAGACAA CGACGCTGCC 5340

ACTAGAACCT AGACCAAACT TTTTCCCTTC TCGTTCCATT TTGCCACAGA TTTCTAGAGA 5400 

AAAAGGTCTT AAATTCTGAC CACGAACAGC GAGGAAGTCT CCCATCAAAG CAATCGTTTC 5460

TTGAATCAAG CTATAGTCAG GATTAGGCCT TAAGTCCACT GCGAAATCAA ACATATCTGA 5520 

ATAGATACGG TAGCTGTCAG AAAAAGCAAT CTCAGCCCTC ATATAGATGG GAATATCCTT 5580 

TATCAAAGCT AACTGCCCTG GCTCTAAAAT AGCATATTCA CCTGCCCAAT AGAGTTTTCC 5640

GCAAGTTTTA ACAGCAATCA TCTTGACTCA AATCCTTTGT TTTTGACACA ATCAAGCGAT 5700 

AACGATGACC GAAAATTTCT GATAAATGCT CCAAGTCTTT CTCCTGACAG AAGACCTTAA 5760 

CATTGGGal-C AGCATCCATG GTAAAGTAGC AGGCCTCTCC TTTCTCACGA AGCTGGCGAA 5820 

CAAAGGCCAT AGCCTCATAA GAGGCATCCG TCAGATAAGA AAAGGCTGGA CTAGCAGTCT 5880

TTGTCGTAGC ATGCATAGCC AGGGCATTTT TCTCCGTTAA TTCTCCAATC TTGGCAAAAT 5940

CATTTTCCTT GAGATAAATC AGCATATCCT GATAGTCCTT CTCAGACTGA CGAACCCAGT 6000 

CGTCGAAAGT CGTCGAGGTT TCCACACAAA GTTTCATCCC GTCACGGCTA GAGATTGGTT 6060 

TTTTCTTGTC CTCTAGCACC AACATAATCA TAGCTAGTTT CAAGTCTGTC TCTACAGGGT 6120 

AAATTTCTCC ACTATCCTTA TCCCAGGCTC CTAGTGGTCC ATAAAAACTC CGAGAAGAAG 6180 

AACCTGAGGC AAATTTGGCT TCCTGTGCCA ACTGACTTCT ATCCAATCCA AGCTTGAAAT 6240 

AAGCATTACA AGCCTTGACC AGGGCGGACA AACCACTAGA ACTTGAGGAC AGACCCGCTG 6300 

CCGTAGGCAT ATTGTTTTGA GTATCGATAC GGACAAAGCC CTCACCAGCT GGACGATAAC 6360 

GGTCAATAAT CTTACTCATC TTGGCATGCT CGACCTCATT TTGTAGCTGA CCATTGATGT 6420 

AAAATTCGTC AGCTGTTACA TTGGCTGGTA AAGGCGACAA GGTCGTCTCT GTATACATAT 6480

TTTCCAAAGT TAGAGAAATA CTGCTAGTAG CAGGCACCAT CTCTTTTTCT TTTl^CTTTC 6540 
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CCCAATATTT GATAATAGCA ATATTTGCGT AGGAACGTAC TGTTACAGGC TCTCTATCCA 6600 
TGTCTGAACA GCTCCTTTCT CTTCTAATCT TTCTGCTAGT TCTTGTGCGT GTGTCAAATT 6660
GGTTACCAAG GCTATGATAC AACCTCCTAG CCCACCACCG CTCATCTTGG CACCCAGAGC 6720 
ACCATGGCTA AGAGTCGTTT CAACCAAAAA GTCTGCCTCA GGGCTACTGA CTCCAATTTC 6780 
TTTTAAATGT AAATGCGCTT GACTGAGGAT TTGTCCCAGT CCTTCAGCAT CTTTTTGTGA 6840 
AATCGCAACT TCTGCTTGCT GGGTTAATTC TCCCAAGGCA TGCAAAAACG GTAGGGCATC 6900 
CTTGCCCTTA TTTTGAACCA CTTGGATGGC TTCACGAGTA TGACCATAAA CACCCGTATC 6960 
GGCAATCACC AAATAGGCGG ATAAATCCAT CTCAAGTTCT GTAAATCCTA CGTTCTTGAT 7020 
AAAGCGAATA GGTTGGTCAC TAAGACAGGT CTTAGCATCC AAACCACTAG GATTCATATG 7080 
GGCAATCATT TCAGCTCGAT TGACCAAGAT TTCTAGTACA TCATGAGGCA GATCAGCCTG 7140

ATAGTAGTCA AATACTGCAC GAATGGCCGC TATGCTGATA GCCGCTGACG AACCCATCCC 7200 
CCGTTTCTCA GGGATAGCCG AGTCAATCTC ACAACGAATG CAGGCTTCTG TGATATTCAA 7260 
ATACTCCAGT GAGGCATAAA CCGCCATGGA CAAGGTATCC TCCTCATAAA GGCGCCAAGG 7320 
ACTCTCTGCA GGAACTACCT TACAGGTCAC CTCCACCTCC AAAAGAGGCA GGGAAATGGC 7380 
AGGATAACCG TAAACGACCG CATGTTCCCC TATTAAAATT ATCTTACTAT GTGCCTGACC 7440 

GACACCAACT TTTTTTGTCA TTTTTTCCTT TTACTAGACG AAAAAACGTC TTATTTTTCA 7500 

TACAAGTATT AATTCTTTCC TATCTATTTT ATTATATTTT CACAAAAAA^. GCGAT7GTTT 7560 

CCATTCACAA TCGCTTCTTT CATTATTGAA CCCATTCGCC ATTATAGTTG ACAGAATAGC 7620

CATCTACGGT CGTATTCACT GCCAAGGCAC CTGAGCGCTA TAAGCGTAGT ACCATCTGCC 7680 

ATTGACCTGG AACCAACCTG TCGTCATAGA ACGACGAAAG AAACTCCATA CCATTAAGTA 7740 

AAGAGGAAAG TCGTGAGGGA GCATGCGCCA TTGACAACCT GTTTTAGTGA CGTACAAAGT 7800 

CTCATTAACA AGTACTCGTT TCGGCCATTT ATAGGTGCGG TGTTTGGAGA AATAGGGTTC 7860 

AATCTTCGCC CATTCTTGAT CGTTTAAATC AGTATCATAT GCTTTGCGTA TCATAACTCT 7920 

AGCTTAACAT TTTTTTGTGA ATACAGGTTC TAAATAATCG ACCACGAAAA TTTCTTAAGT 7980

GGAAAACGCC TTATGAAGTA TGCTACGGGA AAGTTATGCA CTTAATTTGA CAATTCAAGA 8040 

TGTAAAAATA TATACTATAG TAGATTGAAA CTAGAATAGT ACACCTCTAC TTCTAAAATA 8100 

TTGTTAGAAA TCGATTTGAC TGTCCTGATC GATTTATCCT GTTATTATCT CATTTTACTA 8160 

TAATATTTGA TAAGTTATCC TAAAAGTATT ATTATGTTGT TGTGTTATAG ATTGATTGAA 8220

TCTAACTAAA GGATCCTATT CAATTACTAG AACTATCACA TACTCAAGGT CAGCTCACAG 8280

ATGAGCAACT ATTTTGGTTA CAATGTCTAC TAAATTTAAG TCAAACAAAT AATTTAGTCA 8340
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AAATTAAAAA AATAGAGGAA CATAAATATG ATTACAAAAC AGAATGTAAT AGTGTTCTAC 8400

AATTTTTACT AGATAAAACT GTAAATTCTG AAGGAAGGAT CACTTCTTCA ACAGAATTTG 8460 

GAAATTTCGT AAGTAATTTA TCATTCCAAC ACGGAATAGC TGGACTACTG TTTCCTCTAA 8520 

ATAAATTGTA CCCCCCAGAA CTGGATTCTA AAATACTCTC TATCATCAAG AAGGCAGTGA 8580 

CAATTAGAAC GACACACACA TATGAATATC AATACTCACT GCTATTTGGT GATGCAGGCT 8640 

ATCTATGGTT ACTCCTACAT TTATTTTCTA TCAGTAAAAA TCAATACTAT CTACAATTAG 8700 

CAAACGTCAC CGCTAAAAAA TTAATAGAGA ATTATGATAC TCTAGAGGAA ATAGACTTTG 8760

CATTGGGAAA ATCTGGTGTC CTATTATCAT TAATAAAATA CTATCAATTT ACCAATGACA 8820 

ATACTCTTAA AATTTTCATC CACAATAGTA TAGGGGAAAT TTATCATTAT TTCCTACAAA 8880 

GAGATACAGC CAAAGAAAGC ATTTTAGACT ATAGCTTTGC TCATGGATAT TGTGGAATTG 8940 

CATATGCTTT ATTTGCCTAT TCTAAAGTCT TAGAACCTTC TATGTTTTAT AATGATCTCC 9000 

ATACATTCCA TACTGAATTA AAAAAATTAT TAGAAAAAGT TACTTCTAAT ACTGAAAATT 9060 

TAGGAAATTT ACAACTTTCT TGGTGCAAAG GAATTTCCGG AATAATCTTA TATCTTTGTA 9120 

TGTACGATTG TGACGGAAAC AAAGATATTA TTAGTAAATA TCAAGAATTT GTTTTTAACC 9180 

ATCATCTAAA AATGATGACA GGATATTGCC ACGGAATAAC TAGCTTACTA CAAACCACTG 9240

TCTACAATCA AAACAAATTA CTGATGAAAA AAATCCAACA GGTAATTTTA GCATGTTCTG 9300 

AACGAGATGA TCACGGTTTA CTGATGTTTC AAGGAGATAG TGGTAAAGCA GATTTGTTTG 9360

ACTTCGGAAT AGGAAGCATG GGGTATATTG GTGTCTATTA AATAATAAAT TCCCATTCGA 9420 

TGTGCAGACA TAAGGAGAAA AGTATGAAAT TATTTTGGAC AAACAACATA TATAGACAGT 9480 

TGCTGCTAAA CAGCTGTTTT TCATCATTCG GCGACAGTAT TTTCTACCTC GCCATTATCA 9540 

ATTATGTGGC TCAGTACAAT TTCGCTCCGC TAGCGATTTT ACTGATTTCC ATTTCAGAGA 9600 

TGGTTCCCCT ACTATCGCAA CTCTTTCTCG GGATTCTAGG AGATTTTCAA GAAAATAGAG 9660 

TCAAACACGC ACTCTGGATT GCCAAAATCA AAATCCTGCT CTACGCTATT TTGACAGTAT 9720 

TTCTCGTCTT GTCGCCCTTT TCATTAGTTT CAGTCATTAT GATTGTCATC ATCAACCTCA 9780

TCTCTGACAC CTTGAGCTAC CTGTCTGCCT ACATGATGAA CGCCCTCTAC ATCAGTGTAA 9840

TTAAGGACGA CCTGCATGAT GCCATGGGGT TCAGGCAGTC TCTGATGAGG GTTGTCCGTA 9900 

TTGTCGCCAA TCTGGCTGGC GCATTCCTTA TCAATGTTAT AAGTATTCAA ACTATTTCCC 9960 

TTATCAACAC TCTGACTTTT GTCATTGCCT TTTTGGGCCT GTATGTTATT CGACATACCT 10020 

TGTATGAGGT TGAAAAAAGA ATTGAAATGT CACATACAGC ACTGAGTTTT AAGAAATATT 10080 
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TTCAACATCT TAAACAGTCG CTGGCTGTGC TCCTGAGGTT AAAAGATACC GTCATACTAC 10140 

TGTTTCTGAC GACCAGTATG ATTGCCATCT TGGATGTGTC CCCTCGGCTG ATTGCCCTCC 10200 

GCTTCATCCA ACAGACACTA GCACAACTGA GCATTGGGCA ACTCCTCGCC CTGCTCTCCA 10260

TCATCATGTC TTGTGGAGCT ATCCTTGGCA ATATGACCAG CAGTAATCTA TTTAAAAATA 10320 

TCCGTTTCAC GCACCTCTTG GTTTTCTGTG AGATTTCCCT ATTGACTCTA ATAACTAGTA 10380 

TCCTTTGTCA AGCCTATATC GTAATTTTCA TGACCAGTTT CATCAGTTC? ACGATTATCG 10440 

GCATTCTCAG CCCTCGCCTA CAAGCAGCTG TCTTTGCCCA TATCCCCAGT GACAAGATGG 10500 

GGACGGTGGG CTCTGCTCTG AGCACAGTGG ACATTCTCGC CCCGTCCCTG CTCTCCCTAT 10560 

TAGCCCTATC CATAGCATCG GGCGTTTCGG TGCAGTTAGC ATTGATATTT TTGTATCTTA 10620 

TTTTAATTGC TCTTATCTTT TGTCAATGGT TAGTCAAGTT CAACACTCAT AACTAACGAA 10680 

AAAGCATGTG TAGATTTCAC ATGCTTTTAA TCTCCCCAAT CGTCAGGTCA AGTACAACAA 10740 

AGTCACTTCT TTGATTAAGC GAGTGTTCTA ATATAATTAT AAGCGCCCTG TCATTACCGA 10800 

ACCCATTCGC CATTATAGTT GACAGAATAG CCATCTACGG TCGTATTCAC TGCCAAAGCA 10860

CCTGAGCTAT AAGCATAGTA CCAGTTGCCA TTGACCTGGA ACCAACCTGT CTTCATGTCT 10920 

CCATTACCTG CATTTAGGTA GTACCAAGTT GAACCATCTT GATACCAACC AGTTGCCATA 10980 

GCTCCTGATG AACGGAGATA GTACCATTTG TTCCCAAGGT TTTGCCAACC TGTTTTCATA 11040 

TCGCCATTTG GRTGGTCTAA ATA.\TACCrJV GTGGTACCTT CCtGAtacCA GCCAGTGGCC 11100 

ATTGCTCCTG AGGAACGGAG GTAGTACCAC TTATTACCTA GATATTGCCA ACCTGTTTGC 11160 

ATAATACCAG TTGTTGGATC TAGGTAGTAC CAAGTCGAAT CATCGTTTAT CCACCCCGCA 11220 

CGTCTTTCAC CACCAAGGTA GTTTTCTCCA TTAATTTCCG TCTTAGCTAG ATAATACCAG 11280 

TTAGACTGAT CATAAAGCCA ACCTGTCTCT AAAGAATGAT TTTGATTAAA GTAATAGTTC 11340

GTATAATAAC GCTTCTCTTC TTTATCTTCT GAATCTTCAC GTTTTTCCCC GTACTTTCTT 11400 

CCAACACTGT CTTTAGTTTT AATCTCTAAT GTTTTCCAAC CAACAAACTC TTGTAGCACT 11460

CCATTTTTAT CGAAGTAGTA CCACTCTGAC TTTGGAAAAC CTTCTAATCT GATACCATTT 11520

GGGTAAGGAC CAATTGTACT ACCTTTAGAT GGAAACGGGA TATATTGCCA GCCGACAACC 11580 

ATCTCTCCAG ATAGAGAATC AAAATAATAG TACTTACCAT CAATCACTCG CCAGTAGGTT 11640 

TCTTTGAGGT CCCCCTTTTT GTAGTAGGTT CTTCCGTTTT CTTGGACAAA CTGCCATCCT 11700 

TCAGAATCAT CTGCAAATAC TGTACTGGTC CCTAGCAAAC CAAAGAAAAA TACTGTCAGT 11760 

CCAACTTGCA TAGTTTTTTT CAAAATTTTC ATCTATATAC CCTCCAATAT TAAATCCACT 11820 

CACCAGATGA GGCGAAATTA TAAACTTTAC CATCGATAGT TTGGCTACCT GTAACCATTG 11880
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CTCCAGG 11887

(2) INFORMATION FOR SEQ ID NO: 147: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 11340 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147:

CCGGTATGTT CTGGAATACT ACCAATCTAA GCTGGCTGTG CCCTACAGTT TTACAACCCT 60 

GTACGAATAC CTTAAGGAAT ATGACCGATT TTTCAGCTGG GTTTTGGAGT CTGGTATTTC 120 

AAACGCTGAT AAAATATCCG ATATTCCTTT ATCAGTTTTG GAAAATATGT CTAAGAAAGA 180 

CATGGAATCC TTTATCCTTT ATCTACGTGA ACGTCCCTTG CTGAATGCTA ATACAACAAA 240

ACAAGGTGTT TCACAGACAA CTATCAATCG AACCTTATCA GCACTTTCTA GTCTTTACAA 300 

GTATCTAACC GAGGAGGTTG AAAACGATCA GGGGGAACCT TATTTCTATC GTAATGTAAT 360

GAAAAAAGTT TCCACCAAGA AAAAGAAAGA AACCCTTGCT GCCAGAGCTG AAAATATCAA 420 

GCAAAAACTC TTTCTAGGTG ATGAAACAGA AGGTTTTCTA ACTTATATCG ATCAAGAGCA 480

CCCACAACAG CTTTCAAATC GAGCTCTCTC ATCATTCAAC AAAAATAAAG AACGAGATTT 540 

AGCCATTATT GCCCTTCTCT TGGCATCTGG TGTTCGCTTA TCTGAAGCTG TTAATCTAGA 600 

TCTAAGAGAT CTCAATCTAA AAATGATGGT TATTGATGTT ACTCGAAAAG GTTGCAAACG 660 

TGACTCAGTC AATGTCGCTG CTTTTGCTAA ACCTTATTTA GAGAATTATC TGGCCATTCG 720 

GAATCAACGC TATAAAACGG AAAAAACAGA TACAGCCCTT TTTTTAACTC TCTACAGAGG 780 

TGTTCCTAAT CGTATCGATG CTTCTAGCGT TGAGAAAATG GTTGCTAAAT ACTCAGAGGA 840 

TTTTAAAGTG CGTGTAACAC CCCATAAACT GCGCCATACA CTAGCAACTA GGCTCTATGA 900 

TGCGACTAAA TCACAAGTTT TAGTCAGTCA CCAACTAGGA CATGCTAGCA CACAAGTCAC 960 

TGACCTCTAT ACCCATATTG TTAGTGATGA ACAAAAGAAT GCTCTGGATA GTTTATGATT 1020 

TTACGTATTT TAAATTATGT AAATAAATAT CAAAAAAAGA AGTTGGCCAA CTTCTTTTTG 1080 

ATTTATCCAA CTACCGCTTC AGCGATTTCT TCACGGCTAA TACCAGCGAA GTAGCGTGTG 1140 

ATATCAATGG TTTTTAGCGC CTTAAGAACA TCTTCGCGTT CGTATTTCAC CCCACGAAGG 1200 

ACATCTTCTA CTGCAGCAAC GTCTTCAATA CCAAAGAAGT CACCATAAAT CTTGATGTCT 1260 

TGGATTTTTG ATTCAGTAAC GTTAGCAAAG ACTTCAACCT TACCACTAGT GAATTTGATT 1320 
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CCACGACGGA CGTTAAATTC AGGTGATTTA CCATAGTTCC AGTCCCAAGT TCCAAACTTA 1380 
GTATCCTTGA TGCGATTGAT TTCGGCCAAT TCTTCTTCTG AAAAGACGTA TTCAGTCATC 1440 
TCTGGGTACT CTTTTTTCAT GTATTCCAAG AGTAAATCAC GGAATTTTTC GACTGTGATT 1500 
TTTTTTGGTA ATTCATTGAT AATATTGGTT ACACGGGCAC GGACGGATTT CACACCTTTT 1560 
GATTCAAATT TATCTTTTGA AACCTTAAGG GCATTTGCGA GGACTGACAA ATCAACGTCA 1620 
AAGAGCAAGC AACCGTGGTG CATGATACGG CCGTTGATAT AGGCTTGGGC ATTGCCACAG 1680 
AACTTCTTAC CATCAATCTC AAGGTCATTA CGACCTGTGA ACTCAGCTTT AACCCCAAGT 1740 
TGAGCCAGGG TATTGATAAC CGGAGTTGAG AAGCTCTTGA AGTCAAATGC CTTATTTTCA 1800 

TCTTCTTTGG AGATGATCGT GTAGTTGAGG TTATTTAAAT CGTGGTAAAC AGCTCCACCA 1860 

CCACTAATAC GGCGAACTAC CTCAATACCA TTTTCGCGAA CATAATCACG GTTGATTTCT 1920 

TCGATAGTGT TCTGGTGACG ACCAACAATG ATAGATGGCT TGTTAATCCA AAGTAGGAAG 1980 

ATTTGATCCT CATCCAAAAG GTGTTTAAAG GCGTATTCTT CCAAGGCAAT ATTAAAAGCA 2040 

GTGTCATTTG AATGATTGAT AATGTATTTC ATGATATCCC TTTACTTTAT ATGATAGAAA 2100 

CTGGAAATAA CCTTCCAGTC TAATCTATCT TCGTTTTATT TTTTCTTAGG TGAATGGATG 2160 

GCCATTCCTA GAACATCTGC AAACGCTTCG TACATCACTT CAGAGTAAGT TGGGTGCCCG 2220 

TGGATGGTCT TCAGCATTTC CTCAACAGTG ATTTCCATTT CGATGATGCT TGATGCTTCG 2280 

TTTATTAATT CTGCGGCTGC AGGACCAAT.A ATGTGTACAC CT'J'.GGATTTC TCCGTAtttc 2340 

TTATCAGCGA TAACTTTTAC GAAACCTTGA GCTGCGTCAG ATGCAATAGC ACGACCGTTA 2400 

GCAGCAAAGT TAAACTTACC GATGGCAACA TCGTATTTCT CACGGGCTTG TTCTTCTGTC 2460 

AAACCTACTG CTGCTACTTC AGGGAGAGTG TAGATGGCTG CAGGAGTCAA ATTCAATTTG 2520 

GCAACTGCAT GATTTCCTTT AAGGGCATTT TCAGCGGAAA CTTCACCCAT GCGGAAAGCT 2580 

GCGTGAGCCA ACATCTTAGT ACCGTTGATG TCACCTGGTG CATAAATGCC TGGAACTGAA 2640 

GTTTCCATGT ATTCGTTGAC CTTGATACAA CCACGATCCA ATTCAAACTC AACCTCTCCA 2700 

ATACCTTCAA GGTCTGGCAT ACGACCAATT GAAAGAAGAG CTTTGCTTGC GATGATATCG 2760 

TCTTTTCCTT CAACCTTGAT ACGAAGTTGA CCATTTTCCT CAATGATTTC TTGCAGTTTA 2820 

GTACCAGTCA AGATGGTCAT TCCTTTACGC TCAAGAATCA AGCGAAGGTT CTTAGAAACT 2880 

TCCACATCCA TAGCTGGAAC TATACGGTCC ATCATTTCGA TAACAGTCAC TTTTGAACCA 2940 

AATGTCATGA AGGCCTGACC GAGTTCGATA CCGACAACTC CACCACCGAT GATAACAAGG 3000 

CTTTCTGGCA CTTCGTTCAT TTCAAGAATG TCATCACTAG TCATGACAAG TGGAGATTCC 3060 

ATACCAGGGA CGTTGATCTT GTTGACTTTT GAACCACCAG CAAGAATGAT TTTCTTGGTT 3120 
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TCAAGCAATT CAGAACCATT TACCAAGACG TTCTTGTCTT TAGTGATTGT ACCAATTCCT 3180 
TTATGAACAG TAACTCCGTA GCTACGAAGA AGTCCTGCAA CACCACCAAC AAGAGTATTA 3240 
ACAACTTTAG ATTTAGTTTC TAAAAGTTTT TCCATATCAA CAGTGAAGTT AGGATTTTCA 3300 
ATCACGATAC CACGATTTGC AGCATGACCG ATATTTTCAA TAATTTCAGC GTTATGAAGG 3360 
TAGGTCTTGG TTGGAATACA TCCACGGTTT AAGCAGGTTC CACCAAGTTC AGATTTCTCA 3420 
ACAAGGGCAA CCTTACCGCC GAATTGGGCA GCTTTAATGG CTGCAACATA ACCAGCAGGA 3480 
CCTCCACCAA TCACAACGAT ATCAAAAGCA TCATCGCTCT TACCATCATC GTTTGAGGTA 3540

CTTGCTACAG GTACAGGGCT AGCTTCTGGC GATGCTGCTC CAGCTGTTGG GATGTTTTCC 3600

CTTTCTTCAC CAAGGTAACC GATAACTTCC GTTACAGGGA CAGTTTCACC ATCTCCTTTG 3660

AGAATGGCAA TCAAGTACCC ATCTTCTTCG GCTTCCAATT CCATGCTGAC TTTATCAGTC 3720

ATGATTTCCA AAAGGATTTC TCCTTCTTTT ACAAATTCTC CGACTTTTTT ATTCCATTGG 3780 
ACGATTTGTC CTTCTGTCAT ATCCACGCCG GCTTTTGGCA TAATTACTTC TAAGGCCATG 3840 

TCTTCCTTCC TTTATCTATA TCTTAAAAAT GAATACTCTT GCTCTTAAAT TAACATTGAG 3900 

ATTGGCGTTT CAATCAACTC TTTCAAGTCC TTCATAAACT TAGCACCAGC CATACCATCT 3960 

ACGACACGGT GGTCAATGGT TAATCCTAAA CTCATGATTG GGCGAATCAC AATTTCACCA 4020 

TTGACGACAA CTGGCTTCTC GATTGTCGAA CTGACACCAA GGATAGCTGA GTTGGGTTGG 4080 

TTAATAATCG GACCAAAGGA CTGAACACCA AACATTCCCA AATTACTGAT TGTGAATGTT 4140 

GAATTTTGTA ACTCACTTGG AGCCAATTTA CCATCCAAGG TACGGCCAAT AACATCCTTA 4200

AAGGCTACAA CCAGTTCTGA AAGACTCATC TTCTCAGCAT TGTAAACAAC AGGTGTCATC 4260 

AATCCATTAT CCATCCCAAC TGCCATGGCA AGATTGACAT AGTTGTGAGT GATAATAGTC 4320 

TTGCCATCTT CTGTCAATGA AGCGTTGATG TATGGGTGTT TCATAAGAGT CTTAACAACT 4380 

GCAAGCGAAA GAAGGTCTGT TACAGTAGTC TTCTTCCCAG TTGCTTCCAT GATTGGCTCA 4440 

AGAACCTTCT TACGAAGAGC CAACATTTCA GTCATATCAA CTTCATAGTT GAGGGTGAAG 4500 

GTTGGCGCAG TCAAGTAAGA TTCAACCATG CGTTGGGCAA TAACCTTACG CATTGGTGTC 4560 

ATTGGAATAC GCTCGATTTT ACCATATGGT GTTACGTTAT CAGGGACTTC TTCCACTTTT 4620

TCAATCTGAG CAGGAGATTT GATGCTATCG TTTTCGATAT TTTCAGGAAG CAGGGCCAAA 4680

ACATCCTTCT TCATGATTTT ACCACGATGA CCGGTTCCTT GGATTTCCTG CCAAGCAATG 4740 

TTATGTTCGA GGGCAATTCG TTTTGCAAGT GGCGAAATGC GAACCACGTT TGTGTCTTTA 4800 

TAAGTTTCCA CGTCTTCTTT GTGGACACGA CCGTTTGCAC CTGAGCCAGA AACGTCGTAG 4860
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AGGTTTATCC CTAAATCATC CGCTAACTTT CTAGCTGCAG GAGTCGCTCT TAGCTTGTCA 4920

TCAGCCATGA CCTCTCCAAT TCTATTTATG ATACAAAGGG CGTCAAAAGC GACTGAAAAA 4980 

TAGGAAATCG ACGATGGCTT CGATGAAGCC AAGGAGATTT ATCTTTTTTC CGATCTTTTA 5040 

GCCCGTGCTC TAATCTAAGA TATTAATGAC GAAGAGCTCT GCACCTAAAA GATACAAAGT 5100 

TTCTCGTCAG CTTTATTTTA TTTACATAAC TTATCTTATG TAACCCTATT CTTTGTTATA 5160 

AGTTTTTCGG ATTGCATCTT TGATACTTTC AACTGTTGGA ATCATTGCAT TTTCTAGGTT 5220 

TTGTGCATAA GGCATCGGCA CATCTTCTCC TGCACAACGG CGAATTGGTG CATCTAGATA 5280 

GTCAAATGCT TCTGATTCTG AAATAATAGC TGAAATTTCA CCGATATAGC CACTTGTTTT 5340 

GTGGGCATCG TTGACCAGAA CAACCTTACC AGTCTTCTTC ACTGAGTTTA TGATGATATC 5400 

CTTATCAAGC GGAACAAGGG TACGTGGGTC AACAATTTCA ACTGAAATTC CTTCTTCTGC 5460 

TAATTCTTCA GCAGCTTGAA CCACACGGCG AAGCATTTTT CCATAAGTAA CAACTGTTAC 5520 

ATCCGTTCCT TGGCGTTTGA TTTCACCAAC CCCAAGTGGA ATTGTGTAGT CTGGATCAAC 5580 

TGGCACTTCC CCTTTTTGGT TAAATTCTGA CTTGTACTCA AGTATAATAA CTGGGTTGTT 5640

ATCACGGATA GAAGACTTAA GCAGGCCTTT CATGTCCGCA GGTGTTCCAG GTGCCACAAC 5700 

CTTAAGTCCT GGAATGTGAG TAAACCAAGA CTCTAGAGAT TGTGAGTGCT GGGCGGCAGA 5760 

GCCAACTCCG TTACCAGCTG CACAACGAAC AGTCATTGGA ACCTGACCTT TACCACCAAA 5820 

CATGTAACGT GTTTTAGCAG CTTGGTTGAC GATATTGTCC ATGGCAATAA CAGAGAAUTC 5880 

CATGAAGGTC ATATCGACGA TTGGACGAAG TCCTGTCATG GCTGCTCCTG CTGCTGCTCC 5940 

AGAGATGGCA GCTTCAGAAA TCGGACAGTC ACGGACACGT TCTGGACCAA ATTCTTCAAG 6000 

CATTCCAACA GAAGTACCGA AGTCTCCTCC GAAGACACCG ACGTCTTCTC CCATCAAGAA 6060 

CACATTTTCA TCGCGACGCA TTTCCTCAGA CATAGCAAGG ATAATGGTGT CACGGAAGGA 6120 

CATTGTTTTT GTTTCCATTT TATCTCTTTC TCCTTAGTCT GCGTAAATAT CTTCAAAGGC 6180 

TGATTCAAGC GGTGGGAATG GGCTTTCCTC TGCAAATTTA ACAGAAGCTT CTACTGCTTC 6240 

CTTTACTTGC GCTTGGATTT CTTCCAATTC TTCGGCACTT GCAATGTTAT TTTCAATAAG 6300 

GTAATTGCGG AGGTTTTCGA TTGGATCTTT TTGTTTCCAC AATTCCACTT CTTCACGCGT 6360 

ACGATATTTA CCAGGGTCAG ATGATGAGTG ACCGAGCCAG CGATAAGTTA CACTTTCAAT 6420 

CAAGACTGGA CCATTGCCAC TGCGAACATG GTCCACAGCT TTCTGAAATC CTTCATAGAC 6480 

ATCGATGACA TTGTTACCGT CTTCGATGAA CATTCCAGGA ATTCCATAAG CGGCGCTACG 6540 

TTGATGGATA TGTTCTATAT TGGTCATTTT CTTGATATCC GCAGAGATAC CGTAACCGTT 6600 

GTTAATGCAA TAGAAAATGA CTGGCAGGTT CCAGATAGAA GCCATGTTCA CTGCTTCGTG 6660 
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GAAAACACCT TCATTGGTCG CACCATCTCC AAAGAAGCAG ACAACGATTT TACCGGTATT 6720 

TTGCATTTGC TGACTGAGGG CTGCACCGAC AGCGATCCCC ATACCACCAC CTACGATACC 6780

ATTGGCACCA AGGTTCCCAG CATCAAGGTC AGCGATATGC ATAGATCCAC CTTTCCCTTT 6840 

ACAGGTTCCA GTGTATTTAC CAAGGATTTC AGCCATCATT CCGTTGAGGT CAATCCCTTT 6900 

AGCAATAGCT TGCCCGTGTC CACGGTGGTT TGAGGTAATC AGATCATCTG GATTGAGAGC 6960

TAACATAGCC CCCACGTTAG CTGCCTCTTC ACCAACAGAA AAGTGCGTCA TTCCTGGCAC 7020 

TTTCCCTTTC TTTACTAATT GTGCAATTTT TAAGTCCATG CGACGGATTT CTTCCATCTT 7080 

ACGGAACATT TCTAGCAAAA GATTTTTATC TAAAGTTGAC ATCTTCTTGC CTTTCTAACT 7140 

TTCTTCTTAC CTTACTATTT TACCGCTTTT GGCAAATACT GTCAAAGTTT TTCTAAAAGA 7200 

AATTTCACAA AATAAAAAAG AAAACCCCGT GAAAACAAGG GATTTTCTTG TCAAGAATAT 7260 

TTTTTCACAA ACTTTTTAGC ATTTGGATTT TGCTAAAGAT TCAAATCTCT TCATAATCAC 7320 

AGTTAAACGC CAACGGTAGA GCGCCCCGCT CACAATCAAA CTAATAATCA AGCCGATCCA 7380 

GTAAGAATAA GCTCCAAAAT CTGTTAGGGA ATCAAATAGC GTAnCACAGG GATTGCTACG 7440

CCCCAATAAC CAAGCAAACC AAGGTAAAAA GGAATAACTG TATCCTTATA CCCCCGCAAA 7500 

ATTCCCTGAA GCGGCGCCGC AAAGGTATCT GCTAACTGGA AGAAAAGACT ATAAGTTAAA 7560 

AAACGCACTG TCAAATCGAT AAATTTTGGG TCGTTACCAT AAAGACTGGC CACATTTCCC 7620 

CTAAAAATGT AAAGGAAGGT TAAGGTGAAG GCCGCAAAAA TGAGGGCAGT CCATCTTCCT 7680 

AGACCAATAT AGGTTTTCGC ATCATCAAAT CGCTTGGCTC CCACTTCATA GGAAACGACA 7740 

ATAGCCATAG CCGATGAGAT ACTCATAGGA AAGGCGTACA TAAGACTTGA AAAGTTCATA 7800 

GCTGACTGGT GACTAGCTAT AATCAAGGGC GAAAACTTAG CCATAATCAA GCCAACCACT 7860 

GAAAAGATAG CCACTTCCGC GAAGACAGTT CCCCCAATAG GCAGACCTAA ACGAACTCCT 7920 

TCCTTAATTT TATCCATATT AAGTGGAATT CGTTTCTCAA GGTGTAAGGC TTTGAGCTTC 7980 

TCCTGTTTAA ATAAAACCAG AACAGAAATC CCAAGCAAGA CCCAGTAGGC CAAGGATGTT 8040

CCTAAACCAG CACCAGCCCC TCCCAGTTCT GGAACACCAA AGGCACCGTA AATCAAGAGA 8100 

TAGTTAAATC CGCTATTGAG AGGGAGTAAC AAAAGCATGA GGTACATGGA CAGTTTGGTC 8160 

AAGCCCAGCG AATCCAGCAA GGAACGAATG ACGCTAAAGA GCAACAAGGG GATAATCCCG 8220 

ATAGATAAAA ACCAAAGATA GCGAACCGCT ACTGCCGCTA CTGCTGCTTC TAACCCAATA 8280 

TGATTCAAGA TTATTGGTGC CAAGAAAAGT ACCATCCCCA GCAAGACCAC AGATAGGCCC 8340 

AAGGCCAAAT AAATAAATTG GTAAAAATCA GACGCAACTT CTTCCTTTTT GCCTCGACCA 8400 
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AGATGGTGAC CAATGATAGG CACCAAGGCT GACACAATCC CTGTTAGAAA TGTAAAGAAA 8460 

GGATTCCAGA TACTGGTTGC CATAGATACA CCAGCCAAGT CCATAGTGTT GTATTGACCT 8520 

GTCATTGCAG TATCAACAAA AGAGGCAGAA TAATTGGCAA ATTGGTAGAT CAGGATTGGG 8580 

AAGAAAATTT TTAAAAATAA TACTAACTTC TCTCGTAAAC ACTTTGTCTT ATACATACTT 8640 

CTCTTTCTAT TCTGATTTAT CTAAACCAAA GAGTTTCAGA CCATAGTTTT TCAAACTTAG 8700 

CGGAGGTTTA TTAGATTTTG AAGTAGTATG CCAACACGCA CATGTACGAC AATAATAGCT 8760 

TCTAACTAAA CCTCCGTTAT CATATTGAAC CGCATGGTCA GCTTTTTCTT TAGTTTCATA 8820 

TTGAATTTTG GAACGATTAG CTGCGGGACA GTAAATTCCA CTATTAGATT TCGCTTGTCT 8880 

CTCCCTACGT TTTCGAAAAT AATTCATATT CTAACTCCTA TCAAGCTTGA TAGACGATTT 8940 

GTCCCTTACA GATGGTATAT TTAACCTGCC CTTTTAAGGT TTCACCGATG AATGGTGAAT 9000 

TAGCTGCTTT GGAAGCAAAA TGGGAGTCCA CAAAGCGGTC AGCCTTGGCA TCAAAAATAG 9060 

TGATATCTGC TGGACCATTC TCAGCCAAGT AACCTGCTTC AAAGTTGTAA AGCTTGGCTG 9120 

GGTTGTATGT CATTTTTTCA AGTAATTCCA TCAAGCTCAA CTCACCAGCT TCTACTAAAT 9180 

AGGTCAAGCT GAGAGACAGG GATGTTTCTA AGCCAGTCAT ACCAGATGGC GCTTTGGTAA 9240 

TATCCTCAAC ATTTTTTTCA TCTACATGAT GAGGCGCGTG GTCAGTCGCA ATAACTGTGA 9300 

TGACACCTGA TTTGAGACCT TCGATAACGG CACGACGGTC TGATTCCAAA CGAAGCGGTG 9360 

GATTCATCTT AGCATTGCTA CCTTGTGTTA AAAGA.^r,Tr?C TTCTGTCTTA GAGAAATGCT 9420

GTGGCGCTAC TTCTGCTGTG ACTTCTGCAC CTAACCCCTG AGCAAACTCC ACTACTTTAA 9480

CACTTTCTTC CTTAGACAAA TGCTGGATGT GAACATGGGC rTTAGTTGCA TAGGCAATCA 9540 

TGACATCACG CGCCATCATA GCGTACTCAG CCACCCCAGT AGCACCGCAG ATATGGAAAT 9600 

GTTCTCTAGC AATATTTTCA TTAAAGCCAA GAACACCGTT CAAACCTGGA TCTTCCTCAT 9660 

GAAGGCTGAT AAAGGTATTG AGTTTTTTGG CTTCCTCCAT GGCTTCCTTG ACAATCTTAC 9720 

TGCTCTCAAG CGGAATACCG TCATCAGAGA AACCAACCGC ACCAGCTTCT AAGAGTGCCT 9780 

TAAAGTCAGT CAAGTTTTTA CCATTAAAGT TTTTAGTAAT GGTCGCAACT GTCTTGACAT 9840 

TAATCTTCTC TTTGGCAGCT GACTGGAGAA CTGCTTGCAA AGTCTCCACG TCTGAAATGG 9900 

TTGGACTGGT ATTAGCCATC ATGACGACAG TAGTAAAACC ACCTGCAGCG GCTGCTAGGG 9960 

CACCAGTATG AATGTCTTCT TTATGTGTTT GACCAGGTTC ACGGAAATGA ACATGAATAT 10020 

CGACCAAGCC AGGAGCAACC ACAAGACCAG TAGCATCAAT CGTTTCTGCT CCTTCTTCCG 10080 

TGATCTCAGA CGCAATTTTG ATAATTTTCC CATCTTGAAC TAAGACATCA CAAACTTGAT 10140 

CCAAACCAGA CTTGGGATCC ATTACACGAC CATTTTTGAT TAGTAGCATC TGCTTTCTCC 10200
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TTTATTCATA GAAATCAACT TGGGTATCCA ACAATTTATC CCCATCATAA ACAAACTTGG 10260
CTGAAAAGAA GGGTTTATCC TCTAAAAGCC ACTCAACAAA GGTGTGGTCA CCTTCCCAAG 10320 
TCGGCTTGCT CAAAACCTCA TCATAGGGAA CCCATTCTAG CGTCCCCTCA TTGCAGTCAA 10380 
TCAAGTCGCC CTCAAACTCC GTCACCTTAA AAACATAGGT GTACCAGTCT AAATCTGGTG 10440 

TAAATTCAGG AAAAGTGATG ACACCTTTTA GAACTGGCTT GGCTTTGAGC CCTGTTTCTT 10500 

CAAGGATTTC ACGCGCCGCG CATTCCTGGG GCGTCTCTCC TCTCTCTAGC TTACCACCCA 10560 

CACCAATCCA TTTCCCTTCA TGGACATCAT TGGGTTTCTT ATTACGATGG AGCATGAGCA 10620 

GTTCTTTCCC ATTATCAATG TAGCAAATCG TCGCTAACTG AGGCATATTT TCTCCTTATC 10680 

TAAGCCAATC GATTGGCTCT TGTCCTGTCT CTTTTAAGAA TGCATTGGCC TTGGAAAAGG 10740

GCTTGGAACC CCAAAATCCT CTATAAACCG ACAAAGGACT TGGATGGGCT GATTCGATAA 10800 

TCAAGTGATG AGGATTGGTA ACTAATGCCT TCTTCTTACG TGCATAAGCT CCCCAGAGTA 10860 

CAAAAACGAC TGGTCTATCT AGATGATTGA CCACCTGAAT CACAGCATCA GTAAAAGGCT 10920 

CCCAGATTTG ACCAGCATGA CCATTGGCCT GTCCAGCAGG AACAGTCAAA CAAGCATTAA 10980 

GAAGCAAGAC TCCTTGCTCA GCCCAAGCTG TCAAATCATG AGATTTCTTA ACTCCGATAT 11040 

CATCTGACAA TTCTTTCAAG ATATTTTGCA AGGATGGTGG AGCTGGGATA GAGTCAGGTA 11100 

CAGAAAAACT CAAGCCCTGC GCTTGACCTG GTCCGTGATA GGGGTCTTGC CCTAGAATTA 11160 

CCACCTTAAC TTCTTCAAGC AGTGTTGTCA AGAGAGCCTG AAAAACCTTT TCCTTGGGTG 11220 

GATAAATAAT CCCCTGAGAA TAGACCTGCT CCATAAACTG ATTGATTTTC CCGAAATAAC 11280 

CCTCAGGTAA TTGCGCCTTA ATCAAAGCAT GCCAAGACGA GTGTTCCATA GCCGACTCGG 11340 
(2) INFORMATION FOR SEQ ID NO: 148:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12127 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 148: 
AAAAAATAGA CTTGTTAGAC TATAAATGTA GTAAGCCTAC ACAAGAAAAA TACATAGAGA 60
TAAAGGTGAT TATTATGAAA TTCAAAAAAA TGCTTACTCT TGCAGCCATT GGCTTATCAG 120
GATTTGGGCT TGTTGCCTGT GGCAATCAGT CAGCTGCTTC CAAACAGTCA GCTTCAGGAA 180
CGATTGAGGT GATTTCACGA GAAAATGGCT CTGGGACACG GGGTGCCTTC ACAGAAATCA 240
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CAGGGATTCT CAAAAAAGAC GGTGATAAAA AAATTGACAA CACTGCCAAA ACAGCTGTGA 300 

TTCAAAATAG TACAGAAGGT GTTCTCTCAG CAGTTCAAGG GAATGCTAAT GCTATCGGCT 360 

ACATCTCCTT GGGATCTTTA ACGAAATCTG TCAAGGCTTT AGAGATTGAT GGTGTCAAGG 420

CTAGTCGAGA CACAGTTTTA GATGGTGAAT ACCCTCTTCA ACGTCCCTTC AACATTGTTT 480

GGTCTTCTAA TCTTTCCAAG CTAGGTCAAG ATTTTATCAG CTTTATCCAC TCCAAACAAG 540 

GTCAACAAGT GGTCACAGAT AATAAATTTA TTGAAGCTAA AACCGAAACC ACGGAATATA 600 

CAAGCCAACA CTTATCAGGC AAGTTGTCTG TTGTAGGTTC CACTTCAGTA TCTTCTTTAA 660 

TGGAAAAATT AGCAGAAGCT TATAAAAAAG AAAATCCAGA AGTTACGATT GATATTACCT 720

CTAATGGGTC TTCAGCAGGT ATTACCGCTG TTAAGGAGAA AACCGCTGAT ATTGGTATGG 780 

TTTCTAGGGA ATTAACTCCT GAAGAAGGTA AGAGTCTCAC CCATGATGCT ATTGCTTTAG 840 

ACGGTATTGC TGTTGTGGTC AATAATGACA ATAAGGCAAG CCAAGTCAGT ATGGCTGAAC 900 

TTGCAGACGT TTTTAGTGGC AAATTAACCA CCTGGGACAA GATTAAATAA AATGTTTGCT 960 

CCATAAATCT CTAAAGAGAT GCAGACGTTT CATCGTACAA TAAGATAAAG AAGGCAAGTA 1020 

GGGAGGTGTC GTATCTCCCT TACTTTCTTC ACTAGAAAGG ACAAGATGTG ACAAAACAAG 1080 

CCTTCAAAGA AGCAGTTTTT AGGGCAATTT TTTTCATGAG TGCAACAGTA GCTGTTGTAG 1140

CTATTTTGCT AATCTGTTTC TTTATTTTTA GTAATGGCTT ACCTTTCATA GCTAACTACG 1200 

GCTTTGCCCG TTTTTTATTA GGCAGTGATT GGTCGCC.^.^C GAACATTCCG GCAAGCtaTG 1260 

GTATTTTACC AATGATCGTT GGTTCCTTAT TAATTACCTT AGGAGCGATT GTGATTGGGG 1320 

TGCCAACAGG CATCTTGACA TCGGTGTTTA TGGTTTATTA TTGTCCAAAG CCCGTCTATG 1380

GCTTCTTAAA ATCAGCTATC AACTTGATGG CAGCCATTCC ATCTATTGTT TATGGTTTTT 1440 

TCGGCCTACA ATTATTGGTG CCTTGGATTA GAAGCTTTTT AGGAAATGGC ATGAGTGTCC 1500 

TAACCGCTTC GTTACTATTA GGAATAATGA TTTTGCCAAC CATTATCAGT TTGTCAGAAT 1560 

CTGCTATCCG AACAGTTCCC AAAACGTATT ATTCTGGTAG CTTGGCTCTA GGAGCTAGTC 1620 

ATGAACGGAG TATTTTTAGT GTCATCTTGC CAGCTGCGAG ATCTGGTATT TTATCAGCAG 1680 

TTATTTTAGG AATCGGTCGC GCAGTAGGTG AAACCATGGC AGTTATTTTG GTGGCAGGCA 1740 

ACCAGCCGAT TATTCCAAGT GGACTCTTTT CAGGAACCAG AACCTTAACA ACCAATATTG 1800 

TTCTGGAAAT GGCTTACGCA TCAGGTCAGC ATAGGGAAGC CCTTATTGCA ACCTCAGCAG 1860

TTCTCTTTTT CCTTATTCTC TTGATTAATG CCTACTTTGC CTACTTGAAA GGAAAATCAT 1920 

CTTATGAGTA AATACCTGCT AAAACTTCTC GTTTATTGTT TTTCAGCTTT AACCTTTGGC 1980 

TCTCTCTTTT TAATCATTGG TTTTATCCTC ATCAAAGGCT TACCTCATCT AAGTCTATCC 2040 
CTCTTTTCTT GGACTTATAC TTCTGAGAAC ATTTCCCTTA TGCCAGCGAT TATTTCCACC 2100

GTTATTCTGG TCTTTGGTGC TCTTCTTTTA GCCTTGCCCA TAGGGATTTT TGCTGGTTTT 2160 

TATCTTGTGG AATATACAAA AAAAGATTCC CTTTGTGTTA AAATCATGCG ATTGGCCTCA 2220 

GATACCTTAT CTGGGATTCC TTCCATTGTT TTTGGTCTGT TTGGCATGCT CTTCTTTGTA 2280 

GTCTTCTTAG GTTTTCAATA CTCTCTGTTA TCAGGAATCT TAACCTCAGT TATCATGGTG 2340 

TTGCCAGTCA TTATTCGCTC AACAGAAGAA GCCCTTTTAT CTGTTAGTGA TAGCATGCGT 2400 

CAAGCAAGTT ATGGACTTGG GGCAGGTAAG TTACGGACTG TTTTTAGAAT TGTTCTACCA 2460

GTTGCCATGC CAGGTATTTT AGCTGGAGTG ATACTAGCTA TTGGCCGTAT CGTTGGTGAA 2520

ACAGCTGCCC TCATGTATAC ATTAGGTACC TCTACCAATA CGCCAAGTAG TCTCATGTCT 2580 

TCAGGCCGTT CTCTAGCCCT ACATATGTAT ATGCTGTCAA GTGAGGGGCT ACATGTCAAT 2640

GAAGCCTATG CTACCGGCGT GATTTTGATT ATTACTGTTT TAATGATAAA TACTCTATCA 2700 

AGCTTATTAT CTCGAAAACT TGTGAAAGGA GCTTCCTAGT ATGGGAACAT TTTCAGTCAG 2760

ACACCTAGAC TTATTTTACG GGGATTTTCA AGCCTTAAAA AATATTTCGA TTCAATTACC 2820

AGAAAGACAG ATTACTGCCT TGATAGGCCC ATCTGGTTGT GGCAAATCAA CTTTTCTAAA 2880 

AACCCTTAAC CGGATGAACG ATTTGGTTCC TTCTTGCCAT ATTGAAGGCC AAGTCCTCTT 2940

AGATGAGCAA GATATTTATA GTAGCAAATT CAACCTTAAT CAGCTACGTA AGCGTGTAGG 3000

GATGGTTTTT CAACAGCCTA ATCCCTTTGC CATGTCTATC TATGATAACG TGGCTTATGG 3060 

CCCAAGGACA CATGGTATTC GAGACAAAAA ACAATTAGAT GCCTTAGTGG AGAAATCTTT 3120 

AAAAGGGGCA GCCATTTGGG AAGAAGTCAA AGATGATCTT AAAAAGAGTG CCATGTCCTT 3180 

ATCTGGCGGT CAGCAGCAAC GCCTTTGCAT TGCGCGAGCT TTAGCAGTAG AACCTGATAT 3240 

TCTGTTAATG GATGAGCCGA CTTCAGCCTT AGACCCTATC TCCACTTTAA AAATTGAAGA 3300 

CCTCATTCAG CAACTAAAAA AGGATTATAC GATTATCATT GTTACCCATA ACATGCAACA 3360

AGCTTCACGT ATTTCAGATA AAACTGCTTT TTTCTTAACA GGAGAAATTT GCGAATTTGG 3420 

AGATACCGTT GACGTGTTTA CCAATCCAAA AGATCAGCGC ACAGAAGACT ATATTTCAGG 3480

ACGGTTCGGA TAAGGAAGGA AAAACCTATG AGAAATCAAT TTGACTTAGA ATTGCATGAA 3540 

TTAGAACAAT CCTTTTTAGG ACTAGGGCAA CTTGTCCTTG AAACAGCTTC AAAAGCCTTA 3600 

CTGGCCTTAG CCTCCAAAGA CAAGGAGATG GCAGAGCTAA TTATCAATAA GGATCATGCT 3660

ATCAACCAAG GTCAAAGCGC TATCGAATTG ACCTGTGCCC GTTTGTTGGC CTTGCAGCAG 3720 

CCACAAGTGT CTGACCTTCG ATTTGTGATT AGCATCATGT CTTCTTGTTC AGACCTTGAA 3780 
CGTATGGGAG ACCATATGGC AGGCATTGCC AAAGCTGTTT TGCAACTAAA AGAAAATCAA 3840
CTAGCCCCTG ACGAAGAACA GTTACACCAA ATGGGTAAAT TATCCCTCAG CATGCTAGCC 3900

GATTTATTGG TTGCCTTTCC TTTGCACCAA GCCTCAAAAG CTATTAGTAT TGCTCAAAAA 3960

GATGAACAGA TTGACCAATA TTATTATGCC TTATCAAAGG AAATCATTGG ACTTATGAAA 4020 
GACCAAGAAA CCTCAATTCC CAATGGAACT CAATACCTTT ATATCATAGG GCATCTGGAA 4080 
CGCTCGCTGA TTACATTGCT AACATTTGTG AACGCCTAGT CTACCTAGAA ACAGGAGAAC 4140 
TAGTGGATTT GAATTAATTC AACTAATCCT TAAAAGAGAA GAGTACGATT AAGTACTCTT 4200 
TTTTATGGTT GTAAAAAAGT TCATTTGACC AATTTAAGCA GTGTAGATAG TGAGGAGTTG 4260 
TTTCAATTCT ATCGTGAACG AGGGAATGCT GAAAACTTTA TCAAAGAAAG GAAAGCAGGA 4320 
TTCTTTGGGG ATAAGACAGA TAGTTCGACC ATGATTAAGA ATGAAGTACG TATGATGATG 4380

GGCTGTCTGG CTTATAATCT CTACCTCTTT TTAAAGCAGC TAGCTGGTGA TGAAGTAAAG 4440

TCCTTGACTA TCAAGCGTTT TCGACGTCTC TTCCTTCATA TTGCCGGAAA ATATGTCTCT 4500

ACTGCTAGAC GACATATTCT CAAATTCTCA AGTCTATACG CCTATTCAAA ACAGTTTCAA 4560

GCCTTATTTG ATACAATCTG CCAGATAAAT CTGATACTCC CTGTTCCATA TAGAGCTAGA 4620

GGGCAGGGGA AAACATGCCT AACAGAATAA GTCACCTTAT TTTAAAAATC GAGCATCAAA 4680

CCAAGGGAGG AGTCTGCCCT TTTTTAGGAA AAAATCAAGA CAAATCTCCT CAATTATGTC 4740

TCGAACATCA GAAATTAAGC AAAATCACCA GAAGGACAGT ATTTav\CTA GCTTTTCT'GG 4800

TAATTTTTGA ACTGTGTAGT TCGTTAGTGC CAGATATGAA TAATTTGGGA TGATAAATCT 4860

TTCTTCCTCA GGTAGCCTAT CATAATACTC TTCAAAAATC TTATCAAAAA CACTCTCTTT 4920

CTTTTGGGCG ATAGTTTCAT CTTCGTATGT AGGAGTCCTC ATCAAGAAAT ACTTCAATTC 4980 

TAGGTATTCC TTATCCAACT CTATATAACT TGGCATCAAC TTGTAATCTT CAACCCCCAA 5040 

ACGTTCAGCA ATATATTTTA ACTTTGTTAG TATTGGTCTG GATTCTCCAT TTTCAATTCT 5100 

AATTAATTGA CGGATACTTA ATTCAGACTC ATCACCACAA AATTCTGAAC GACTGATT1 T 5160 

TTTAGCCAAA CGTAATCTTT TAATTTTTTC GCCAAACTCT CGCAACCTAC AAGAACTTCC 5220

TGAGTTGTTT ACCTCTATTA TAAGCATATA CTGAATCAAA CTATCTATCA GATTTCTTCT 5280 

CACTTTAACT AAAGACTAAG AGTTTATCCC TTCGTCTCGG TTTTTGTGTA TTTTTCCACC 5340 

ATACCCCAGT AATGCAAGTG CAAAATCCCC TAGAATATGA TAGAATAAGA GAAAGAACTC 5400 

TATCAAGGAG GAAATCATGG AAAAACAAAC CGTCGCCGTC TTGGGGCCTG GTTCTTGGGG 5460

AACCGCCCTT TCACAAGTCT TAAATGACAA TGGACACGAG GTACGTATTT GGGGAAATCT 5520 

TCCCGAGCAA ATCAATGAAA TTAATACACA CCATACTAAT AAGCACTACT TTAAAGATGT 5580 
CGTTCTAGAC GAAAATATCA TTGCCTACAC CGACTTAGCA GAAACATTGA AAGATGTGGA 5640
TGCGATTTTG TTTGTTGTCC CAACAAAAGT GACACGACTT GTTGCCCAGC AAGTTGCACA 5700 
AACCTTGGAC CATAAGGTTA TCATCATGCA CGCATCAAAG GGATTAGAAC CTGATAGCCA 5760 
TAAACGATTA TCAACCATTC TTGAAGAAGA AATTCCTGAA CATCTCCGTA GTGATATCGT 5820 
CGTTGTTTCA GGGCCTAGTC ATGCAGAAGA GACCATTGTG CGTGACCTAA CTTTAATAAC 5880 
TGCTGCTTCT AAAGATTTAC AAACAGCTCA ATACGTTCAG AAGCTATTTA GTAATCACTA 5940 
CTTCCGACTT TATACCAATA CGGATGTTAT CGGGGTTGAA ACTGCTGGTG CTCTTAAAAA 6000 
TATTATTGCT GTCGGTGCTG GAGCTTTACA TGGTCTTGGA TTTGGTGATA ATGCTAAGGC 6060 
AGCCATCATC GCTCGAGGTT TAGCAGAAAT CACCCGCCTA GGGGTAGCAC TCGGGGCCAG 6120 
TCCATTGACC TATAGCGGCT TATCTGGTGT GGGAGATTTG ATCGTAACGG GAACTTCCAT 6180 
CCACTCTCGT AACTGGAGAG CTGGAGATGC TCTCGGACGA GGAGAATCCC TAGCTGATAT 6240 
AGAAGCTAAT ATGGGCATGG TAATCGAAGG AATTTCAACG ACTCGAGCAG CCTATGAACT 6300 
AGCCCAAGAA CTTGGAGTCT ATATGCCCAT TACACAGGCT ATTTACCAAG TTATTTATCA 6360

CGGAACCAAT ATCAAAGATG CCATTTATGA CATCATGAAC AATGAATTTA AAGCAGAAAA 6420 

TGAGTGGTCT TAACCCTCTA TAGAAAGGAT TTTTATGACA TCAAAAGTTA GAAAGGCAGT 6480 

CATCCCTGCT GCTGGACTAG GAACTCGATT TTTACCAGCA ACCAAGGCCC TTGCCAAAGA 6540 

AATGTTGCCA ATCGTAGACA AACCAACTAT CCAGTTTATC GTGGAAGAAG CTCTCAAATC 6600 

AGGTATTGAA GATATTCTAG TTGTCACTGG TAAATCAAAA CGTTCTATTG AGGACCACTT 6660 

TGATTCAAAC TTCGAATTGG AATATAACCT CAAAGAAAAA GGGAAAACAG ATCTTTTGAA 6720 

GCTAGTTGAT AAAACAACTG ACATGCGTCT GCATTTTATC CGCCAAACTC ATCCACGCGG 6780 

TCTCGGAGAT GCTGTTTTGC AAGCCAAGGC TTTCGTCGGA AATGAACCTT TTGTCGTTAT 6840 

GCTTGGTGAT GACTTGATGG ATATCACAGA CGAAAAGGCT GTTCCACTTA CCAAACAACT 6900 

CATGGATGAC TACGAGCGTA CCCACGCGTC TACTATCGCT GTCATGCCAG TCCCTCATGA 6960 

CGAAGTATCT GCTTACGGGG TTATTGCTCC GCAAGGCGAA GGAAAAGATG GTCTTTACAG 7020 

TGTTGAAACC TTTGTTGAAA AACCAGCTCC AGAGGACGCT CCTAGCGACC TTGCTATTAT 7080 

CGGACGCTAC CTCCTCACGC CTGAAATTTT TGAGATTCTC GAAAAGCAAG CTCCAGGTGC 7140 

AGGAAATGAA ATTCAGCTGA CAGATGCAAT CGACACCCTC AATAAAACAC AACGTGTATT 7200 

TGCTCGTGAG TTCAAAGGGG CTCGTTACGA TGTCGGAGAC AAGTTTGGCT TCATGAAAAC 7260

ATCCATCGAC TACGCCCTCA AACACCCACA AGTCAAAGAT GATTTGAAGA ATTACCTCAT 7320 
CCAACTTGGA AAAGAATTGA CTGAGAAGGA ATAACAAAAT CATTTATATA AAGATTAGCC 7380
ACACATAAAT TAAGTAAATT CTCTACTTGA ATCTACCTAT TTAATAAAAA CTAATGAAAA 7440 
CGCTATACTT GTATTTGTTT TTTCATTAAA ATAAGAGTAG AATAAATTAG TATAGTAAAA 7500 
CAAAAAAGCA CCGAATCGGT GCGCACTTTT TCAAGTTGTG TACGGACAAA GCCTTATTTT 7560

AACTTTGCTA TGTTGTTTCT AATGGTTCCA AAATAATAAA TAATTTTAAA TTTGACTTAA 7620

CTGTTGGAGT AGTCATGGTT AAATTAAATC AACCGAGCCG AACATAAGTT GTTTAATTTT 7680 
GTGGAAGCTA TTAATAAAAA TATAATAAGG GAGAAAGATA GGTGTAATTT TAATTTTAAA 7740 

GTAATTGCGG ACACTATCAA AGAAAAAGAT TATGGAGAAC AAATTTGTAG AATTTATCGA 7800 

AAACAATAAA AAAGTAATCA TTTCATCAGT TGCAGTTGGT GTTGTATTGG TATTAGGGTT 7860 

TGGATGGTAT TCATATAACC AACAACAAGC AGAACAACAA GCAAAAATTG TACAATTAGA 7920 

AAAAGATAGC AAATCAGACA AAGAACAAGT TGATAAACTA TTTGAATCAT TTGATGCATC 7980 

TTCAGATGAA TCTATTTCTA AATTAAAAGA ACTATCTGAA ACTTCACTTA AAACCGATGC 8040 

AGGTAAAGAC TATCTTAATA ACAAAGTCAA AGAATCATCT AAAGCAATTG TAGATTTTCA 8100 

TTTGCAAAAA GGTTTGGCTT ATGATGTTAA AGATTCAGAT GACAAATTTA AAGATAAAGC 8160 

AACTCTTGAA ACAAATGTAA AAGAAATTAC AAAACAAATT GATTTTATCA AAAAAGTTGA 8220 

TGAAACTTTT AAACAAGAGA ATTTGGAAGA AACTCTTAAA TCTCTAAATG ATCTTGTTGA 8280 

TAAATATCAA AAACAAATCG AAOTTTTGAA GAAACV^GAA GAAAAAGCTG CtGaaAAAGC 8340 

TGCTGAAAAA GCAAAGGAAT CTTCTAGTCA AAGTAATTCT TCTGGTAGTG CTTCTAATGA 8400 

GTCTTATAAT GGATCTTCCA ATTCAAATGT AGATTATAGT TCATCTGAAC AAACTAATGG 8460

ATATTCAAAT AATTATGGCG GTCAAGATTA TTCTGGTTCA GGAGATAGTT CAACAAATGG 8520 

TGGATCATCA GAACAATATT CATCTAGCAA TTCAAACAGC GGAGCAAATA ATGTCTACAG 8580 

ATATAAAGGC ACTGGTGCTG ACGGCTATCA AAGATACTAC TACAAAGATC ATAATAATGG 8640

AGATGTGTAT GATGACGATG GAAATTACCT TGGGAACTTT GGTGGCGGCA TTGCAGAACC 8700 

TAGTCAACGC TAATAACTAT TTTAGAGCTG TGTTGTTTCG AATGGTTCCA AAACACATTA 8760 

AAAGCTACTC ATTTTTTAAG TAGCTTTTTT CTTATTCAAG TTTACATATT ATACTCAATG 8820 

AAAATCAAAT TCAAACCACG TCAGCATCGC CTTACCGTAG GTATGGTTAC TGACTTCGTC 8880 

AGTTTCATCT ACAACCTCAA AACCATGTTT TGAGCTGACT TCGTCAGTTC TATCTACAAC 8940 

CTCAAAGCAG TGCTTTGAGC AACCTGCGGC TAGCTTCCTA GTTTGCTCTT TGATTTTCAT 9000 

TGAGTATTAG TCGTCACAAT CCCATTCCCT TGTAGAAAAG CAAAATGGCG AGTCCTACGA 9060 

ACAAGACTAC CGCTCCTAAT CTCTGGCTGG TGTTATACAT CCGTTTTTCT CCTCTAACTG 9120 
GAAAGATAAC TGCTAGAAAT GCGCCACCAA CTGCACCACC GATATGGCCT GCTAGGCTGA 9180

TTCCTGGAAT CAGAACACTT CCAATAATGT TAACCACAAA AAGTGTCAGA TAGGATTGCC 9240 

CTAGCTGTTG GATATAAGGA TTGCGAGTTG CATAGCGAAG AACAATAATC GCGGCAAATA 9300 

GCCCATAAAG AGAGGTAGAG GCGCCTGCTG CTAAGGATTT AGGACTAAAT ACAAAAACAA 9360

AGAGATTGCC CATCATTCCT GATAAAAGAT AGAGAAAGAA AAACTGCTTA GAACCGAAAA 9420 

TCTCCTCTAC CTGCCTTCCA AGATAATAAA GTGAAAGCAT ATTAACAATG AAATGTTCCC 9480 

ACCCAATATG AACAAAAATG GCAGACAAGA GACGCCAAAC CTGCTCGGGA AAGAGGCGAA 9540

TAGCTGGCCC ATACATGGCT CCAAATCGAA ATAATGTATC TGCCCTGTCA AAGTTTCCGC 9600 

CTGCAGTGAC CAACATTAGT AAAAATACCA AGGCCGTCAC TAAGAGGAAG AAACTCGTCA 9660

CAGGGTAACG TCTATCAAAG ATTTCCTTCA TCAATTAATA CCTCCTGAAC AGGAATATCA 9720 

TGGTTTTCAG GTATAAAGTC CTGAATTTGA CAAGGATATA TCGTACTCAA AGTACGACCA 9780 

GAAAAATGTT CCAGATAGCG GTCATAATAG CCTCCACCGT ATCCTATCCG ATATCCTTTC 9840

GTCGTAAAAG CCAGACCAGG AACATGAATC AAATCAATCT GAGATGCATC CACCACTTCC 9900 

AAATCTCCCT GTAGCTCCAG TAAGGCAAAG AAAGTTTTTA CCAACTGTTG CGGATCATAG 9960 

ACCACAAAGT CCATGCGCCC CTTGGGATAA GTTTTGGGTA TTAAAACCTT CTTGCCGTCC 10020 

TTCAGCGCCT GCTCAATCAG TTCCTGCGTT TGAAACTCAT GAGAAAAAGA GAGGTAGGTT 10080 

GCGATGACCT TGGCTTCTTG ATAAAAGGGG TGTTGTAAAA GCCGCTCGGT TAAAGCTTGG 10140 

TCTATAGCCT GTTTTTGCTC TTGAGATATA GCCTTCATTT CATGCAAGAC TTGCTTGCGT 10200 

AATTCCGATT TCATAGACAA GCCCTCTATT CTGCTGCCTT CTTTTTCAGG AAACTAGACA 10260 

CCGCAGCCAC CCCAATAGCT AAGACTTCTT CCTTAGGACT CATTTGAGGG TGATGAAGAG 10320 

CGTAGGGACT ATCGATACCT AGCCAAAACA TCACGCCATC AACCTTTGAA AGGAGATAAC 10380 

CAAAGTCCTC GCCTGTCATA GCAGGTTCGA TATCAATCAA CTCGATTCCG TCTTTTTCGT 10440 

CAAAGAAGTC CATCAGTTCA CGCGCCAAGG CTGGATTGTT CTCAACAGGT AGGTATCCAC 10500 

CTTGTTTGAG TTCCACTTCG ACTTCCATAT CAAAGGCAGC TGCAACCCCT TCTGCAACTG 10560 

TTTTTACCCT CTTTTGCACC AAGAGACTCA TGTCCTGTGT CAAGGCACGA ATAGTTCCAT 10620 

GTAAAAAAGC TGTGTCTGTG ATGACATTGT TGGTGGTTCC AGCTTGAAAA ACGCCGAAGG 10680 

TCACCACTGC TCCCTCGATT GGGTTGACAT TGCGGCTAAC AACTGACTGC ACTTGGGTCA 10740 

CAAAGTAACT AGCCGCCACC AAGGCGTCAT TGGCTTCATG AGGAAAAGCT GCGTGGCCAC 10800 

CTTTGCCTTT GAAACGGATC TTCACCTCGC AAGTTCCTGC AAAGAGTGTA TGAGTATTAG 10860 
TCGCAATCTG GCCGACTTTC AAATCTGGAC GAACATGGAG ACCATAGAAT TGATCTGGCA 10920
ACCAATCTCC AAAAGCACCG TCCTCATACA TGAGCATACC ACCAGCTTCA TTTTCTTCAG 10980
CAGGCTGAAA TAGAAAGAGC AGATTATTCT TGGGTTGCTC CTCAAGGGCG CGCTCAAGAC 11040
AGCCTAAGGC AATGGTCATA TGAAAATCAT GGACACAGGC ATGCATGCGA CCTTGGTGTT 11100
GAGAAGCAAA AGGTAGACCT GTTTGTTCGA CGATAGGCAG GCCATCAATA TCTGTCCGCC 11160
AACCAATGGT TCGCTCCGGC TGACTTCCCT GCAGGTAGAC CAAAATCCCT GTCCGCCAAG 11220
TACGAATTTG AACAAAATCC TTGCCCGTAG TCAATTTCTC AATCACATCC AGCAAATAAG 11280
CCTGAGTCTT GAACTCCTCC AAGCCAATCT CTGGAATCTG GTGTAAATCT CGTCTAGTCT 11340
GAATCAAATC TAACATCTAT CTGTCCTCCG ATATAGCAGA AAGAGGCTGG AAAAAGGGTT 11400
CCGCCTCTTT TTTACTTTTA CAATTACAAG GTACGAAGCG CATCCTCTAG CGCTGTTTTT 11460
TGTTGAGTTT GGGCATCAAT TTCTTTGATA ATACGAGCTG GAACACCTGC TACTACCACG 11520
TTTTCTGGGA CATCTTGGGT AACAATAGCT CCTGCTGCGA CAACTGAACC ACTACCGATT 11580
TGGACTCCTT CGATAACCAC TGCATTAGCA CCGATAAGAA CATTGTCTCC GACACGGACT 11640
GGTTCAGCAC TAGCTGGCTC AATCACACCT GCCAAAACTG CACCTGCACC AACGTGGCTA 11700
TTTTTTCCAA CGATGGCACG GCCACCAAGG ATGGCACCCA TGTCAATCAT GGTTCCAGCA 11760
CCGATTTCAG CACCGATATT GATAACAGAT CCCATCATGA TAACAGCATT GTCACCAATT 11820
TCCACCTGGT CACGGATAAT CGCACCTGGr TCGATACGAG CGTTGATAGC ACGCT'l-Al'C'I' 11880
AGCAAAGGAA CTGCAGAATT ACGAGCATCT TGCTCGACAA CATAATCTTG ATTTTCTACC 11940
AAACCTTCAA GAAGCGGAGC CACATCCTTC CAGTCTCCGA ATAGGACATT TCCTAGTTTG 12000
ACAACAGAGC TAGGCACAGC AGTTGCGAGT TGCCCCTCAA AGGTTACTTT GACACTGGTT 12060
TTCTTTTCAG CATTGGCGAT AAATTGGATA ATTTCTTGAG CGTTCATTTT TGTAGCAGTC 12120
ATAGGTG 12127


(2) INFORMATION FOR SEQ ID NO: 149:









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12566 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 149: 
CCATCCTTCT GTTGATGTGA CAGGAATGAT GATAAATCAA CCAGTAGCTA GTCGCGAAGA 60
GGTGACAGAG GCTTTGAGTC ACTTGGCGGT AGAGCACAAT AGTCTCATTG CTCGTCGAAT 120
CGTTGAGCCA AATGAAGCTG GAGAAACACG CTTTACCTAT GCCACTTATG GTGAGGGAAA 180
GCTTCCAGAA GGTCTGACCA TTTCCTCCAA GGAGAGTGCA GAAACGAGTG ATTTATTAGG 240
GTCTTACTTG ATTGTATCAG GAAGTTTGGA TGGAGTGAGC TTACAGACCA CCTTGAAAGA 300
GCTTGGTTAT CAAGGCTTTG TTTCGAATGG AGAAGATCCA TTTTCGATAG TCTTACTATT 360
GACGGCCACC CCTATGGTGC TACTGAGTTT AGCTATTTTT CTGCTGACCT TTATGAGTCT 420
GACCCTGATT TATCGGATCA AATCCCTTCG TCAGGCAGGG ATTCGCTTAA TAGCTGGTGA 480
GAGCTTGTTT GGAGTTGCTC TCAGACCAGT GTTAGAAGAT GTGAGACAGC TTATCTGCTC 540
AGTGCTGGTA TCCAGTCTTT TGGGATTGGG GATTCTCTGG TATCAAGGTG CCTTGTTTAT 600
GGCAACGGTG CAACTGGTCA TCATTGCTCT TCTACTTTAT GGATTGACCT TGGCAGGGAT 660
TTCTACCTTA CTAAGTGTCG TCTATCTACT TGGTTTACAG GAAAATAGTC TGGTGGATCT 720
ATTGAAAGGG AAACTCCCTC TCAAACGTAT GATGACATTG ATGATGGTGG GGCAACTCTT 780
AGCTGTATTG GTGGTCGGAT CGAGTGCGAC AGCTCTCCTA CCCCACTACC GTGAAATGCA 840
GGAAATGGAG AGAGCTAGCA ATAAATGGAG CCAGTCCTCA GACCGTTACC GTCTATCCTT 900
TGGTTGGTCT AGTGCATTTG CCGATGAAGA AGGAACGCGT AAGGATAATC GTGAGTGGCA 960
GACATTTACT GAAGAACGGT TAGCCAATAC AGACTCTTTT TATATTATGA GCAATGTTGA 1020
CAATTTCTCA GATGGAGCAG AAGTGGACCT AGATGGCAAT CGTCTCAGTG ACTACACACC 1080
GTCAGGGAAT GTTATCTATG TCTCACCGCG CTATCTGATA GAAGAAAAGA TTACCGTTTC 1140
TTCAGAGTTT ATGGACAAGA TGCAAAACTT GTCTGAGGGA GAGTTTGGGC TGATCTTGCC 1200
TGAGAGCTTG CGAGAGCAGT CTGTCTACTA CCAAGGATTG TTTACAGATT ACCTGCAAAA 1260
CTTTTCATCT GAAAGTGTAG AAGTGACGAG TCAGAAACAC TACCTCCCAC AGGTAAGGCT 1320
AGCTTTTACA GAAACAGGAC AGGAACGTTT CCTCTATAAT GATGGGTACA AGACAACACG 1380
CCAGTACCTA AAAGATCCGA TTATTGTAGT TCTAACGCCG CAAGCGACTG GAACAAGACC 1440
TGTTGCAGGG ATGTTGTGGG GAACTACGGC TAATAGTGCC TTGAAACTAG ATCGATATGG 1500
AGACAGCATC ACAGCTCTAA AAGAGAAAGG TCTGTATCAC AAGGTTTCTT ACTTGGTAAA 1560
AAGCCAGCTA TTTTTTGCCA AGGTACTAAA TGACAAACGG GTGGAGTTTT ACTCTCTCCT 1620
TATTGGGACG ATTTTGACCC TGTCTACGGC TATCTTGTTA TTTGATTCCA TGAATCTTCT 1680
CTATTTTGAG CAGTTCAGAC GGGAACTTAT GATTAAACGT CTTGCTGGTA TGACAATCTA 1740
TGAGCTTCAT GGCAAGTATT TACTGGCGCA AGGAGGAGTT CTCTTGCTTG GCCTAGTCCT 1800
ATCTAGTATT TTGACAAGAG ATGGTTTGAT TAGCGCTCTA GTTGTAGCTT TGTTTACGCT 1860
TAACGCCCTC TTGATTTTAG TAAGGCAGGA CAAAAAAGAA GAAGCTGGTA GCATGGCAGT 1920
ATTGAAAGGA AAATAAGATG ATTGATATTC AAGGATTGGA AAAGAAATTT AATGACCGCG 1980 
CGATTTTCTC TGGTTTGAAT CTCAAGCTGG AGAAGGGCAA GGTTTATGCC TTAATCGGAA 2040 
AGAGTGGAAG CGGAAAGACG ACGCTGCTGA ATATCTTGGG AAAGCTAGAA AAGATAGATG 2100 
GTGGAAGGGT TCTCTATCAG GGGAAAGATT TAAAAACCAT TCCCACTCGT GAGTATTTTC 2160 
GAGACCAGAT GGGCTATCTC TTTCAAAATT TCGGCCTCTT AGAAAACCAA TCAATCAAAG 2220 
AAAATTTGGA TTTGGGTTTT GTTGGTCAGA AAATCTCAAA AGTAGAACGT TTGGAAAGGC 2280 

AAGTGGGGGC TTTAGAAAAA GTTAATCTAG GGTATTTGGA TTTAGAACAA AAAATCTATA 2340 

CTTTATCTGG GGGAGAGGCC CAACGAGTTG CCCTTGCTAA GACTATTTTG AAAAATCCAC 2400 

CCTTGATTTT GGCAGATGAA CCAACAGCAG CTCTTGATCC TGAAAATTCA GAGGAGGTTA 2460 

TGAATCTCTT GGTGGATTTG AAAGATGAAA ATCGAATTAT CATCATTGCG ACCCATAATC 2520 

CCCTAGTCTG GAATAAGGCT GATGAAATCA TTGATATGAG GAAACTTGCT CATGTGTGAA 2580 

AAAATCCGTA TTCGCAGGGT ATCTGATTAT CCTAGTGCCA GAGGTGGTTT AGAAGATATC 2640 

CTCATCATGG AAAATATGAC CAATCATCTC CTTTTGGTTC AAATCCGAGT GCATGGCTAT 2700 

TTGCTTGATT TTGCTAGTAT TGAAGGGCAA AGGCAAAAGC ATTATCGTTT GAAAAATTTA 2760

CCTCAGACGG TTGAACTGAC AGTGGATGAT GTGGAGGAGG ATGTGGATTT GACCCTACCT 2820

GAAAATCGAA GTTATCAAGA AGCTGATTTT TTTGAArGCA TGTTTCGACA GAACTGCT/iA 2880

GGCCACTTTT AAAGATTTCC AAGACTATCT TTCTTCATGA GGAAAGATAG TTTTTTGGTA 2940

TGATTTTCAT TCCCAAAATA CAAGGGGAAT GTGTTACAAT AGTAGTAACA GATAATAGAA 3000 

AAGAGAATAG ATGAGAATTG CAGATTATAG CGTGACCAAG GCAGTGCTGG AGCGTCACGG 3060 

TTTTACCTTT AAAAAGTCCT TTGGGCAAAA TTTTTTGACG GATACCAATA TCCTTCAAAA 3120 

AATTGTGGAT ACGGCTGAAA TTGATGATCA GGTCAATGTC ATCGAAATCG GGCCAGGTAT 3180 

TGGTGCCTTG ACAGAATTTT TGGCTGAGCG TGCAGCCCAA GTCATGGCTT TTGAGATTGA 3240 

CCACCGTTTG GTGCCAATTT TGGCAGATAC CCTGCGTGAT TTTGATAATG TGACCGTAGT 3300 

TAACGAAGAT ATTCTCAAGG TTGATTTGGC GCAACATATC CAGAATTTTA AAAATCCTGA 3360 

CCTGCCAATC AAGGTAGTGG CTAATTTGCC TTACTACATC ACGACGCCTA TTCTCATGCA 3420 

CTTGATTGAG AGTGGCATTC CTTTTTGTGA GTTTGTGGTC ATGATGCAGA AAGAAGTAGC 3480 

GGACCGCATT TCAGCCCAGC CTAACACCAA GGCTTACGGT AGCTTGTCTA TCGCCGTGCA 3540 

GTATTACATG ACAGCCAAGG TTGCCTTTAT CGTGCCTCGT ACGGTCTTTG TGCCAGCGCC 3600 

AAATGTGGAT TCAGCCATCT TGAAAATGGT GCGTCGTCCA GAGCCAGCCG TAGCAGTAGA 3660
AGATGAGAAC TTTTTCTTTA AGGTTTCCAA GGCTAGTTTT ACCCATCGCC GCAAGACCTT 3720
GTGGAATAAC TTGACAGGTT ACTTTGGTAA GACTGAAGAG GTCAAGGACA AGCTGACCAA 3780 
GGCTTTGGAC CAGGCAGGCT TGTCACCAAG TGTGCGTGGG GAAGCTCTCA GCTTGGCAGA 3840 
ATTTGCCGGT CTAGCAGACG CACTTAAAGG GCAAGGACTC TAAGATGCAG GGACAAATCA 3900

TTAAAGCCTT GGCAGGTTTC TACTATGTGG AGAGTGATGG CCAGGTTTAT CAAACACGCG 3960 

CGCGTGGGAA TTTCCGTAAA AAAGGCCATA CCCCTTATGT TGGGGACTGG GTAGATTTCT 4020 

CTGCCGAGGA AAATTCAGAA GGCTATATCC TCAAAATTCA CGAACGGAAA AACAGTCTGG 4080

TTCGTCCGCC TATTGTCAAT ATCGATCAAG CTGTAGTAAT CATGTCCGTC AAGGAACCTG 4140 

ATTTTAACAG CAATTTGCTG GATCGTTTCT TGGTTCTTTT GGAGCACAAG GGCATCCATC 4200

CCATTGTCTA TATTTCCAAA ATGGATTTGT TGGAAGATAG GGGAGAACTG GATTTTTACC 4260

AGCAGACCTA TGGTGACATC GGCTATGACT TTGTGACCAG TAAAGAGGAA CTCCTGTCTT 4320

TGTTAACAGG CAAGGTTACG GTCTTTATGG GGCAGACAGG TGTTGGGAAG TCAACTCTTC 4380 

TCAATAAAAT CGCACCAGAC CTCAATCTTG AAACGGGAGA AATTTCAGAC AGTCTAGGTC 4440 

GCGGTCGCCA TACCACTCGA GCTGTTAGTT TTTACAATCT CAACGGGGGT AAAATCGCAG 4500 

ATACACCAGG ATTTTCATCC TTGGACTATG AAGTATCAAG GGCTGAAGAC CTCAATCAGG 4560 

CTTTCCCAGA GATTGCTACT GTTAGCCGAG ATTGTAAGTT CCGTACTTGT ACCCATACCC 4620

ATGAGCCGTC TTGTGCCGTC AAACCAGCTG TTGAAGAGGG TGTTATTGCA ACCTTCCGTT 4680 

TTGACAATTA CCTGCAATTC CTTAGTGAAA TTGAAAATCG TAGAGAAACC TATAAAAAAG 4740 

TCAGCAAAAA AATTCCAAAA TAAGGAGAAA CCTATGTCTC AATACAAGAT TGCTCCGTCA 4800 

ATTCTGGCAG CAGATTATGC CAACTTTGAA CGTGAAATCA AACGTCTAGA AGCAACTGGG 4860 

GCAGAATATG CCCATATCGA TATCATGGAC AGTCATTTTG TACCGCAAAT CAGTTTTGGT 4920

GCAGGTGTGG TCGAGAGCCT TCGTCCTCAT AGTAAGATGG TTTTCGATTG CCACTTGATG 4980

GTGTCAAACC CTGAGCATCA TCTGGAAGAT TTTGCGCGTG CAGGTGCAGA CATCATCAGT 5040 

ATCCATGTAG AAGCAACGCC TCATATTCAT GGCGCCCTCC AAAAAATTCG TTCACTCGGA 5100 

GTTAAGCCTT CAGTCGTTAT CAATCCTGGC ACATCAGTTG AAGCCATCAA GCACGTCCTT 5160 

CATCTAGTTG ACCAAGTTTT AGTCATGACG GTTAATCCAG GTTTTGGTGG GCAAGCCTTT 5220 

CTGCCAGAAA CCATGGATAA GGTCCGTGAG TTGGTTGCTC TTCGTGAGGA AAAAGGTTTG 5280

AACTTTGAAA TCGAAGTGGA TGGTGGGATT GATGACCAAA CTATTGCTCA AGCCAAAGAA 5340 

GCCGGTGCGA CTGTTTTTGT AGCAGGTTCC TATGTCTTTA AGGGAGAAGT CAATGAGCGA 5400 



WO 98/18931

PCT/US97/19588

GTACAAACTC TCAGAAAACA ACTGGACTAG GGTTGCAGTT TTTGCAGGCG GAAACCGCGG 5460 
TCATTATCGG ACAGATTTTG ATGCTTTTGT TGGGGTGGAT CGAGGCTCGC TCTGGGTCTT 5520 
GGAAGAAGAC TTACCTCTTG CTCTAGCAGT CGGAGATTTT GATTCTGTGA CGGAAGAAGA 5580 
GCGACAGGTG ATTCAAAAAG GTGCCCAGTA TTTTGTCCAA GCACGACCAG AAAAGGATGA 5640 
TACAGATCTG GAATTGGCTC TCTTAACCAT CTTTGAACAA AATCCTCAGG CTCAGGTCAC 5700 
TATTTTCGGT GCCTTGGGTG GCCGTATTGA CCATATGTTG GCCAATGTCT TTCTGCCTAG 5760 
CAATCCTAAG TTGGCACCCT ATATGCATCA AATAGAAATT GAGGATGGGC AAAACTTGAT 5820 
TACTTATTGT CCAGAAGGAA TCAGTCAGCT AGAACCTCGT TCAGACTACG ACTATCTAGC 5880 
CTTTATGCCA GTTCGGGATA GCCAGCTGAC TATTCTTGGA GCCAAGTATG AGTTGACAGA 5940 
GGAAAATTTT TTCTTTAAAA AAGTGTACGC TTCTAACGAA TATATAGATA GGGAAGTGTC 6000
GGTAACTTGC CCAGATGGTT ATGTGGTCGT ACTGCATAGC AAGGACAGGA GGTAGGATGG 6060 
AAAGTTTACT TATTCTATTA TTAATTGCCA ATCTAGCTGG TCTCTTTCTG ATTTGGCAAA 6120 

GGCAGGATAG GCAGGAGAAA CACTTAAGTA AGAGCTTGGA GGATCAGGCA GATCATTTGT 6180 

CAGACCAGTT GGATTACCGC TTTGACCAAG CCAGACAAGC CAGCCAGTTA GACCAAAAAG 6240 

ATTTGGAAGT GGTTGTCAGC GACCGTTTGC AAGAAGTGCG GATTGAATTG CACCAAGGTC 6300 

TGACCCAAGT CCGTCAAGAA ATGACAGATA ATCTCCTCCA AACTAGAGAC AAGACAGACC 6360 

AACGTCTCCA AGCCTTGCAG GAATCAAATG AGCAACC-TTT CCAACAAATG CGCCAGACGG 6420

TCGAGGAAAA ACTAGAAAAG ACCTTGCAGA CACGCTTACA GGCTTCCTTT GAGACAGTTT 6480 

CTAAACAACT GGAGTCTGTC AATCGTGGCC TTGGAGAAAT GCAGACAGTT GCCCGTGATG 6540 

TCGGAGCTCT TAACAAGGTT CTCTCTGGAA CCAAGACGCG AGGGATTCTG GGAGAATTGC 6600 

AACTGGGGCA AATTATTGAA GACATCATGA CACCTGCCCA GTACGAACGA GAATACGCAA 6660 

CGGTTGAA7VA CTCTAGTGAA CGAGTGGAGT ATGCCATCAA GTTACCCGGA CAAGGCGACC 6720 

AAGAATACGT CTATCTGCCA ATTGACTCTA AGTTTCCACT GGCAGATTAT TACCGCTTtiG 6780 

AAGAAGCCTA TGAGACAGGT GACAAGGATG AGATTGAACG CTGTCGTAAG TCACTCCTAG 6840 

CAAGCGTCAA GCGCTTTGCT AGGGATATTA GGAACAAGTA CATAGCACCA CCTCGGACGA 6900 

CCAATTTTGG AGTTTTGTTT GTTCCGACAG AAGGTCTCTA CTCAGAAATC GTCCGCAATC 6960 

CGGTCTTCTT TGATGATTTG AGACGGGAAG AACAGATTAT TGTTGCAGGA CCAAGTACCC 7020 

TATCAGCCCT TCTTAACTCC CTATCAGTTG GTTTCAAGAC CCTTAATATC CAAAAGAGTG 7080 

CCGACCATAT CAGCAAGACT CTTGCCAGTG TCAAGACCGA GTTTGGCAAG TTTGGTGGTA 7140 

TTCTGGTCAA GGCACAAAAA CATCTCCAAC ATGCCTCTGG CAATATTGAT GAATTATTAA 7200 




ACCGTCGTAC CATAGCTATC GAGCGGACGC TCCGTCACAT TGAGTTGTCA GAAGGTGAGC 7260 
CTGCGCTTGA TCTACTCCAT TTTCAAGAAA ATGAGGAAGA ATATGAAGAT TAGTCACATG 7320 
AAAAAAGATG AGTTATTTGA AGGCTTTTAC CTAATCAAAT CAGCTGACCT GAGGCAAACT 7380

CGAGCTGGGA AAAACTACCT AGCCTTTACC TTCCAAGATG ATAGTGGCGA GATTGATGGG 7440

AAGCTCTGGG ATGCCCAACC TCATAACATT GAGGCCTTTA CCGCAGGTAA GGTTGTCCAC 7500 
ATGAAAGGAC GCCGAGAAGT TTATAACAAT ACCCCTCAAG TCAATCAAAT TACTCTCCGC 7560 
CTGCCTCAAG CTGGTGAACC CAATGACCCA GCTGATTTCA AGGTCAAGTC ACCAGTTGAT 7620 
GTCAAGGAAA TTCGTGACTA CATGTCGCAA ATGATTTTCA AAATTGAAAA TCCTGTCTGG 7680 
CAACGGATTG TCCGAAATCT CTACACCAAG TATGATAAGG AATTCTACTC CTATCCAGCT 7740 
GCCAAGACCA ACCACCATGC CTTTGAAACG GGCTTGGCCT ATCATACGGC GACCATGGTG 7800 
CGTTTGGCAG ACGCTATTAG CGAAGTTTAT CCTCAGCTCA ATAAGAGCCT GCTCTATGCG 7860 

GGGATTATGT TGCATGACTT AGCTAAGGTC ATCGAGTTGA CGGGGCCAGA CCAGACAGAG 7920 

TACACAGTGC GAGGTAATCT TCTTGGACAT ATCGCTCTCA TTGATAGCGA AATTACCAAG 7980 

ACAGTTATGG AACTCGGCAT CGATGATACC AAGGAAGAAG TCGTTTTGCT TCGTCATGTC 8040 

ATCCTCAGTC ACCACGGCTT GCTTGAGTAT GGAAGCCCAG TCCGTCCACG CATTATGGAA 8100 

GCAGAGATTA TCCATATGAT TGACAATCTG GATGCAAGCA TGATGATGAT GTCAACAGCT 8160 

CTTGCTTTGG TGGATAAAGG AGAGATGACC AATAAAATCT TCGCTATGGA TAATCGTTCC 8220 

TTCTATAAAC CAGATTTAGA TTAATAATTT AAGAAAAATG AGCATTTTTT AGGATAAGAA 8280

TGTTCGTTTT TTTATGTGAA TATGGTATAA TAAGTAAAAG ACAAAAATGA ATACTCTTCG 8340 

AAAATCTCTT CAAACTAGGG TAGTATCGCC TTGTCGTATG TATATATGCA GGTATATTAC 8400 

AGGGTTTGTC AGTTCTATTG ACAATCTCAA AACAGTGTTT TGAACCACCA GCGACCAGCT 8460 

TTCTAGTTTG CTTTTTGATT TTTTGAATAA AAATGGAATA GGAAATAGAA ATGAAATTAA 8520 

GAAGAAGTGA TCGGATGGTT GTCATTTCCA ACTATTTGAT TAATAATCCT TATAAACTAA 8580 

CTAGTCTCAA TACTTTTGCT GAAAAGTATG AGTCTGCTAA ATCATCCATC TCAGAAGATA 8640

TCGTCATTAT CAAACGCGCC TTTGAGGAAA TTGAAATCGG TCATATCCAG ACAGTGACTG 8700 

GGGCTGGCGG AGGTGTCATC TTCACACCGT CTATTTCGAG TCAGGATGCT AAGGAAATGG 8760 

TTGAAGACTT GCGTACCAAG TTGTCAGAAA GTGACCGTAT CTTGCCAGGT GGTTATATCT 8820 

ATCTGTCTGA TTTGCTTAGC ACACCAGCCA TCTTGAAAAA TATTGGTCGT ATTATTGCCA 8880 

AAAGCTTTAT GGACCAAAAA ATTGACGCGG TTATGACCGT AGCAACTAAG GGTGTGCCAC 8940 
TTGCAAATGC AGTTGCCAAT GTCCTCAATG TCTCTTTTGT CATTGTGCGC CGTGACCTGA 9000

AAATTACCGA AGGTTCAACT GTTAGCGTCA ACTATGTTTC AGGTTCAAGT GGTGACCGTA 9060 

TCGAGAAAAT GTTCCTTTCA AAACGTAGTC TTAAGGCAGG CAGCCGTGTC TTGATTGTGG 9120 

ATGACTTCTT GAAAGGTGGC GGAACGGTCA ATGGTATGAT TAGTCTCTTG CGCGAGTTCG 9180 

ACTCAGAACT GGCAGGTGTA GCGGTCTTTG CGGACAATGC CCAAGAAGAA CGTGAAAAGC 9240 

AGTTTGACTA CAAGTCACTC TTGAAGGTAA CCAATATTGA TGTCAAGAAC CAAGCCATCG 9300 

ATGTTGAGGT TGGCAATATC TTTGACGAAG ATAAATAAGA GATAGAACTA AAGGTTGGAA 9360 

CGATTGTCCC AGCCTTTCTT TGCAAACAGA ATAGAAGGAA GCTTATGAAA ACACCATTTA 9420 

TCAATAGAGA AGAGTTAGAA GCGATTGTTG CCGAGTTCCC GACTCCCTTT CACTTGTATG 9480 

ATGAGAAGGG GATTCGTGAG AAGGCAAGAG CCGTCAACCA AGCTTTTTCG TGGAACAAGG 9540 

GCTTTAAGGA ATATTTTGCA GTTAAGGCTA CTCCAACTCC AGCTATTTTG AAAATTCTCC 9600 

AAGAAGAAGG TTGTGGTGTG GACTGCTCTA GTTATGTAGA GCTTTTGATG AGCCATAAAC 9660 

TGGACTTTCT GGGTTCTGAG ATTATGTTCT CTTCCAACAA CACGCCAGAC AAGGAATACG 9720 

CCTATGCACG TGAATTGGGT GCGACCATTA ACTTGGATGC CTTTGAAGAT ATTGAACATC 9780 

TGGAGAGAGT AGCAGGCATT CCAGAAATCA TCTCTTGTCG TTATAATCCT GGAGGCGTTT 9840 

TTGAACTGGG GACAGACATT ATGGACAATC CTGGGGAGGC TAAGTTTGGC ATGACCAAGG 9900 

ACCAGCTCTT TG-AAGCCTTT GCTATCTTGA AGGAAAAAGG AyeCAAGACT TTTGGGATTC 9960 

ACTCCTTCCT AGCGTCCAAT ACCGTGACCC ATCTCTATTA TCCAGAGTTG GCTCGTCAGC 10020 

TCTTTGAACT GGCTGTTGAA ATCAAGGAAA AGTTGGGCAT TTCGCTAGAC TTTATCAATC 10080 

TTTCTGGCGG TATTGGTGTT AATTATCATC CAGACCAGGA GCCGAACGAT ATCGCCTTGA 10140 

TTGGTGAGGG AGTTCGTAAG GTGTATGAAG AGGTTCTTAC GTCAGCAGGT CTTGGTCAGG 10200 

TCAAGATTTT CACCGAATTG GGTCGTTTTA TGCTGGCACC TCACGGTGCT CTAGTCACAA 10260

GAGTCACTCA TAAGAAGGAA ACCTACCGTA CCTATCTAGG TGTGGATGCC TCAGCAGTCA 10320 

ACCTCATGCG TCCAGCTATG TACGGAGCTT ACCATCATAT TAGCAACGTG ACCCATCCAG 10380 

ATGGACCAGC TGAAGTGGTA GATGTGGTCG GTTCACTCTG TGAAAACAAT GATAAATTTG 10440

CAGTTAATCG CGAACTGCCT CATACAGAAA TCGGTGATTT GCTGGTCATT CATGATACAG 10500 

GTGCCCACGG ATTTTCAATG GGCTACCAGT ATAATGCCAA ATTACGTTCT GCGGAAATCC 10560 

TCTATACCGA AGAAGGTAAA GCCCGTCAAA TCCGCCGTGC AGAGCGCCCT GAGGACTATT 10620 

TTGCAACCTT ATATGGCTTC GATTTTGAAG AATAATCTGA TAATAGATTG AAAATGAAAT 10680 

TGAAAAACAG ATTGCTTTCT AAAAAATAGG CAAAAATCTT GTTTTTCCTT CAAGTCGTGA 10740 
TATAATAAAA CTATAAAACG TTTTCAAGGA AGGTAACGAT ATGTCTGAAG AAACAATTGA 10800
TTATGGACAA GTGACAGGAA TGGTGCATTC GACAGAAAGC TTTGGGTCAG TAGATGGGCC 10860
TGGTATTCGC TTTATTGTCT TTTTGCAGGG CTGTCACATG CGTTGCCAGT ATTGCCACAA 10920 
CCCAGACACT TGGGCTATGG AGTCCAATAA GTCACGTGAA CGGACGGTAG ATGATGTCTT 10980 
GACAGAGGCC TTGCGCTACC GTGGTTTCTG GGGAAATAAG GGTGGGATTA CAGTCAGTGG 11040 
AGGAGAAGCT CTCTTGCAGA TTGATTTCCT GATTGCTCTC TTCACCAAGG CTAAGGAACA 11100 
AGGAATCCAC TGTACCTTGG ACACCTGTGC TCTTCCTTTC CGTAATAAAC CACGTTACCT 11160 
TGAGAAGTTT GACAAACTCA TGGCTGTCAC TGACTTGGTT CTTTTGGATA TCAAGGAAAT 11220 
CAACGAAGAA CAGCACAAGA TTGTCACTAG CCAAACCAAT AAAAATATCT TGGCTTGTGC 11280 
CCAGTATCTA TCAGATATTG GAAAACCTGT CTGGATTCGC CACGTGCTAG TTCCAGGATT 11340
GACAGACAGA GATGATGACT TGATTGAACT TGGTAAGTTC GTCAAGACCC TCAAAAATGT 11400 
TGATAAGTTT GAAATTCTAC CTTATCACAC CATGGGTGAG TTCAAGTGGC GTGAACTTGG 11460 
AATTCCATAT TCCCTCGAAG GAGTCAAACC ACCAACAGCA GATCGCGTCA AGAACGCTAA 11520 
ACAACTCATG GATACCGAAA GTTATCAAGA TTATATGAAA CGTGTACATG GATAGAAAAG 11580 

AAGCCTGATG GAAACATCGG GCTTTTGACT TGCAAAAAGA CTTAGCAAAT CAGCTAAGCC 11640 

TTTTTCTTCT TATCTCGAAC GTTGTTTTCC AGCGTTGCGA TTTTTGTGTT TTTTCTTGCT 11700 

TGTGATAGCA GTTGGTTGTT CAGGGGTAAC GTCTTTTCGT CCACTTGGTT TAGAGAAAGC 11760

ACTTGCTTTT GGTGGGTTCT TGGCTAGTTC TTCACGGACT TTTTTGCGAA GTTTTGGACG 11820 

AACGATATAG TTGACGATAA ACTGTTGGAG AATCATCATG AAACCACCGA CAACCCAGTA 11880 

AAGTGTGACA CTAGCTGGTG AGAAGAGGGA GAAGACGACG ATCATGAGTG GGCTCATGTA 11940 

AATCATTTTC TTGATTTGTT CTCTTTGCAT TTCATCTTCT ACTCCGTGAA GTGAAAGGAG 12000 

CGATTGAAGA TAGTAAAGGA CACCAGCACA GGCAACCAAA ATCATACTTG GAGAACCTAG 12060

AGGAATGCCT AGGTAGCTTG CTTGAGCAAC CCCTTCAGTA TGTTGGGCAG CAAAGTAGAT 12120 

AGCAGAGAAG AAAGGCATTT GAAGGAGGAT AGGGAAACAT CCTACACCGC CAAACATGCT 12180 

GATACCGTGC TCTTTTTGAG CAGCAAAGAG AGCTTGTTGG GCTTCGAGTT TTTCTTCTTG 12240 

AGTAGTCGCT TCTTTGAGAC GCGTTTGGTG TGGCTCAAGG ACGTGCTTGA GGGCGTTCAT 12300 

CTTTTCAGAG TGAAGCGTTG CCTTCCATGA TTGGTAGATA CCAAGTGGTA AGATAATCAA 12360

GCGTACGATA ATGGTTACGA TAATGATAGC GACACCAAAG CCTAGACCTT TATCAGTAGC 12420 

GAAGTACTTG ATGGCTTCAG CCATAGGCGC TCCGATCGTA TTCCAAATAA ATCCTGTTGG 12480 
CTGACCTGTG GrTTTATCGA CATTGACACA GCCAGTCAAG ACAAGCAACA TAGCCACTCC 12540
CATAGCCGAG AGTGCAAAAT CGGGGT 12566 
(2) INFORMATION FOR SEQ ID NO: 150: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5238 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150:
TGACACTCTG TAGGATTGTC GTTAATTGAT TGCTCGTACT CTCTACAATA ACCACCAAAG 60
TAAAAACGAC ATAGAAAGAT AGCATCAGCT GTAGCCATAG CGCCTTTGAC ACCTTCTGGA 120
TGATTATGAG TTACCTCTGC AGAAAGACTC GTAAGTCCTC TAGATGATGG CCATATACCA 180
GTTTTCGCAT AAAAACCACA GTCCATGATC CAAGCACATG GAGAAATACG CATAGCTGAT 240
CCATTCCCAA AGCTATTATA AGGCTCACGG TTATCGCTGT TTAGCCATGC ATTAAACCGA 300
GCACCGTAAT CAGCATTCGG ATACATTCTG CCATATTTCT TCATCGCGTC AATGAAGTCA 360
TCTTTTTGTC CACCATTCAT AATTGCTTCT GCAACAGCAC AGGTCATAAC CGTGTCATCT 420
GTAAAAAAGC AGTCCTTCCG AAATAAAGGA AAGTCCTTTG TTTTGATATT GTTCCATTCG 480
TAAACAGAAC CGACAATATC TCC^ATAATT CCTCC/lTiGCA TCAGATTCCT CCrrG'i-l'CAT 540
TTTGATGCTT TTTATATTGG TTATCTACCA TATTTATTTT AGAAAATAAC ATCCTGTTGG 600
ATTTTAAAAA TTTCATTTTT TTCAAAATAG GGTTTTACCA TTTCTTTCCA CCTAGCTCTA 660
TGAAAATTGA TTGATTTTAA AGGAGATAGG CCATAATTTC CCAATGCATA ACCATCATTT 720
ACTTCAACAA CAAGTGTTCT GCCATCGCGA GTAACACCGA TATCTAGTCC ATAAGCTATT 780
GGCGCATCTT TCCAACATGA TATCGCTTCA TCAATTACAC TTGCATCAAA TTGTGCATGA 840
TAATCACCTG TATAGGGTCG AACATCTAAT ACGCGACCAT CTAACACAAA ACAACGCCAT 900
TCAGCTATGA ATTCTACAAC CTCACTAATC CATATAGGAT AGTCGAAAGG TAGACCAATA 960
CCTATTAAAT CATGGGTTCC ATTAACAACT CTTCCAGTAA AGACTTTTGA ACCAGCTTTA 1020
GGCTTAATAA ATTTTCCCCA ATTATCAGGT ATATTCACAA TCTCTCCTAA AATACCAGCA 1080
TAAATCTTTC GACCATAAAA CTCTTTAAGC TCAATAGGAT AGTCATGAAC CGGAACGTTT 1140
AAGCCCATCA TTTTTAGTAA TGCTCTAGTC TCCATTATAT AATCTACAAC TATATCTTCA 1200
CTTGTTAACT CTTTTATTTC AGAAAAAGAT TGATATAAAA TAACTTCTTC TCCTTGTAAG 1260
TAGGCACCTA CTTGAGCATT GTATTTATTA ATTGAAACCT CACTTGGTAA TTTACTTTGT 1320
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CTAATATAAA CAACCATTTC ATCACTCCTA TATCACTAGT GTTACACCAA TTTGTAAAAA 1380 

ATAATAGCAA TTTTGCTCTT ATTTTTTTGA GTAAATAGCC CCCATAATAT CATCGAAATA 1440 

ATCAACGGTA TTTAGGAGTA ATTCAATAAC CTGGGACTTT GTTAGTCGCA TTCCCCTTCT 1500 

ATCTCTAGCA TCTTCTACTA AATTTTCAAG TTTCTCTAGA TTTTTATCAT CCAAGCTAAT 1560 

CATTATTCTA TTTTTATCGG TTGCCATTTT CATCACCTCA AGTTAATTCT ATCACAGGTG 1620 

TAACACTAGT GTCAACTGGC TTTTATAATA CATTAGTTTA AAAGTGGAGA GGATTTTTAA 1680 

CACAGTAACT TTAAATCTTT GGTATTAAAA AATTTTCACA ATATTTATAG AAATAAAATC 1740 

TGTCTCAAAT CAGTTATCAA ATCTAGTATA AATTATGAGC GGCTACTCTA ATACTTTCCC 1800 

TCTAAACAAG AAAAAGACTT ACACTCAAGG GTTTTCTTCC CCCCCTTCGT TATAACGTTT 1860

TGACTCTTTT ACTAGCAAAG GTATATACTC ACAAGGAACT TTGGTTGACT ATTGAATCTC 1920 

TCCAACTTCT TCTTTAACAT ATCCTTCTAC ATCTTCAATC TCTACAAACA TTGGGTCTAA 1980 

GTGACACAAG AAATGCCAAA CTTCGATCCC TTTTTTTCTG TAAAGAATCG CTTCACCGTC 2040

TTCACTTCCG AAAAAGCTTC TGTCGATTTC ATATCCGCGG CTTTCTAAGA AGTCTTTTGC 2100 

TTTACGATAG TTCGTTTCTC TTGTTTCGAC ATAGGCTTTA ACTTCATGGT TGTTAACGAC 2160 

ATATGCATCA ATTTTTGAAT ATCCTTCGAT CACTCTATCA TTTTTGAGGG ATAAATTTGA 2220 

AATCTCTTTC CAAATAATGT TTACATTTTC CTCAGGATCG AACATAAATT TAGATAAAGG 2280 

AACAATATTT CCGTTAAAAA TAATTTCCAT ATAATCCGGT ATGTTTTTAG GATTAAAATA 2340 

CTCCACTTCA AAACCATCTT CTGTTTCCAG AGTGTATCCC GGGATTTGAG CTACAAAGGC 2400 

TTTCCCATCT TCTATGGAAT CAAATGCTAC TAAATCTTTA GAATAATCAT TTTGGTACAA 2460 

TTCCAATATA ACCATCGATA ATCTCTCCAT TTTCATTATC AGGCTAATGT AAATAAGCAC 2520 

GTCACCTGAC CAATTCAGGC TCTCTGTATC ATCTCATCAT ATTTCCTACT TACTTTACGA 2580 

GTCTTATACC CAGAACACAC CTTATCGACC TTCGGTCTCA CCTCGTCGCA TTGGCTGAAC 2640 

ATCTACTTTT ACTTTGCTGA TGCTTCAACT CGTACAAGCA GTGATACCGC CTCAGCGTGA 2700 

TGCGTCAGTG GGACTCAAAA GGTTCGGGGA ACCTTTTGAG GATTAACTAC GTTTCTCTAA 2760

TAAACTTACA CATTCAACTT GTTCATCATT GTCCAAACCT ATGTTGAGAT TTTCTTCTAT 2820 

AATTGGTAGC TTAAAAGTAA TGGATTTTAG CCATTGTCCG TTAGATTGTT TTTCTTCATA 2880 

AACTTGAATT TCAGAAATCA AAGCTGAAAT TAACTGCCTA CGCTCTACAT CATTCATGAC 2940 

TTTATAGAGC TTATCAAAAT AGATCAGAAC CTTATATATG TTATCTCCTG TAAGCTTTTC 3000 

AGCTTCAATA GTCTGTTTCT TTGCTTTCGC ATCAATTAGT GATGATTCTA ATTCATCTAG 3060
TTTGTCATAC ATACGATATA GTCTATCATC TAAATCCTGT TTCCTTCTCT TATAATGCTT 3120
ATCTTCAACA TCTAAATTAT CTATTTCCTC AATTAGCTTA AACTTTGTAG AATGACTCTT 3180 
TCTCAATTCC TTTTGGTAAT TATCTATTTC TTTTTCTATT TCAGAGGTAT CCACCTTCAT 3240 
GTTGATTTTT TCTTGCATCA TAGAAGCAAA TTTCGGATTA CTTACTATCT TGACAATCAC 3300

CTCTGCAACA GCATCATCTA ACAATTCTTC TCTAATTTGC TTACTGAATG TACACTTATT 3360 
ACCTCTTATC ATCTGCCTAT GGTTACAACC ATAGTAATAA AAATCTTTAT ACTTTGTGCC 3420 
ATCTTTCTTT TTCTTGATAC ACTTGTTCCC AAACATTCCC ACTCCACATA TCGGGCATTT 3480

TACAATTCCA GAAAGCAAGT GTGTGCGTGT ATCTTTTCCT TTATTCACAT GCTCATATTT 3540 
CTTTGCTTGA GATTTTAGCT TAACCTGAGC AGCTTGCCAA ACTTCATCGG AAACTATAGC 3600

TTCATGTATC CCTTCAGATA TTAGATATTC ATCTTGTTCA ACCTGCTTAT ATTCATTTCT 3660

TGTACCATGA ACTTTTTCTA AAGTTCTTCT TCCAAATGCT ATTTTCCCAT TATATACAGG 3720 

ATTCTTTAAT ATCTTTCTTA TAAGACCTGC ATCAAACAAA GGATTCTTAC CATTCTGTCT 3780

TGGGATTTTT CTAATTCCAT GATTCTCTAA GTATTTAGAT ATCCCATTGG CTCCTATCGT 3840

AGTATTTACA TACTGGTCGA AAATCGTTCT TATTGCAACT GCCTCTTCCT CATTTATAAA 3900 

CAGCTTGCCG TCTTCAAGTT TATATCCATA CGGAGCAAAG CCACCATTCC ATTTTCCTTC 3960

CCCTGCTTTT TGAATGCGAC CTTCCATTGT TTGAATACTG ATGTTTTCTC TTTCTATTTC 4020

AGCCACAGCT GATAAA^^VCAH AAATCATTAG TTTCCCAGCA TCTT-TAGAtG AATCAATGCC 4080

ATCTTCAACG CAGATAAGAT TAACTCCATA ATCCTGCATT ATATGAAGTG TAGAAAGAAC 4140 

ATCAGCGGCA TTTCTTGCAA ATCTTGATAA CTTAAACACA AGAACAAAAG ATACTCCATC 4200 

TTTTCCAGAT TTTATATCTT CCATCATTCG ATTGAACTGT ATTCTACCTT CAATAGACTT 4260

GTCAGACTTC CCGGCATCTT CATACTCTCC AACAATTTCA TAATCGTTGT AAATAGCAAA 4320 

AGCTTTCATT CGTGATTTTT GTGCCTCTAA CGAATACCCC TCTATCTGTA TTGACGTAGA 4380 

TACTCGTGTA TAGAGGTATA CTTTTATTTT TTCTTTTGAC ATAGTATTAA CCTCAATATA 4440

ATTTTTCTAT ATCATATATA ATTTTTTTAA TTTAAGTTTG GACTATCATT TCAAGTATAT 4500 

TATAACACTT TTATTAGTCC GTCTCAATTT GTGTTTTTGC CATGTCAAAA CTATTTTTCA 4560 

TCTCTTGATT TTTTGCTGGC GTTGGATCGG GTAGATTATC TAAATCTAAA GCACCAGCAT 4620 

ATTTTGCAAT CAGATTTGCT ATTAAATCAG CCAATCCATT CCAGTCATTG TCCAATATAT 4680 

ACCTCCTCTA AAGTTTTATA TCTAATAATT ATTTGTTTAA TTAAGTTTTT TGACATTGAC 4740 

AAGTGCTTTG GATTAGCAAC ATAGGAATCT CACTTCCGCC TCTATTCCGG ATGAGCCGGC 4800 

TTCAACCTTA GAAGTATCAT TACCCTCATT TTCTTCATAG CGGATAGGGT ATCCCTCCCT 4860
ATATTCAAAC TCTTACTTAT CGCTCACTTT CTTTTTGCTT AGCAGAACTT TTTTTGCCGA 4920

ATTATTCAGC CGAAAGATCT TGACGGATAG GTTATTACGC TCCAAAAATA ATTAACGTCT 4980

TGTCTTGGTC TATTCAATTG TTAAGGTTCA AAATTTATCG AGAGTTATTA ATCTTTTTAA 5040

AATTTGACCA TCAGAAAATA TTTATCTTGA TGTAACAAAA TTCTATAAAT TACCCTCTTA 5100 

TACTTAACAG TGAAAAGAAG TCTTTCTTGG TAACCAATTT TGAAATAGAA TTTGCTTATA 5160 

TAAAAAGGTC CAATTCCCAC TGCATAAATA GCAGTGAAAA TTAGACCCTC TTGGTAACTG 5220 

TCATCTAAAA GTCTTCTA 5238 



(2) INFORMATION FOR SEQ ID NO: 151:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 13425 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151: 



GACGATTTAC GAAGAATCGA ACAAGAACCT GCTCCTATCA ATTCCCAACC TCTATCTCTA 60

AAATCTTGCA GTTCATGCTT ATACTTTTTT AAGAAATCTA GAATCATAGA TACGGTAGAT 120

GACATCGTCT GGTTGACATT GGTCAAAATA GAACAAACCA AAACGACTCG TTCTATACCT 180

CCAACCTTTC AAATGCATCT CATGTAAATG TTCTTCTTCC TTGTCCAAAT CAACAATGGT 240

GAAAATCCGA AATTCTACTC TGCTATTCAT TGTCTTACCC CAAAATTAGA AAACATGCCT 300

GGCGTTATTT ATTAGATAAT TCTTTCCACT TTTGACTCAA TCTCCAAAAA ATATAAGAAA 360

TCTGAATCGC AAAAACTATC AATAAAACCC AATCTATTAT GAAAATCAAA AACACTTTCC 420

AACTGAAAGA ACTACCTCCA GTGACAAACT TTGAGAAAAA CGGTAGTAGA GCTAAAAAGA 480

GAAATAAAAT AGGAAGCATC CGCATTGTTA AAATCCGTTT GGCATAAAAA AATCTTTATT 540

TAAACGAAAA TATTATGGCA AAATTTACGC CAGTTTTTGA ACGGCTGATG TAGATATTTT 600

ATACTTTCAA AATGTTTAAA TGTGATTATT TATTTTTGAA AAATAGATCA CCAGCCCGAC 660

TGAAAGTGCT TATAGAATGA TAATAAGTCG CCTGCCGAAA ACAGCGAAAA ATAGCGGTGT 720

TATGCGGAGA TAATCTGACG CGATGCGAAA GTATATTGCA TACTTATTTT CAACAATTTA 780

GCAGAGTATT TTTATAAGTG TGATATAATA GAAGTATAAT TTGTTCTGAT AGTTTATTTT 840

ATGGAGAAGT AGATTTTTAG AATGCGGAGG GTTCAATATG GTTGAGTTTA TAAAGTCTAA 900

GAAAGAAATG AGTGAGGAGG ATATTAAAGC AAATTTCATC ACTCCTGCTA TTGTATCCAA 960

AGGATGGAAA AATGGTGAGC ATATCGCTTA CGAAGAATAC TTCACTGATG GTCGAATTGA 1020

AGTTAGAGGA GATAAGGCTC GTCGTAAAGA AGGAAAAAAA TCAGACTATT CACTGTATTA 1080 

CCAATTTGGA ACTCGAATTG CAATTGTTGA GGCAAAGGAT AATAAACACA GCGTTCGAGC 1140 

AGGATTACAA CAAGCTATTG AATATGGAGA GATTTTAGAT GTTCCATTTG TTTATTCTTC 1200 

GAATGGTGAT GGCTTTATTG AACACGACCG TATCACGAGA GAAGAACGTG AGCTGGAGTT 1260 

AGACGAATTC CCTACTCGTG AAGAATTATT TTCTCGTATG ACGAAGGAAA AAGGATTGAC 1320 

GTACGAAATT ACAGAAGCTA TCTCAACTCC ATACTATACA GACGCCTTCT CAATGAAAAC 1380 

GCCACGCTAT TATCAGCAAA TAGCTATCAA CCGTACTATT GAAACAGTTG CCAGAGGACA 1440 

AAAACGAGTA ATGTTTGTGA TGGCAACAGG AACGGGGAAA ACGTTCATGG CTTTTCAAAT 1500 

TATTCATCGC CTTCGAAAAG CTGGTTTGGC TAAACGAGTT TTATTCTTAG CAGATAGAAA 1560 

CATCTTAGTA GACCAAACGA TGGCTGAAGA CTTTAGGCCA TTCGAAAAGG TAATGACGAA 1620 

AATTACACCA AAACTTTTGA CTGCTCCTGA AAAATTAAAT TCTTTTGAAA TTTATCTAGG 1680 

GCTTTATCAG CAACTAACTG GTGAAGATGG AACTGAAACA CATTATCAAA AATTTGACAA 1740 

AGACTTCTTT GATTTAATCG TAATTGATGA AGCGCACCGT GGTTCAGCTA AGGAAAACAG 1800 

TAACTGGCGT AAGGTAATTG ATTATTTCAG TTCTGCGACA CAGATTGGGA TGACCGCTAC 1860 

TCTTAAAGAA ACCAAGAATG CTTCCAATAC GGAATACTTT GGTGAGCCAA TCTATACTTA 1920 

TAGTTTAAAA CAGGr; A ATCG AGGATGGTTT TTTGGCTCCA TATCGTGTtA TGAGGGTTAA 1980 

TTTAGATGTG GATGTGGATG GTTATCGTCC AGAAACTGGA AAAGTTGATG CTAACGGACA 2040 

ATTAATAGAA GATAGGTACT ACGGCAGGAA AGATTTTGAT AAAACCATTG TCATTGATGA 2100 

TAGAACGCAA AGAGTTGCCA AGTTTGTTTC TGATTATATG AAGCAAAACA ATGCACGATT 2160 

TGATAAAACA ATTGTTTTTT GTGTTGATAT TGACCATGCC GAGCGAATGC GTGCTGCACT 2220 

TGTAAAAGAG AATCTAGACT TAGTCCAAGA AGACTATCGT TATGTCATGC AAGTAACTGG 2280 

TGACAACGCT GAAGGAAAAG CTCAACTGGA TAACTTTATG GATGTCAATT CTAATTTTCC 2340 

CGCTATTGTA ACAACGTCTA AATTATTAAC GACAGGAGTT AATGCTAAAA CATGTCGTTT 2400 

GATTGTTTTA GACTCTAATA TCCAATCCAT GACTGAATTT AAACAAATTA TTGGTCGTGG 2460

CACACGTCTT TATCCTCAAA AGGGGAAAGA ATTTTTTACG ATTATTGATT TTCGAAATGT 2520 

TACCAATTTG TTTGCTGACC CTGATTTTGA TGGTGATCCA GTGAAGGTGC TAGAAACAGG 2580 

TGCGAAAACA GTCAGTGGTT CTACGCCCGG TTTCGTAGAT GAGGAAGGTG ACCCAGTAGA 2640 

AAAATATATC GTTACAGACA AGCAGGTTAC CATTCTTAAT TCTACTGTTC AAGTATTGGA 2700 

TGAAAACGGG AAACTGATTA CCGAAAGCCT GACCGACTAC ACTCGAAAGA ATATCTTAGG 2760
TAGCTACGCC ACTTTGAACG ATTTTATCAC AGTTTGGCAT ACGGCAGATA AGAAGAAGCT 2820

TATCTTAGAC GAACTTTATA AAAAAGGAGT TTATCTAGAT GCTATTCGAG AGTCGGAGGG 2880

AATATCAGAA CAAGAAATCG ATGATTTTGA TTTACTCCTA AAACTTGCCT ATGGTCAAAA 2940

AGAATTAACC AAAACGGAAC GTATCAATAA ACTCAAACAA AGCGGATATT TATATAAATA 3000 

TAGTGAGGAA GCGCGTGCTG TTTTGGAAAT TTTACTGAAC AAATACATGG ATAAAGGTAT 3060 

TGGAGAACTC GAAAGCATTG AAACATTAAA ACTTCCAGAA TTTCAGATAT ATGGTGGAAC 3120 

CTTCAAAATC ATCAATACTT ATTTTGGAGA TAAAAAACGA TATTTACAAG CAATTAAAGA 3180 

ATTGGAGCAA GAGCTATTTA CAGTAGCTTA ATGAAAGGAA AGTATGTCAA TTACATCATT 3240 

TGTAAAAAGA ATTCAAGATA TCACTCGAAA CGATGCTGGT GTTAATGGTG ATGCTCAACG 3300 

TATTGAGCAA ATGTCTTGGT TATTATTCTT AAAAATTTAT GATAGCCGTG AAATGGTTTG 3360

GGAATTAGAA GAAGACGAGT ATGAGTCAAT TATCCCAGAG GAATTAAAAT GGCGAAATTG 3420 

GGCTCATGCT CAAAATGGGG AACGGGTATT GACAGGCGAT GT^TTACTTG ATTTTGTCAA 3480 

TAACAAGTTA TTCAAAGAGT TGAAAGAGCT TGAAATAACT TCAAATATGC CTATTCGAAA 3540 

AACGATTGTT AAATCAGCTT TTGAAGATGC GAACAACTAT ATGAAAAATG GCGTCTTGTT 3600

ACGCCAAGTC ATCAATGTTA TTGATGAAGT TGATTTCAAT AGCCCTGAAG ATCGTCATTC 3660 

GTTTAATGAT ATTTACGAAA AAATTCTTAA AGATATTCAA AATGCTGGGA ACTCAGGAGA 3720 

ATTTTATACG CCACGTGCAG CGACTGATTT TATTGCCGAA GTTCTTGACC CAAAACTTGG 3780 

AGAATCAATG GCAGACCTTG CTTGCGGAAC AGGAGGCTTC TTGACTTCGA CTCTGAACCG 3840 

TTTAAGTAGT CAACGTAAAA CTAGTGAAGA TACCAAAAAA TATAATACAG CTGTTTTTGG 3900

TATTGAAAAG AAAGCATTTC CTCATCTTTT AGCAGTTACA AATCTGTTTC TTCACGAAAT 3960

TGATGACCCT AAAATTGTTC ATGGAAATAC TTTGGAGAAA AATGTTCGTG AATATACGGA 4020 

TGATGAAAAA TTTGACATTA TTATGATGAA TCCACCTTTT GGAGGGTCAG AATTAGAAAC 4080 

AATAAAAAAT AACTTTCCAG CAGAATTACG GAGTTCTGAA ACAGCTGATT TATTTATGGC 4140 

TGTCATTATG TATCGTTTGA AAGAAAATGG TCGTGTTGGA GTTATTTTAC CTGATGGTTT 4200 

TCTATTTGGT GAAGGTGTAA AAACTCGCTT GAAACAAAAA CTGGTAGATG AGTTCAACTT 4260

GCATACGATT ATTAGGTTGC CTCATAGTGT CTTTGCACCG TATACAGGAA TCCATACGAA 4320 

CATTCTTTTC TTTGATAAAA CAAAGAAAAC AGAAGAAACT TGGTTTTATC GTTTAGATAT 4380

GCCAGATGGT TATAAAAATT TCTCGAAAAC TAAGCCGATG AAGTCAGAAC ACTTCAATCC 4440 

TGTTCGTGAC TGGTGGGAAA ATCGTGAAGA GATTCTGGAA GGTAAGTTCT ACAAATCTAA 4500
ATCATTTACA CCTAGTGAAT TGGCTGAGTT GAATTATAAT TTAGACCAGT GTGACTTTCC 4560
AAAAGAGGAA GAGGAAATCT TAAATCCCTT TGAGTTGATT CAGAATTATC AAGCGGAAAG 4620 
AGCAACTTTA AATCATAAGA TTGATAATGT ATTAGCTGAT ATTTTGCAGT TGTTGGAGGA 4680

CAAATAATGA CACCAGAACA ACTTAAAGCA AGTATTCTCC AAAGAGCGAT GGAAGGGAAA 4740 
TTAGTGCCGC AAAATCCCAA TGACGAACCT GCAAGTGAAT TATTAAAGAG AATTAAAGCT 4800 
GAAAAAGAAA AACTTATCAG TGAAGGAAAA ATCAAACGAG ATAAAAAGGA AACTGAGATA 4860 
TTTCGTGGTG ATGATGGGAA ACATTATGGG AAGTTTGCTG ATGGAAGCAC TCAAGAAATT 4920 
GATGTTCCTT ATGATATTCC TGATACTTGG GAGTGGGTGA GGTTTTCTAC ATTGGTTGAA 4980 
ATTGTCAGAG GTGGCTCTCC ACGACCAATC AAaGATTATC TTACTTCTGA AGTAGATGGA 5040 
ATAAATTGGA TAAAAATAGG TGATACTGAA AAGGGTGAAA AGTATATAAA TAATGTTAAA 5100 
GAAAAAATCA AAAAATCAGG GCTTAACAAA ACTAGATTTG TAAAAAAAGG TACATTTTTG 5160 
TTAACTAATT CTATGAGTTT TGGTAGACCT TATATTTTGA ATGTTGATGG TGCAATACAC 5220 
GATGGATGGT TGGCTATTTC GAACTATGAA AACTCATTAA ATAAAGATTA CCTATTCTAT 5280 
ATTCTTTCAT CAAATGTAGT TTATTCTCAA TTTCTATCTC TAATTAGTGG AGCTGTTGTG 5340 
AAAAACTTGA ATAGTGATAA AGTTGCTTCT ATTCTTATCC CTCTCCCCCC ACTATCCGAA 5400 

CAACAACGAA TAGTAGAAGC AATCGAATCA GCTTTAGAAA AAGTAGATGA ATATGCTGAA 5460

AGTTATAATA GACTAGAACA GCTAGATAAA GAATTTCCAG ATAAACTAATw AAAATCTATT 5520 

CTTCAATATG CTATGCAAGG AAAATTAGTT GAACAAGACC CAAATGATGA ATCAGTCGAA 5580 

GTTTTACTTG AAAAAATACG AGCAGAAAAA CAAAAACTCT TTGAAGAAGG CAAGATTAAA 5640 

AAGAAAGATT TGGACATTTC TATTGTTTCC CAAGGAGATG ATAACTCTTA TTATGGGAAT 5700 

ATACCTATGA ATTGGGTTGT TATAAAAATA AAAGATATTT TTTCAATAAA TACAGGTCTT 5760 

TCTTACAAGA AGGGCGATTT AAGCATTAAT AATAAAGGTG TTAGAATTAT ACGTGGTGGT 5820 

AATATTAAGC CTTTAGAATT TTCTCTGTTG GATAATGATT ACTACATTGA TACACAATTC 5880 

ATCTCCTCTG AGCAAGTTTA TTTAAAACAT AATCAGCTAA TAACACCTGT ATCAACCTCT 5940 

TTAGAACATA TTGGAAAGTT TGCAAGAATC GATAAAGACT ATGATGGTGT TGTGGCTGGT 6000 

GGATTTATTT TCCAATTAAC ACCATTCGAA AGTTCAGAGA TTATTTCAAA ATTTCTATTA 6060 

TTTAACTTGT CCTCTCCGTT ATTTTATAAA CAATTGAAAG CAATAACTAA ACTATCAGGT 6120

CAAGCTTTAT ATAATATTCC TAAAACTACA CTGAGCGAGC TATTAATTCC GTTAGCTCCT 6180 

TTTGAGGAAC AGGAACTTAT TACTCAAAAA GTTGAGAAAC TTTTTGAAAA AGTAAATCAA 6240 

CTTTGAAAAT GATTCTTTTC ATCTCTTCAT GATTAGAAAT AGGGATTAAT AATTCGGAGA 6300
TACTGGTACT ATTTAATGTT TTCCCTTTGA TAGCATCTTT TGAATCACCT AAAGTAGAGA 6360

TAAGTGGCAA AAATATCATT AAGTAATCTC TGATAATATT TTCTTTATTA GCATAGGGGA 6420

ATATCGATAT AATGGCTTCA TTATGAGTGG CAGGAATATC CAATATGGCA ACTTTTCCAA 6480

TAGATAATTT AAAACTCATT AATAAAGTTC CTTTAGGTGA AATGTCTATT TTCTTTGATT 6540 
TTAATGCTAA TTTAGAAATA GATTCTCTCG CATTAGTTAC ATAACCAGAT ATAGGCATAT 6600 
CTGATATAGA TACCCAAGGT ATTTCAGTTC CCCAAAAAGT AGCTTCACTG CGTGGAGGAG 6660 
TTTTTCCTAT TCTGAAGTTA ACTAGGCTAG CAAATTTAAT ATATCTCCAT GCTTCTGGGA 6720 
TTTCATATAT AGGATAAGAG GTTGTTTCGT CTTTGTTCCC ATAATAAGAG CCATAATCAC 6780 
AAAAATAGCA GGTAGTCAGT TTGACCACCT GTTATTTTTT ACCAATTAAC AATTTTATCT 6840 
ACAATATTTT GTTGTTCAGT AGCTGTTTTC CTTAGATAAA TTCGAGTAGT TTCTATACTT 6900 
TCGTGTCCCA TCAAATCTGC AAGCAAGGCA ATATCATTAT ACTTCGCTAA AAAATTCTTA 6960 
GCAAATAAAT GCCTAAAAGA ATGAGGGTAA ATTACGTTAG GATTCATTTT GTATTTATCA 7020 
GCATAATTTT TTAACTGTTG AGCAACTCCT CTTGCTGTAA TTGGTTCGTT AAATTTATTC 7080 

AAAAATAAAT AACCACTTCG GCGATTTTCT GATTCTAACC AACTAAGACA ACTATTTCTT 7140 

AATTTTTTAG GAATGTACAG TCTACGAATT TTACCACCTT TTGAGTAAAT GTCAAAATAA 7200

CCGATTTCTA CATGCTCTAC TTTTAGTTTA ATAAGTTCAC TTACACGAGC CCCAGTTGCA 7260

CCTAAAAACC AAACGACAAA ATGCCATTTT AAAATACCAT CTTTTTTCAA ACTACGTTTA 7320 

AGAAAAAGGT AATCAGCATG GCTAATGACA TCTTCTAAAA ACGGTTTTTG CTGTACTTTG 7380

ACAAATTTTA ATTTCAAATC ATCATGACCA ATAAAAGCCA GATATTTATT TACTCCTTGT 7440

AGTCGCAAAT TGACAGTTTT AGGTTTAAAA TTGTCTAATA AATATCCTTT GTATTCAAAT 7500 

AAATCTTCCA TTTTGAGTTC GTAATTCTCC AAGAAAAATC GAACACCATA AAGGTACGAA 7560 

CGCACAGTAT TTTCAGCTAA ACCAGCTTTC TTCAAATGTA ATTCAAAATC TTTCAACGTA 7620 

AAACTCCTAT CTTATGTTTG ATAGAAATTC CACCGCACGT AAAACTATTA TACTAAATTA 7680

GTGCGTCAAT ATGGGCGAAA AATTGTTCGA TTTTATCAAC GATTCTGGAT TGTTCAGGAA 7740

GGGGTGGGAG GGGGATTAAA TATTCTTTTA TAGTTTTCGT TAATAATTCT TTTTGTTTTG 7800

TACTACCCGA CGCTTTTTCT TCAATAACTG ACTGAACAAT AGGAGAGGAA AGAAAATTAT 7860 

AGATGAAATG GCAATTAATA ACCCCCGATA AGACTCTTAT AACTGTAACA TGGCTATCTG 7920 

CAACAGCCCA GCCATAAGGA TTTTTATTTT CATGGTAAAT AGCTAATCGT CCTAACGTAC 7980 

CTAGACCTGT TGAATTCCAC ATTAAATCAC CATCTCTTAG TAATCTTTCT TTCTGGTAAC 8040
TATGAACTGT TTCGGGATCA ATAAATCTTG CTAAGTCAAT AGAAAAGCCA GACCATTGAT 8100

TACATTTCTG AGCAATCACA GGGTATATAG GAATATTTGA ATATTTTGGA GACTTCCCTC 8160 

TTTGAATGTA GGAGGTTATA TCGTTTAACC TCACCCATTC CCAACTTTCT GGTATTTCAC 8220 

AAGGTACTTC CTCATAATAA GAGTTATCAT CTCCTTGGGA AACAATAGAA ATGTCCAAAT 8280 

CTTTCTTTTT AATCTTGCCT TCTTCAAAGA GTTTTTGTTT TTCTGCTCGT ATTTTTTCAA 8340 

GTAAAACTTC GACTGATTCA TCATTTGGGT CTTGTTCAAC TAATTTTCCT TGCATAGCAT 8400 

ATTGAAGAAT AGATTTTTTT AGTTTATCTG GAAATTCTTT ATCTAGCTGT TCTAGTCTAT 8460 

TATAACTTTC AGCATATTCA TCTACTTTTT CTAAAGCTGA TTCGATTGCT TCTACTATTC 8520 

GTTGTTGTTC GGATAGTGGG GGGAGAGCAA TTAATAATAG ATTAAAATTA TAATCATTGA 8580 

TTGCAGGATA ACTTGTTCCA GTAGATTTAT TATTAACACG ATTGATAAAA TTATCTGATA 8640 

ATAAATAATA TTTCAAATAT GTTTCGTTAA GTAAAGTATC CAAAACAATA AATGCTGTAC 8700 

TAGCTATCAA ATACTCTTTA AGTTCTCTAA CTACAGCAAT ATTTTTTAGA TATGGTCTAA 8760 

CTGTTGAAAA TAAGACACTA TTCTGCGAAA CTAATTTTCT AGCACGGGAA GGCGCTTGTT 8820 

CAGGTGAAAG ATATTGTAGA TTTTTGTAGT TGATTATGTT CTTTTTTCTA TCAATACTAG 8880 

ACGTATCTAT ATACCTAAAG GATTTCTCTG GCTTATTTTG CCCAAAATTC CAATAAATTG 8940 

ATTTTATCCT CACCCACTCC CAAGTATCAG GAATATCATA AGGAACATCA ATTTCTTGAG 9000 

TGCTTCCATC AGCAJWiCTTC CCAT.ViTCTT TCTTATGTGC TTCAAGtata TAAAAAGGCG 9060 

TAAAAATACG CCTATAGATA ATGGGGTTGA AATAGGTTTA TTGTTGATGA GATTGTAGAT 9120 

AATTCAATTT TTTACTTCCA ATCGAATATT CAAATCCTCC ACCTTTTCTG CCTGTAATTG 9180 

TTCATCATAA AATTCAATAT CTTCAGGATT TTCCCCTTGG CAACCTCGGC AGAAATATTC 9240 

TTCCGCTCGA TCAGGATTCA AAAATCGACA AGCACAAACA AAACAGTCGC CATCATCATT 9300 

TATTGAGATA ATATAGTAGA TTGAAATAAG ATGTAAACAA ATCGATTAGG AAAGTTAAAT 9360 

TAGTTTCTAG AAATTTTTAG CAGATGTAGT GTACTATTCT AGTCTCAATT TACTATGGCT 9420 

TCAAATATAT CTTTCGAAAA AATATTTACA GATGTGTAAT TTTGAAGCTT GCAAAAGTTA 9480 

GTAAACTTGT AGATTTCGAT TTGAAGTAAC TTGTTTTCTT GCCCGATATT GTTTTTGAAA 9540 

TTGAATTTTT CCATAGTGAC TCCTTAATTT TCTTCTACAC GTCTGATGAT AAATCTAATT 9600

CGCAAAAGAG TCAAGAGGAT TTTTCGAAAA ATAAATAGCG ACCGAAATCG CTATTTTAAG 9660

GGTTATAGGT ATTTGATGGC TTAGACTGCT GTGTGACTGT TTACCCACAG GCAATCTTTC 9720 

TTCTATATTA GTATTAGTAA AGGTCTAAAT AATTATCAAT TTCCCATTGT GAAACGAAGG 9780 

TTGCATAACT TGCCCATTCG ATTCGTTTGG CTTCAAGGAA GCTAGTATAG ATGTGATCTC 9840

CGAGAGCAGC TTTAACCACT TCATCTTCTG TCAAAGCTTT CAAAGCGTTG TGAAGAGTTG 9900

ATGGAAGGTC TGTAATACCA GCTTCCTTGC GCTCTTCTGC TGTCATGATG TAGATATTTT 9960

CTTCGATAGG AGCTGGTGCT TCGATTTTAT TTTCAATACC ATACAAACCA ACTTCCAAAA 10020

GAACAGCCAT AGCAACGTAA GGGTTCGCCA TTGGATCCAC TGAACGCAAC TCAAGACGAG 10080

TTCCCATACC ACGTGAAGCA GGTACGCGCA CAAGTGGCGA ACGGTTACGA CCAGCCCAAG 10140

CAATGTAAAC AGGCGCTTCA TAACCTGGAA CCAAACGTTT GTATGAGTTA ACTGTTGGGT 10200

TCATGATGGC AGTATAGTTG TAAGCATGCT TGATCAAACC GCCTAGGAAA TGGTAAGCTG 10260

TTTCTGACAA CTGCATTCCT TTTGGATCAT TTGGATCAAA GAAGGCGTTA TTTCCTTCTG 10320

CATCAAACAA GGACATATTA CAGTGCATAC CTGATCCAGC AATACCAAAT TTTGGCTTCG 10380

CCATAAATGT TGCGTAAAGT CCGTGTTTGC GAGCAATGGT TTTAACAACA AGCTTAAAGA 10440

TTTGAATCTT ATCACAAGCA CGGAGAACTT CATCGTACTT AAAGTCAATC TCATGCTGTC 10500

CAACCGCAAC CTCGTGGTGA CTCGCTTCTA CTTCAAATCC CATTTTGGTC AAGACATTCA 10560

CAATCTCACG ACGTGTGTTG TCCGCAAGGT CAGTAGGTGC CAAGTCAAAG TAGCCACCCT 10620

TGTCATTCAC TTCAAGTGTT GGGTCCCCAT TTTCATCCAA CTTAAATAGG AAGAATTCTG 10680

GCTCTGGACC AAGGTTGAAG GATTTGAATC CAACTTCTTC CATGTGACGA AGAGCTCGTT 10740

TCAAATTACC ACGAGGGTCA CCCGCAAATG GTTCACCTTC TGTTGTATAG ACATCACAGA 10800

TCAGACCTGC AACACTTCCA TTTTCATCTC CCCAAGGGAA GACTGTCCAT GTATCCAAGT 10860

CCGGGTACAA GTACATATCC GACTCATTGA TACGTACAAA ACCTTCAATA GAAGATCCAT 10920

CAAACATAAC CTTGTTCGAC AAGACCTTAT CTAACTGTTC ATCTGTAGCA GGAATTTCGA 10980

CGTTTTTCAT GGTTCCCAAA ATATCTGAGA ACATAAGACG AATAAAGGTA ACATTTTTTT 11040

CCTTGACTTC ACGACGAATA TCTGCAGCTG TGATTGGCAT AAGTTTTCTC CTTAATCTAT 11100

GACTACTTGC GGTTGCCTAA CCGCGACCAA AAGGTGACTG TACTGAAGCA AAACGCCCCT 11160

GTTGGAGGAG TTCATTGTGA AGTGCACGAC GTACTTCAGT CTGACTAACC GCTTTCTTGG 11220

ATTTCGCTTC ACGTTCAGCA TATTTTTTCT TAATGGCAGC GATATTATAA CCTTCAGAGA 11280

TATAATCTTT GATTTCAAGC AGACGATCCA TGTCATTCAA GGAATACATG CGACGATTTC 11340

CTTCGTTTCG ATCGGGCTTG ATCAACTCTT GATCTTCATA ATAACGAATC TGACGCGCCG 11400

ATAGATCGGT CAACTTCATA ACACTGCCGA TAGGAAAAAC AGCCATATTT CGGCGAAATT 11460

CTTTTTCCTT CATTTACAAT TTCCTTCTTT CTGTCTATTA TAGTCTAAAA AAAGACAAAC 11520

GTCAATTGAT AATGTTATAA AATGTAACAT TATTTTTCTT TTTTCTCTAA AAAGAGACGA 11580

ATACGATCAA TATCGTAATT TACGATAATT GCGACAAAAA CTCCCATAAA CGTTTCTAAT 11640

ACACGCACAA ACACGTACAA AATTGTCTCA CCACTTGGAA TTGATAGGGT AATGATTAAC 11700 

ATAGCTGCTA CACCACCAAT AACCCCTGCT TTGTTATTCA TGGCTACATT TGTCATAATG 11760 

GTTAACATGG TGCAGATTGG AACAACTACC AAGGTCACCC AAAAGGCTTC GTGGAAAAAG 11820 

GTATTTAATA AGAAGAAGAC CAAGGCATAG AGTCCACCGA TACTATTTCC TAGAATACGC 11880 

GAAGTCCCAA AATGAACACT CTCATCAAAA CTCTCCCTCA GGCTAAAAAC GGCTGTCAAA 11940 

GCACCAATTT GAAGACCTTT CCAGCCAAAA AAGCCAAAAA TCAAGAGAAC TAGAAAAACA 12000 

GCAATACCTG TTTTAAAGGT TCGCATACCA AGTTTGAACT GGGATTTATC GAATTTATAT 12060 

TTTTTAAAAT AACTCATAAT CTCAACTTTC TATTTCCATT TTATCATAAA TCGGTGATTT 12120 

TTATGAGTAA TAGTTGAGAG GAAGCGTTTT TATTTTAAGC AAAAGAAAAG AGGAACTTTC 12180 

ATCCCTCTCT TCTTTGATTT ATTTATAAAA TCTTATTTTT CTGTCAAGGC TGCAAGTCCT 12240

GGAAGAACCT TACCTTCAAG AAGTTCCATT GATGCTCCAC CACCCGTACT AATCCATGAG 12300 

AACTTGTCTG CACGGCCAAG GTTAATCGCT GCGGCAGCTG AGTCACCACC ACCGATGATT 12360

GATTTAACTC CTGGTTGTTT CACGATAGCG TCCATCACAC CGATTGTACC AGCTTGGAAA 12420 

TCTGGGTTTT CAAATACACC CATAGGTCCG TTCCATACGA CTGTTTTGGC ACCAGTCAAA 12480 

GCTTCGTCAA ATTTGGCGAT AGATTTTGGA CCGATGTCAA GACCAAGGAA GCCTTCAGAA 12540

ACTGCTTCAC CTTCAGTGTC ACGCACTTCa U'l-GTAACCAG CAAATGCGTT AGCTTCTTTT 12600

GAGTCAACTG GCAAGATCAA TTTACCATTT GCTTTTTCAA GAAGAGCTTT CGCAACATCC 12660

AATTTGTCTT CTTCTACAAG TGAGTTACCG ATTTCGATAC CTTGTGCTTT GTAGAATGTG 12720 

TAAGTCATCC CACCACCGAT AAGGACGTTA TCAGCTTTTT CAAGCAAGTT TTCGATAACA 12780 

CCGATCTTGT CTG/^CTTT TGAACCACCA AGGATAGCCA CGAATGGACG TTCTGGAGTT 12840 

TCAACTGCTT CTTGGATGTA GGCAATTTCG TTTTCAAGAA GGAAACCAGC AACTGCTTTT 12900 

TCAACGTTTG CTGAGATACC AACGTTAGAT GCGTGTGCAC GGTGAGCTGT ACCGAATGCA 12960 

TCGTTTACGA AGATACCATC TCCAAGTGAT GCCCAGTATT TACCAAGTTC AGGATCGTTT 13020 

TTAGATTCTT TCTTGCCGTC AACATCTTCG TAACGAGTGT TTTCAACCAA GAGAACTTGT 13080 

CCATCTTCAA GAGCGTTGAT TGCCGCTTCT AATTCAGCAC CACGAGTGAC ACCTGGGAAA 13140 

ACAACATCTT GACCAAGTTT TGCTGCCAAG TCAGCTGCTA CAGGAGCAAG TGATTTACCA 13200 

GCTTTATCAG CTTCTTCTTT CACACGTCCA AGGTGAGAGA AAAGAATTGC ACGTCCACCT 13260

TGTTCGATGA TGTACTTAAT AGTTGGAAGA GCTGCTGrCA TACGGTTATC GTTAGTGATT 13320 

ACGCCATCTT TCAATGGTAC GTTGAAGTCA ACACGAACGA GGACTTTTTT ACCTTTCAAG 13380
TCAACGTCTT TAACAGTAAG TTTTGCCATG TTACAAAAAC TCCGG 13425
(2) INFORMATION FOR SEQ ID NO: 152:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 905 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152: 

GATTTATCCT ACCGGnGAAT TTCCGGAGGG GTTCTAGCAG CAATCTTAGG AATCTATGAA 60 

CGAATGATTG GCTTTCTGGC CCATCCCTTT AAAGACTTTA AAGAAAATGT TTTGTACTTT 120

ATTCCAGTTG CCATCGGTAT GCTTCTGGGA ATCGGCTTAT TTTCCTACCC GATTGAATAC 180 

CTGCTTGAAA ATTATCAGGT TTTTGTATTA TGGAGCTTTG CGGGAGCTAT TATCGGTACA 240 

GTTCCTAGCC TCCTCAAAGA ATCAACTCGA GAATCTGACC GAGACAAGAT TGATTTAGCT 300 

TGGTTATGGA CAACCTTTAT CATTTCTGGA TTAGGACTCT ATGCCTTAAA TTTTGTCGTT 360 

GGAACCTTAA GCGCCAGCTT TCTTAACTTC GTCCTAGCAG GCGCACTATT GGCCCTTGGC 420 

GTCTTGGTTC CTGGCCTCAG CCCATCAAAT TTACTTTTGA TTTTGGGACT CTATGCTCCT 480

ATGTTGACTG GTTTTAAAAC TTTTGATTTC TTGGGAACCT TCTTTCCGAT TGGAATTGGT 540 

GCAGGTGCAA CTCTCATCGT TTTTTCAAAA TTGATAGATT ATGCCTTAAA CAACTACCAC 600 

TCACGCGTCT ATCATTTCAT CATCGGTATC GTCCTATCAA GTACCCTTTT GATCTTAATT 660 

CCAAATGCAG GAAACGCTGA AAGTATCCAA TACACAGGAC TTTCACTTGT CGGTTATGTC 720 

ATCATCGCCT TCTTCTTTGC GCTGGGAATC TGGCTTGGTA TTTGGATGAG TCAATTGGAG 780 

GATAAATATA AATAATGGCA AAAAAAGTTA AAATCAAAAA AACATTGGTG GAACAAATCC 840 

TATCTAAAGC AGCTATCCCT CATCAGGGGA TTCAAATCAA TGCCCTAGAA GGAGAGCTTC 900 
CTCAA 905

(2) INFORMATION FOR SEQ ID NO: 153:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4278 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 153:

CTTGAATTAA ATAAAAAACG TCATGCGACT AAGCATTTTA CTGATAAGCT TGTTGATCCC 60

AAAGATGTGC GTACGGCTAT CGAAATTGCA ACCTTAGCGC CAAGCGCCCA CAACAGCCAG 120 

CCTTGGAAAT TTGTGGTGGT ACGTGAGAAA AATGCTGAAC TGGCAAAGTT AGCTTATGGT 180 

TCCAATTTTG AACAGGTATC ATCAGCGCCT GTAACCATTG CCTTGTTTAC AGATACGGAC 240

TTAGCCAAAC GTGCTCGTAA GATTGCCCGT GTTGGTGGTG CTAATAACTT TTCTGAAGAG 300 

CAACTTCAAT ATTTTATGAA AAATCTGCCA GCTGAGTTTG CCCGTTACAG TGAGCAACAA 360

GTCAGCGACT ACCTAGCTCT CAATGCAGGT TTGGTTGCCA TGAACTTGGT TCTTGCATTG 420 

ACAGACCAAG GAATTGGTTC TAACATTATT CTTGGTTTTG ACAAATCAAA AGTTAATGAA 480 

GTTTTGGAAA TCGAAGACCG TTTCCGCCCA GAACTCTTGA TCACAGTGGG TTATACAGAC 540 

GAAAAATTGG AACCAAGCTA CCGCTTGCCA GTAGATGAAA TCATCGAGAA AAGATAGAAA 600 

GAAGAAAAAA TGACAGCAAT TGATTTTACA GCAGAAGTAG AAAAACGCAA AGAAGACCTC 660 

TTGGCTGACT TGTTTAGCCT TTTGGAAATC AATTCAGAAC GTGATGACAG CAAGGCTGAT 720 

GCCCAGCATC CATTTGGGCC TGGTCCAGTA AAAGCCTTGG AGAAATTCCT TGAAATCGCA 780 

GACCGCGATG GCTACCCAAC TAAGAATGTT GATAACTATG CAGGACATTT TGAGTTTGGT 840

GATGGAGAAG AAGTTCTCGG AATCTTTGCC CATATGGATG TGGTGCCTGC TGGTAGCGGT 900 

TGGGACACAG ACCCTTACAC ACCAACTATC AAAGATGGTC GCCTTTATGC GCGCGGGGCT 960 

TCGGACGATA AGGGTCCTAC A.i^r?AGCTTGT TACTATGCTT TG/J^TCA'r CAAAGAATTG 1020 

GGTCTTCCAA CTTCTAAGAA AGTTCGCTTC ATCGTTGGAA CAGACGAAGA ATCAGGCTGG 1080 

GCAGACATGG ACTACTACTT TGAGCACGTA GGACTTGCCA AACCAGATTT CGGTTTCTCA 1140 

CCAGATGCTG AATTTCCAAT CATCAATGGT GAAAAAGGAA ATATCACGGA ATACCTCCAC 1200 

TTTGCAGGAG AAAATACAGG TGTTGCCCGT CTTCACAGCT TTACAGGTGG TTTACGTGAA 1260 

AATATGGTAC CAGAATCAGC AACAGCAGTC GTTTCAGGTG ACTTGGCTGA CTTGCAAGCT 1320 

AAACTAGATG CCTTTGTTGC AGAACACAAA CTTAGAGGAG AACTCCAAGA AGAAGCTGGC 1380 

AAATACAAGG TGACGATCAT TGGTAAATCA GCCCACGGTG CTATGCCTGC TTCAGGTGTC 1440

AATGGCGCAA CTTACCTTGC CCTCTTCCTC AGCCAGTTTG GCTTTGCTGG TCCAGCCAAA 1500 

GACTACCTTG ACATCGCAGG TAAAATTCTC TTGAACGATC ATGAGGGTGA AAATCTTAAG 1560 

ATTGCTCATG TGGATGAAAA GATGGGTGCT CTTTCTATGA ATGCCGGCGT CTTCCACTTC 1620 

GATGAAACAA GTGCTGATAA TACCATTGCC CTCAACATCC GCTATCCAAA AGGAACAAGT 1680 

CCAGAACAAA TCAAGTCAAT CCTTGAAAAC TTGCCAGTTG TTTCTGTTAG CCTGTCTGAA 1740 

CACGGTCACA CGCCTCACTA TGTGCCAATG GAAGATCCAC TTGTGCAAAC CTTGTTGAAT 1800
ATCTATGAAA AACAAACTGG CTTTAAAGGT CATGAACAAG TCATCGGTGG TGGAACCTTT 1860
GGTCGCTTGC TAGAACGCGG AGTTGCCTAC GGTGCTATGT TCCCAGACTC GATTGATACC 1920 
ATGCACCAAG CCAATGAATT TATCGCCTTG GATGATCTTT TCCGAGCAGC AGCAATTTAT 1980 
GCCGAAGCTA TTTACGAATT GATCAAATAA AACGATAGAA GTCTGAGATC TTATGCTTGG 2040 
ACTTCTTTTT GGAGGGAAAG TAGATGTCTC AAATCGAAAG AATCAAACAG GCTATCATGG 2100 
CGGATTCGCA GAATGCCAGC TATACAGAGC GTGGCATTGA GCCTCTCTTT GCAGCGCCAA 2160 
AAACTGCTCG CATCAATATC ATCGGTCAGG CTCCGGGACT TAAAACTCAA GAAGCAGGCC 2220
TTTACTGGAA AGATAAAAGT GGTGACCGCT TGCGGGACTG GCTAGGTGTG GATGAAGATA 2280 
CCTTTTACAA TTCAGGTTAT TTTGCTGTTT TGCCTATGGA TTTCTACTTT CCAGGACATG 2340 
GCAAGTCGGG TGATCTTCCG CCTCGTACAG GTTTTGCAGA AAAATGGCAT CCGCAGGTCT 2400 
TACAGGAATT GCCTGATATT CAGTTAACCC TCTTGATTGG GCAATATGCC CAAGCCTACT 2460 
ATTTACAGGA GAAAATCAGT GGGAAGGTAA CGGAGAGGGT GAAACACTAT AAAGACTATC 2520 

TGCCAGCCTA TTTTCCGCTA GTTCACCCAT CACCACGAAA TCAAATCTGG ATGGCCAAAA 2580 

ATCCTTGGTT TGAGGCAGAA GTAGTGCCAG ATTTGAAAAA AAGAATTAAA ACCATTTTAT 2640 

AGTCAATGAA AATCAAAGAG CAAACTAGGA AGCTAGTCGT AGGCTGCTCA AAGTACAGCT 2700 

TTGAAGTTGC AGATAAAAGT GACGAAGTCG GTAACATACG CACGGTAAGG CGACGCTGAC 2760

GTGGTTTGAA GAGATTTTCG AAGAGTATTA GAAGAAAAAG AATGAAAGAA ATAGCCTTTG 2820

ACGCATTTTA CCAGCTTTAC CAAAACGACC AGCTTTCTTT AGTGGATGTG AGAGAAGTGG 2880 

ATGAGTTTGC AGCTCTTCAT TTAGAAGGTG CCCACAACCT ACCGCTTAGT CAATTGGCTG 2940 

ATAGTTATGA TTAATTGGAC AAAGATCGCT TGCATTATAT TATTTGCAAA TCTGGAATGA 3000 

GATCGGCGCG TGCTTGCCAA TTCCTATTAG AACAAGGTTA TAATGTTATC AATGTCCAGG 3060 

GTGGCATGTT AGCCTTTGAA GAACTTTAAA ATTTTGCATT TCTCCTACTT GGTGTGGACT 3120 

GGGTAGGAGA GTTTTATTTT TAGATAATTC TTATTTTTAA GAAAATTGAA AACATTTAAT 3180 

ATTTGCCTCG TGATGCTTTT TTCAGACTCC TAATCGTGGT ATACTAGGTC AGTATTTTAT 3240 

AAATATGAAG GAGATTTTTA TGGCTAAAAA AGGTACCCTA ACAGGTTTGC TCCTGTTTGG 3300

AATATTTTTT GGTGCGGGGA ACTTGATTTT TCCGCCTTCT CTAGGTGCTC TATCTGGAGA 3360 

ACATTTTCTT CCTGCCATCG CAGGTTTTGT CTTTTCAGGC GTTGGTATCG CCGTCTTGAC 3420 

CCTTATTATT GGAACGCTAA ATCCTAAAGG ATATATCTAC GAGATTTCAA CGAAGATAGC 3480 

GCCTTGGTTT GCGACTCTTT ACCTCTCAGT TCTTTACTTG TCAATCGGTC CATTCTTTGC 3540
TACCCCACGT ACTGCTACAA CAGCTTACGA AGTAGGGATT AGCCCCCTTT TGTCGGATGC 3600

AAATAAAGGA CTTGGCTTGA TTGTATTTAC GGTTCTGTAT TTTGCGGCAG CCTATTTGAT 3660 

TTCGCTTAAT CCATCAAAAA TCTTAGACCG CATTGGACGT ATTTTAACGC CAGTCTTTGC 3720 

AATTTTGATT GTTATCTTGG TCGTTCTGGG AGCTATCAAA TATGGTGGAA CAAGTCCTCA 3780

AGCTGCTTCA CTGCTTATCA AGCTTCTGCC TTTGGTACAG GTTTCCTAGA AGGTTACAAT 3840

ACCTTGGACG CCCTTGCCTC AGTGGCCTTT AGCGTAATCG CAGTTCAAAC CTTGAAACAA 3900

CTTGGATTTT CAAGTAAGAA AGAATACATT TCAACTATTT GGGTTGTTGG TATCGTTGTT 3960

GCCCTTGCCT TCAGCGCTCT TTACATCGGT TTAGGTTTTC TTGGAAATCA TTTCCCAGTA 4020

CCAGCTGAAG CGATGAAGGG TGGAACACCA GGTGTTTACA TCTTGTCACA AGCCACTCAA 4080 

GAAATCTTTG GCTCAACAGC TCAACTCTTC CTTGCAGCTA TGGTTACCGT AACCTGCTTC 4140 

ACAACGACTG TTGGTTTGAT TGTGTCAACA GCTGAGTTCT TTAATGAGCG CTTCCCACAA 4200 

ATCAGCTACA AGGTTTATGC GACAGCCTTT ACCTTGATTG GATTTGCTAT TGCCAATTTG 4260 

GGTCTTGATG CGATTATC 4278
(2) INFORMATION FOR SEQ ID NO: 154: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1953 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 154:

ACCCGATCAA ATGACAAAAG CTAACTTTGG TGTCGTAGGT ATGGCCGTAA TGGGTCGTAA 60

CCTTGCCCTT AATATTGAAT CTCGTGGTTA CACAGTTGCT ATCTACAACC GTAGTAAAGA 120 

AAAAACGGAA GATGTGATTG CTTGCCATCC TGAAAAGAAC TTTGTACCAA GCTATGACGT 180 

TGAAAGTTTT GTAAACTCAA TCGAAAAACC TCGTCGTATC ATGCTGATGG TTCAAGCTGG 240 

ACCTGGTACA GATGCTACTA TCCAAGCCCT TCTTCCACAC CTTGACAAGG GTGATATCTT 300

GATTGACGGA GGAAATACTT TCTACAAAGA TACCATCCGT CGTAATGAAG AATTGGCAAA 360 

CTCTGGTATC AACTTTATCG GTACTGGGGT TTCTGGTGGT GAAAAAGGTG CCCTTGAAGG 420 

TCCTTCTATC ATGCCTGGTG GACAAAAAGA AGCCTACGAA TTGGTTGCGG ATGTTCTTGA 480 

AGAAATCTCA GCTAAAGCAC CAGAAGATGG CAAACCATGT GTGACTTACA TCGGTCCTGA 540

TGGAGCTGGT CACTATGTGA AAATGGTTCA CAATGGTATT GAGTACGGTG ATATGCAATT 600 

GATCGCAGAA AGCTATGACT TGATGCAACA CTTGCTAGGC CTTTCTGCAG AAGATATGGC 660
TGAAATCTTT ACTGAGTGGA ACAAGGGTGA ATTAGACAGC TACTTGATTG AAATCACAGC 720

TGATATCTTG AGCCGTAAAG ACGATGAAGG CCAAGATGGA CCAATCGTAG ACTACATCCT 780 

TGATGCTGCA GGTAACAAGG GAACTGGTAA ATGGACTAGC CAATCATCTC TTGACCTTGG 840 

TGTACCATTG TCACTGATTA CTGAGTCAGT GTTTGCACGC TACATTTCAA CTTACAAAGA 900 

AGAACGTGTA CATGCTAGCA AGGTGCTTCC AAAACCAGCT GCCTTCAACT TTGAAGGAGA 960 

CAAGGCTGAA TTGATTGAAA AGATCCGTCA AGCCCTTTAC TTCTCAAAAA TCATTTCATA 1020 

CGCACAAGGA TTTGCTCAAT TGCGTGTAGC CTCTAAAGAA AACAACTGGA ACTTGCCATT 1080 

TGCAGATATC GCATCTATCT GGCGTGATGG CTGTATCATC CGTTCTCGTT TCTTGCAAAA 1140 

GATTACAGAT GCTTACAACC GCGATGCAGA TCTTGCCAAC CTTCTTTTGG ACGAGTACTT 1200 

CTTGGATGTT ACTGCTAAGT ACCAACAAGC AGTACGTGAT ATCGTAGCTC TTGCGGTTCA 1260 

AGCAGGTGTG CCAGTGCCAA CTTTCTCAGC AGCTATTACT TACTTTGATA GCTACCGTTC 1320 

AGCTGACCTT CCAGCTAACT TGATCCAAGC ACAACGTGAC TACTTTGGTG CTCACACTTA 1380 

CCAACGTAAA GACAAAGAAG GAACCTTCCA CTACTCTTGG TATGACGAAA AATAAGTAGG 1440 

TCAGCCATGG GGAAACGGAT TTTATTACTT GAGAAAGAAC GAAATCTAGC TCATTTTTTA 1500 

AGTTTGGAAC TCCAGAAAGA GCAGTATCGG GTTGATCTGG TAGAGGAGGG GCAAAAAGCC 1560 

CTCTCCATGG CTCTTCAGAC AGACTATGAT TTGATGTTAT TGAACGTTAA TCTGGGAGAT 1620 

ATGATGGCTC AGGATTTTGC AGAAAAATTG AGCCGAACTA AACCTGCCTC AGTCATCATG 1680 

ATTTTAGATC ATTGGGAAGA CTTGCAAGAA GAGCTGGAAG TTGTTCAGCG TTTTGCAGTT 1740 

TCATACATCT ATAAGCCAGT CCTTATCGAA AATCTGGTAG CGCGTATTTC GGCGATCTTC 1800 

CGAGGTCGGG ACTTCATTGA TCAACACTGC AGTCTGATGA AAGTTCCAAG GACCTACCGC 1860

AATCTTAGGA TAGATGTTGA ACATCACACG GTTTATCGTG GTGAAGAGAT GATTGCTCTG 1920 

ACACGCCGTG AGTATGACCT TTTGGCGACA CGG 1953
(2) INFORMATION FOR SEQ ID NO: 155: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6474 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155: 
CCGGCAGTAC ACGAGCTTGG GGAACAGCCA CTGGAACGAT GAGGTGTGAG CTCAAAATAT 60

CCTCCAGTTA TGTTTTTCCT AATAGTATAC CGGAAGAGTG AAAGGATTTT ATAATGGAGC 120

GGTTACAAAG AACCTACTTT CTATTAAACA GTATACTATG AAAATGTGAA AATTTAACAT 180

TTTTTTGTAC AAATTTTATA AATTATTGCC TTTTTAATAT CAATAGTTAA TCTCTTATCC 240

AGATCCCCCT TGTGTAAACT TTATCTTTAT AAGCTTCAAG GCCCCTATCC CATCTATTTG 300

CAACAATTAG ATCACTTTGT TTTGTAAATA GTTCAAAATT CTTTTCAATA ATTACGTTAT 360

CTATACTAAC GTTTAAATTT GGTTCATATA CTAAAATTTT TATACCGACA ATCAATAGTT 420

CATTAATTAT ACTTAAAATA GCTGACTCTT TGTAATTATC TGAATTATAT TTCATCCCCA 480

ATTTATATAT TCCTACTATC TTTGGCTTTC GTTCCAATAT TTGTTTAACT ATGAACTGTT 540

TTCTATTTGT GTTTGAAATA TCAATCGCTT CTATCACTGG GGCATTTATT TCTATAAATT 600

CTTTTTTTAA TTGTTTAGTA TCTTTGGGAA GACAATATCC TCCAAATCCA AAAGAAGGAT 660

TATTATAAAA ATTTCCAATT CTTGGATCTA AACAAACACC TTTTATTACA ACTTCAGCAT 720

TTAAGCTTCT CCTCTCAGCA AAAGAATCTA GTTCATTAAA AAAGCAACAC GGAGAGCTAA 780

GAATGTGTTA GAAAAAAGCT TAATTGCTTC TGCTTCAGTA GGAGAAACTA ACATAACATT 840

TTTAATATTG GCAGTACTAT GAGTACTAAT CGAAAGGAAC AACTCTGCAA TTTTTCTTCC 900

TTCAACTGTC TCATCTCCAA CAACTATGCG ACTTGGATAT AAATTATCAT ATATAGAACA 960

ACCTTCTCTC AAAAATTCAG GGACAAAAAT GATATTTTTT GTATCAAACA GCCTTTTTAA 1020

TTTGTTTGAA AAGCCGATCG GAACTGTTGA ATCTTTCCAT I'ACiGTTTTAC 1080

CCTCAGAATC TTCGATACCG TTTGTTCGAT TTCATATGTA TTAAAACTAC CAATTTTCTC 1140

ATCATAATCT GTCGGAAGCG CAATAATATA ATAATCAATA TTATTTTTAA TTTCAGAAAA 1200

TGTATCAAAA AAAGTAATAT TTAAGTTATT CTCGCAAAAA AACTTCATAA GCTCTTCATT 1260

TTTAGATGGA AGAATGCCCT TTTTTAAATT ATTTATTTTT ACAGAATCTA TATCATATGC 1320

AACAACTTTA TATTTAGATG CAAATAGTAA CGCGTAGGCC AGCCCAACAT GCCCCAAACC 1380

AATTACTGCT ATATTCATAA AACTACTTCC TTATTTCTTA ATCCAAAATC TAATAGAA'i'A 1440

AGCTGCCCCA TTCCTTAAAT ACAACTCTTT AATATTGTTT AAAAGTTTTT CAACTGATTT 1500

CCAGATTATC AAAATCTGAG ATTTATAGCA CAATATTGAT GATATTCTAT CAATATAATT 1560

TTTTTCATCA AGTTCCTCTT GATACATTTT TAATTCTTTA GTTTTTCCCA TATAACTAAC 1620

CATACTACTA TCACTTACAT ATGGGAAGTC CTCATAATAT ATTACTTTAT AACGCATAAA 1680

TTCAAGCGCC CTTCCAATAC TATTCACAAA AACATGAGCA ACATGGTCAC CAAGTGAAAG 1740

CGGACAATAT ACGACACATT TGTCGTCTAA ATGCATTAAC AGCTCTTTTA TGATATCATT 1800

CTTTAATGTG TCCTCATTTT TTAATTCACT ATAGATATGA CGGTATAGAA AATTGCCATT 1860

TCTATCTTTC CTATAGAGAC ATTCATAGTA CGATAAGTGT CTAAAATCAC ATTGTAGACG 1920

TTCACAAGCT AACCTGTCTT CTTTCTTCCT TTCTTCAATC GGATATTTCC CAAGGTTACA 1980 

CAACTTATGA AATTGCTTAG CAGAGGGCTG TAGCTGTTGG CTCAAAGGGT AACCAGAAAA 2040 

TATAGTAATA ACAAGTACAA TTTCTCCTTC TGAAGTTAAT TTTGAAATAT AATCACCACA 2100 

GGAAAAAATT GCGTCATCTA AATGTGGAGA TAAAAAGATA TACTTAGTAT TGTTACTCAT 2160 

AACCATTCCC TCTACAATTT ATCTAAAAAC TCACTAAGTG TCTGATTAAA TTCCACATCA 2220 

TCAAAAAAAT TCACCTTATT CTTAATAATG AATATTTCGT TAAATAAACA TATATATAAA 2280 

TATTTCAATA TCCTTTCAAT ATCATCCTCT AAATTCTCCT CAATATTTTG TATCAGCCCA 2340 

TTTACAATCT TATTAAAAAA GATAAGCTCT TTATCTCTAA AATTAAATAT TTTCATACAA 2400 

CTGTTGTATC GAAAAATATA TAAAATAATT TTTACTAATG TTTGAATATT TAAACAACTA 2460 

AATAAATGAG TTGTACCCGG GACACTATTT ATGTTATCAA GAACACTATC TTGAAACCTC 2520 

AACTCACAGT TCTTTTTGTG AAATTCTTTT TTATCGTTTA GATCTGATAT TTTTTTAGAC 2580 

ATTTCAACAA TCTCAGACAT TTTATATGGA TATCTAGGAT GAATGCCAAA ACTATGCAAA 2640 

ATGAACTGCA CCCCAAAAGT TAGACAGAAT AAATCTAACT TTTGGGGTGC AGTTCATAAG 2700 

ATTGGGATAT TTTTTTTTAG CTAGAACTAG TAGAAATATA TAGTCAAATA ACAGATACCT 2760 

TAAGGGTTTC TCATCTACAT AAAAAAATGA TACTTTTTTC TCTTCAGTAA TTACCTCATA 2820

AGCTTCACAA TAGAATCTCA TGTTTCCCTC CCCTATATTC TTAAATAAAA TCCTTTGGAA 2880

ATTGATATAT CTTAGTAAAA TATTGTTTAA GTTCCGGATG CGGAGCATGG GTAACAATAA 2940

TGACAGTCAA ATCCTCTCTA TCTAATATCT TACGTTCAAT CGCTAACGAA GTTCTCCTAT 3000

CGATAGCAGA AGTTCCCTCG TCAATTAATA CTATTTTCTT ATTTCTAATT AGCCCTCTAG 3060 

CTAAAGTAAT TTTTTGTTTC TGCCCTCCTG ACAGTAATCT CCCATCATCA CCAACATAAT 3120 

AATCTAAAAT GTTATTAGGA AAATCTTTTA CACTCAAACC AACTTGCTCT AAAGACTGTA 3180 

GTATTTCTTC ATCAGTATAA TTTTCTTCCA ATAAAATATT ATCTCTAATC GTACCTTCAA 3240 

ACAAATAAGC TTTTTGATCT ACATATAGAA CATTCGAAAC CATATTTAAA TAGGAGGTTT 3300 

TTTTTATATC ATCCCCGCAG AATCGCAATT CTCCACTATA ATCTCTCAAA AAGCCATTCA 3360

ATAATTTTAA TAATGTAGAT TTCCCGCTTC CACTTTCACC TAAAATTAAA TACTTTTCAT 3420

TACGTTGAAA ACAAAAATTT AAGTTTTTTA ATATTTCTTT ATCTCCATAC TTATAGCAAA 3480 

TATTTTTTGC TTCATATAAC GGAAAATCTC TATTCACCTC ATTTGGTTCG ATATCATTCA 3540 

TTTTATTTGA CTCAATTGGA TTAATTGAAT ACAATTTTAA AAAAATAGGC TTCGTACCAA 3600

TAATAGAGGA TAATTGACCT CCTAATTCAC CTAGCGCTGT AAAAATAACA CCTGTTAGTG 3660

CTCCTATTGC TTCAATAGTA CCAATTTTCA CTATTCCTTT TATTGCAAGA TAGCCTGTTA 3720

AAAAAACGAG AGATATCTGA AAAAAAATAT TGAGAAAGAA GCTAATAGCG CCTGCTAACG 3780

TTTCTACAGT TGTCTTTCTT TGTATAACCA TCTTTAATAA AATTCCTGCT TCTTTAATTT 3840

TCTTAGGCAA TACATATAAA AGATTCAAGG ACGCTAACAC ATCAAATCCA TTCAATATAG 3900

TCTCACTAGA TTTTAAAAAA GCTTCATTTT GGTTAGTTAA ATTTAGACTA ACTTCTCGCA 3960

TTTTCGATGC AAAGATTTTT GGTACAAGTA GCATAATCAT TAATGAAAAC AAGGTGGCTA 4020

CAGTCAATGA CCAATGATAG TGATTAAGAG TCACAACTGC AAATATAGTA CCAGAAATTC 4080

CTTTTATTAC TAAAAAAAGT TGTTTAAACG CCTGATCATT TAAAGTCTGA ACATCATTAT 4140

TTAGCCACGA AAGATATGTT CCTGATGATT TACTATGAAA TTCTTGATAG GTAGAGTTAG 4200

AGATGTCTGT GGCAACTCTA TTTCGAATCT CTAGATTAAA CTCTTGGATC ACTTCAACCT 4260

GATAATTTTT CACTACCCAG TCAAGGAATA TTATCCCACA CCAGACAATC ATTTGGTAGA 4320

TTGACAATTT CAAAAACCGC TCTAAATTCA TCGCAATTAA TTCATTCAAC ACCAGAGCAT 4380

TAATAGTTGC TGCATAAATT AGCAATAATT GACCAGCAAC AATAAATATC GTTAATAAAC 4440

TAAATTTTTT TATATTTGAT TTTATAATAG TATACACAAT AGTTTCTCAC TTTCTAAATT 4500

TTAATTGAAC ATAGTTTTCA TATATACAAT AGAAAAAACC AAAATGATAT AATAACATAT 4560

ATTTCAAAAA AGAAATTCGT TTCTTCTCrT GCCTTCTTGA TTACTTTTAA 4620

AGCCTTGCAT TTGTCTCCTA TTAATAGTAA CCGCTTTATG TTTAAAGAAT AATATTTCTT 4680

TGTAACCAAT ATTCTCTCGT TGAAACTCAA TAAATTAAAA TATTTCCTAC AGTAATTATA 4740

ATATTCTTCA TCTGCATTAA TTGTTTTTTG TGTCACTCCA GTGATACCGT TTTCTTTACT 4800

GTGAGCGTAG TAATTCACCA AGAATTCTCG CACTATATCA ATTTGGTATC CTTGAACAAG 4860

TAGTTTTAAT AAAACAACAC CGTCCTGATG TGAATCTATT TTCTCAAAAC CATTAATTAA 4920

TTCTAGCACC TCTTTTTTAC ACAACCAAAA TGACGTACCT GCTATATTGT GAACCATTTG 4980

AACAAACAAG GGATTTCCAA CAAAATCGGT CTTCTCCTCT TCTCGTGTAC CATTTGGATA 5040

AATTATTATT CCATAACTAC AAACTAAAGC TAAATTCTTC ATTCTACTCT TTTTAAAACA 5100

AGCCATCAAC TTTAAAATTC GATCTGGCAT ATATTCATCA TCATCGTCTA AAAATGATAT 5160

ATACTTACCT CTAGAATTTT TGATACCTAT GTTTCTGGCA TTAGTTGCAC CTAAATCTTC 5220

ATTACTTAAA ATTAACTTAA TTCTATGATT GGTATAGCCA AATTGATGGA TAATTTTATT 5280

TCTTAAATTT ACATTACTAT AATTATCATC AATAATTATA ACTTCGATAT TTTTATAACT 5340

TTGATGTAAA CAACTTTTCA CAGCTCTAAT CAGAGATTCA TACCTATTAT GTGTTGGTAT 5400

TATAATACTT ACTAATTCTT GATCTATATT CCTATCCATG ACTACTCTTC TCTAATAATT 5460

CATCATATAC TCTCATGGTT TCTACAAACA TTTTTTGCAC AGAAAAATGT TTTCTTATTT 5520 

TTGATTTACT ATTCTCACCT ATATATTTCA AATACTCAGA ATCATTGAGT AAAAAATTAG 5580 

CACAAGCACA CACTCCCTCA ACATCTTCCT TCTCAAATAA AAATCCATCA ACCCTATGTT 5640 

CAATAATTTC ACTTAACCCG CCAACATTAC TAGCTAAAAC CGGAGTTCCT TGTGACATTG 5700 

ACTCTAAAAC ACACATAGGT ATTCCTTCTG TATCAGAAGG AATATACAAT AAATCCGATA 5760 

TTTGGTAAAC TATAGTAGCT GGATAGATTT CACCAAGTAA CCTGAAATTA TCTCTACATT 5820 

TCAAATGGCA AATTTTTTCT TTCAAAGCAG CCCACATACT ACCATTTCCA GCCATAATAA 5880 

AAATCACATC TTCTCTGACT AAAAATAATT TTTCTGCAAA TTCAAGGAAT CTATCCGGCC 5940 

TTTTTTCTGG ATCCAACCTT CCAACATAAC AAATGATTTT TTGTTATTTG GAATACAAAA 6000 

TTCTTTTTTA AAGTCTTGAA CACCTACTAC ATCTAAATCG CTATTTGATA CATTAATTCC 6060 

GTTATTTATT GCAACTATCT TCTTATTTTT TATTATACTC TCCAATCTTT TTTTTCATAG 6120 

TTTCAGATAC ACAAATAAAA GCATCTCCCA TAGAATATGT CCAAAAATCA AAATAAGTCA 6180 

AGAATTTCTT TTTTAAGTTA TATTCAACCC ATCCATGGCA TGTTATCACT GTCTTAACCT 6240 

TTCCAAATCC ATTCTTGTCA AGTTTTTTTA ACATATATAA AAAATAATTA GTTGAGTAGC 6300

CATGACAGTG TATAAGTTGG ATTTTTAATA ATTTTAAAAT ATTTTTAACG TGTAAGGCAG 6360

TTTCAAAATT ATTTGAACAT TGAGTACAAT CAACATAGGC AATATCTAAA TTTTTATAAT 6420 

CATCAATAAC CTTTGAATCT CTAGATACAA TTATCAAAAT AGGGAATAGA GACA 6474 
(2) INFORMATION FOR SEQ ID NO: 156: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4792 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 156: 

TATTTAACGA TTTTTTTCAT GTCATTTCCT CCAAAATAGA ATACCTTATA ATCTTAACAG 60 

AAAAAGAGCA TTTACGCCAT TATATGATAT CTATCTCTGT GATAAGTTTT TTTTATGGGT 120 

AATTTAAAAG ACCAAACGCA AGATGGCAAT CAAGACCACT CCAAAGAGAA CTGTTCCGAC 180 

TAGATTGCGG TAGCGAAAGG CTACCCAAGC TGTTGGAAAG ACGGCTAAGA AGTCCAGTCA 240 

TTTGATTTGA GGAAGACTGC CAACCTTACC TGTCACTACG CTTGAAAGAA TCAGGGCT^ 300

GATAATGGAA ACAGGCAAAA ACTTCAAAAA ACGCTCAACA ATCGCAGGCA GGCCCTTATA 360

CTTGACCAAG ATGAAGGGAA TCATACGGGG AATCCAAGTC ACCAAGCCAG AGAAAATAAC 420 

TGCTAATAAA AGATACTTAC TGACCATCTA AAACCACCCC CATGCTACAA CCAAGTAGCG 480

TCGCAAACAG AACAGCTAGT GACTGAGACA TCACTGTCAA GAGCAAAAAG AAGGACACCG 540

CAACAACTGC TAGGATAATG AGCAGATTGC GGACAGGAAT CCGTCTTTGC ATAATCTGAA 600 

ATTGCGAAGC AAAATACCAA TAAACATCCC AACCAGGGCA AAATCCAAGC CAAAGATTTC 660 

TGGATTTGGT AGCAGGCCAC CCAGAGCCGT TCCGACTACT GTCCCCACAA ACCAAGCCAG 720 

ATAGCTGTTA AGATTGTTTC CGTGCATCCA CATAGGATTT ACCTTGTCTG TATGGGCCAA 780 

TTCACCCATC AAAACGCCAT AGGTCTCATC TGTCAAGATA CTAGACATAC CGATATTGTA 840 

CCAAAGACTG GTATGACGGA AATAAGTCGA TGCGTGTAAA CTCAACAAAA AGAGACGCAA 900 

GTTGATTAGA AAAACCGTCA TAGCAATAGC TGCCACAGGA GCTTGAACCA CAATCAGTGC 960 

CAACATGGCA AACTGGGCAC TCCCAGCATA AACAAAGAGA CTCATCAAGC CCATCTC/^C 1020 

AGGTGTCACA TAGGGCGCAC CGATAATTCC ACAGGCCAGG CCGATACTGA CATAGCCAAG 1080 

AGCCGTTGGC ATGGCTGCCT GCGCCCCCTC CTAAAATCCT TTTTCTTTCA TCTTTCTCCT 1140 

CATATTGTCT TAATAATACT CAATGAAAAT CAAAGAGCAA ACTAGGAAAC TAGCCGCAGG 1200 

TTGCTCAAAA CACTGTTTTG AGGTTGCAGA TAGAACTGAT GAAGTCAGCT CAAAACACTG 1260 

TTTTGAGCTT GTGGATAGAA CTGACGAAGT CAGCTCAAAA CACCGTTTTG AGGTTGTGGA 1320

TAGAACTGAC GAAGTCAGTA ACCATACCTA CGGCAAAGTG AAGCTGACGT GGTTTGAAGA 1380 

GAGTTTCGAA GAGTACAAGT AGGCTGAAAA GAATCCAACC ACAGCATGGA CTATTATATA 1440 

GCAGATTGAA ATAAGATGAG AACAAATCGA TTGGGAAAGT AAAATTAATT TCTATAAATG 1500 

TTTTAGCAAT TGTTTCGTAC TATTTTAGAT TCAGTCTATT ATAACACATT CAGAAAAGAG 1560

AAAAAAGTCT GTTGATTTTG ACCATCATAA AAAGACTGGC AATCCAGTCT CAAACATATA 1620 

TTATAGAAAT TCTCCACTAA ATACTTTCAC GAATATTCAG AAGCATAACA AAGGCAACTA 1680 

GAAGAAATAG CAATAAAACA AAGCTAACTG CCAGAGTTCC AAAGCTAGTA GCAATGGTTA 1740 

CCAAAGCTAT TGTAAATAAG CTAGGTAAAA CAACCGTAAT GGCACCGATA GAGGATTGAA 1800 

CTGCTCCCAT TGACTCCTCA GGTATTTGTT TAAAAACGAG TTCTTGCAAT CTAGGAGAGA 1860 

GAACACCTGC GAAAAAGGCA TCCAAGGTAC TAAAGATGAG AATCCAGTCA AAACGAACTG 1920 

TGGCAAATCC TACTAGAAGA AGCAACTGGA TGACAAGTGA GGCATAGAGA GCTGTTTTTA 1980 

TGGAAATGGT ATGTTGCAGA TAGCCACTTA CAAGGCTTCC GACAATCAGG GCTGATAATT 2040 

CTAGTGTGGC TAACAAGGCA AGAGATTGAC CAGTTTGTAA ATTCAAAAAG GGCTGGTTCC 2100 
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TTAAAAATAG AGTGGAAATA GGAACCGTAA CATTTATCAC TGCTTGACTA GTAGAGATAA 2160 

TAAACAAAAC CAAGAGCACC TTATTCATAT TCCATATCAA TTTCGATGAT TGGAGCAAAT 2220 

GCTGGCAAAA GGATTTTACA GAGAGTCCTT CTTGATAGCT AATCGTTTTT TCTACTTTCA 2280 

AGAGGTCAGT TTTTATGAAG AGGATACCTA AAAATGCGAT TAAAAAGGTA AGAGCGTTCA 2340 

GTAAGGAAAT AAACTGGATG GATAGAATGC CTAGTAAGAC TCCTCCTAGG ATATTACTGA 2400 

TTGTTTTCAC TAAACTAACA GTTGACTGTT TAAAGCCAAT AGCTTCTGCC AGATGGTCTT 2460 

GCCCAATAAT TCTAATGAAA ATCGGAGTGA GCATGGCGCC TGAAAAATAA CTCAATGTGT 2520 

CAGACAAGAG GTTAATCAGA CAAATAAATG CTACTAGCAA CAAGGAGAAA GACTGCCCTG 2580 

AAAGTGATAA AGACACTATA GAGTAAAGCA AAAATTTTGC AAAACTAATG ACTGTGTATT 2640

TCAAGACACG ATGATGTTGA AAATCCGCCA AAACTCCCAG AAAGATTTGT AGAACTTGGG 2700 

GCAGGGTTTC TGAAATCGTG ATGAGTAAAA TCGCCAAAGG GGCAAAAGAT GCATCTGCCA 2760 

CATAATTCAG GAAGGCCAGA TAAAAAATCG TATCCCCAAG CGTTGAAATC CACTGGTTGA 2820

TAGTTAATTG CCTAAAATCT CTATTTTGAA GAAATACTTT CATCACAACT CCTTCTTAAG 2880 

TTCAAATGGG AATCTTTCCC CAAGGATAGA CCGCGATACT ACTAACAACC AAAATTACAG 2940 

TAACATCAAA AGCTGACCAA TGCCATTGTA GACTATATGC AGTCCAATAG GCCAATAAAT 3000 

TGACTTTGTC ATTCTAAATA AGACTGCAAA TATAAGACCT CCACCCATAT AGAAGACAAA 3060 

GTCTGTCAAG ACCCAACCGT GATTACTAAT GTGCGAGACC CCAAATAAAA CAGCGGAACC 3120 

AAGTACATCT AGCCCCCATT TCTTTCCTTT TTCCAGAGCA GTCATCACTA ATCCACGATA 3180 

AATCATGTCT TCAAAAATGG GACCTGCAAT CACAGGATAA AAAAAATACA TCAAAAATGC 3240 

TGTAGCCCCT GTAAAAGTCG GAGCAGCATG TTGATAAGAA ATTTCATTTC GAGTAGGTGG 3300 

GAAAAGAAAA AAGGTAACGA AATTCCAAAC AACAAAAGCA AGCAGAGCTA GGAAGGAATA 3360 

GAAAAGATAG GATCCTTTAA ACTTTCTACT ATTGATTTTC TGCCATTTCC CCGACCAAAT 3420 

CATAGCAATA AGAGCAAATA AAACCACAAG AAAATTCAAC ATCATATCCG ACAGATAATA 3480 

GGCAAAGTCA GATAGCCCAG TAACAAGGTC GCTGCGTAAA ACTAGAACAC TGAACTTCTG 3540 

GTCAGCAATA ACTAGTAGAA AAACTATAAT AAAGTAGCGG TGTGAGATTA TCTTTTTCAT 3600

ATATCACCTT TCTAATATCC AAATACCAAT AAAGTAACAA TGAGTAAGAA ACTATTCCAT 3660 

GAAGCATGCA GAGCTATAGC CCAATAGATG GATCGGGTGT AGCGAAACAT CATACAAAAT 3720 

ATCAAGCCCA TTCCAAAATA CTTTATGAAA TCTGTCGTTA TCCAACCATA CTGCAAAACA 3780 

TGCATAGCGC CAAATATGGC AGCGGAAACA AGAACATCAA GATAGTATCT CTTAACTTTA 3840

GATAAACTTG TCATCAAAAG ACCACGACAA ACAACCTCTT CTGATACAGG TGCGATAATA 3900

CTAGTATAAA GTATTCGCGT AACAAAATAG CTAATTCCTG TTAAATTGGT GGCTACTTCT 3960

ACGACTGTAC TTCCATTCTG GGTACGAGGA AAGATATAGG TTGTTAGATT TGCCCACACG 4020

AACAATAAGA AAAAAGAAAG AAGGAA7VACA CCCAGGTAAG ACCAACGAAA CTGGAAACGA 4080

CCACACTCTT TCCAATGTTC ACTTTTGACA AAAGCAATTG TAGCTATAGT TCCCAGAATA 4140

AGTACCAATA AAACTTGGAA CACATAGTAC ATATTATCAG ACAAAGCAAC CATAAAATCT 4200

AAGTCTGATG TGACATTAAA AATGAGGTAA TAAGTCAAAA TCAACAAGCC AGTTGCTAGG 4260

TGAAATTTCA CTTCTTTCAT TTTCTTCATC CTATTATCTC CTATAAGAGC 4320

CGGCGGCCAA ACAATCCATC TGCTAAATCT ATAGTCCAAT CAAAAGCTCC ACGATTAGGA 4380

CTCATCCCTT GATTGCCCCA ACCAGGGTAA ATTCCTGGGA CGCCCCAACC AGATATACCA 4440

CTTCTTCCAC CACCTCCCAT AGAATTTACG AGGTTGCCTC CTCTAACATC TTGCAACTCA 4500

GCTTCTGTCA ATTCCATTGT TTCTGCAAAT TGTAAATTTA ACATCTTTTA CACTCCTTCA 4560

ATTATCTTCA TTTGTAAACC ACTTCTGCGA CCTAGGATTT GCTTCAAGTG CTTTACAAGT 4620

ACAGTATAAC ACGAACATTG GCTTATTTTA GAAAATCGCA TATTTGATAT TTTTTCTTAT 4680

AGAAATTTCA GATTTGCGAT TTTGGTGAAT TTGATTACTT CTCTGGTATA ATAAAGTTAC 4740

TACTAATGAG GAGTGGAGAA ATATGAAGAA ACAAATTTTA ACATTATTGA AA 4792
(2) INFORMATION FOR SEQ ID NO: 157:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2156 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 157: 

CCGTTCTCGG CGACGGCCAT CTGATGAAGC TATTTATGAG GGAAACTGGC AAGCTGGAGA 60 

GTCAGAGTAT CTAGTCTTTC ACCGATTGCT GTGGCAGCAG ATGTGCAGGG AAAAGGAGTT 120 

GCTCAAACCT TCTTAGAGGG CTTGATTGAA GGTTTTGATT ATCTTGATTT TCGCTCAGAT 180 

ACGCATGCTG AAAACAAGGT TATGCAACAT ATTTTTGAAA AACTTGGTTT TAAACAAGTC 240 

GGTAAGATGC CAGTAGATGG CGAACGCTTG GCCTATCAAG AATTAAAGAA ATAATGCAAA 300 

AGAAGTATGT AAAAATCCTC TACTCCTCAC CAATTGGTAT TCTATCACTT GTAGCTGATG 360 

ACCATTATTT GTATGGAATT TGGGTTCAGG AGCAGAAGCA TTTTGAGAGG GGACTAGGAG 420 

ATGAAACGAT AGAAGAAGTT GTTAGTCATC CTATTTTAGA CCCAGTTATT GCTTGCTTAG 480

ATGATTACTT TAAAGGCAAG CCTCAGGATT TATCCAACTT GCTCTTGGCG CCAATCGGAA 540

CGAATTTTGA AAAGAGAGTT TGGGACTATT TACAGGGCAT TCCTTATGGT CAGACAGTGA 600 

CCTATGGACA AATTGCTCAA GACCTGCAAG TGGCTTCTGC TCAAGCAATT GGTGGAGCAG 660 

TGGGACGCAA TCCTTGGTCT ATCCTAGTAC CTTGTCATCG TGTGTTGGGA GCAGGCAAGC 720 

GTCTGACAGG TTATGCTGCA GGAGTGGAAA AGAAAGCTTG GCTCTTGGAG CATGAAGGAG 780 

TAGATTTTAA AGATAGAAGC AATAGAAGGA GAAGCACATG TTAGAATTTA TCGAATACCC 840 

CAAATGTTCA ACTTGTAAAA AAGCAAAACA AGAATTAAAT CAATTAGGTG TGGACTATAA 900 

AGCCGTCCAT ATCGTGGAAG AAACACCTAG CCAAGAAGTC ATTTTGAATT GGCTAGAAAC 960 

CTCAGGATTT GAATTGAAGC AATTTTTCAA CACCAGTGGT ATCAAATACC GTGAATTAGG 1020 

GCTAAAAGAT AAGGTAGGAA GTTTGTCAAA CCAAGAAGCG GCTGAGTTGC TAGCAAGTGA 1080 

CGGTATGTTG TTAAAACGGC CCATTTTAGT AGAAAATGGA ACTGTTAAGC AAATCGGTTA 1140 

TCGAAAATCT TATGAGGAAC TGGGACTGAA ATAGTTTTTA TCTATCTCTT TGATAGATAA 1200 

AATATATAAC TTCCCTGTTT CAAAGTATGA TAAACTAGTA GGTAGACAAA GTCTGTATCT 1260 

GACCGTAGCA AATAATTTCA TTGACGGCAG AAGCATGGTA GCATGAATCA TTATCAGAAG 1320 

AGGATGTTTT TATGAATGTT ACAACGATTT TAGCATCAGA TTGGTACCAA AACTTGATGC 1380

AATTGATTCC GGATGGCAAG CTGTTTAGCC TACGTTCGGT CTTTGATGGA ATCCCTAGAA 1440

TTGTCCAACA ACTTCCAACA ACAATTATGT TGACAATTGG TGGTGCCCTT TTTGGCTTGG 1500 

TTTTGGCGCT TCTTTTTGCC ATTGTGAAGA TCAATCGTGT CAAGATTTTA TATCCCTTGC 1560 

AGGCCTTCTT TGTTAGTTTC TTAAAAGGGA CACCGATTTT GGTGCAACTC ATGTTGACCT 1620

ACTACGGAAT CCCTTTGGCT TTGAAAGCCC TCAATCAGCA ATGGGGAACT GGTCTCAATA 1680 

TCAATGCGAT TCCAGCTGCA GCTTTTGCGA TTGTCGCCTT TGCCTTTAAT GAGGCAGCTT 1740 

ATGCTAGTGA AACCATTCGT GCAGCCATTC TCTCAGTTAA TCCTGGTGAG ATTGAGGCGG 1800 

CACGCAGTCT GGGTATGACC CGAGCGCAAG TTTATCGACG AGTGATTATT CCTAATGCAG 1860 

CGGTGGTAGC TACTCCAACC TTGATTAATT CCCTCATCGG TTTGACCAAG GGAACATCTC 1920 

TAGCTTTTAG TGCGGGTGTT GTGGAAGTCT TTGCCCAAGC TCAGATTCTA GGTGGAGCTG 1980 

ATTATCGCTA TTTTGAACGC TTCATCTCCG TTGCCCTTGT TTATTGGGTA GTCAATATCG 2040 

GAATTGAAAG CCTCGGTCGT TTCATCGAGA GAAAAATGGC TATTTCTGCA CCTGATACAG 2100 

TGCAACAGAT GTGAAAGGAG ACCTTCGTTA ATGATTAAGA TTTCGAATTT AAGCAA 2156 
(2) INFORMATION FOR SEQ ID NO: 158:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3140 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 158: 

GTATCTCTAC ACATGTCTTC AATCGATTTT GTTGTCCTCC AATTTAATTC CTTATATGCT 60 

TTGTCTGCAT TTGCATAACA AGTTGCAACG TCTCCTGAAC GTCTTGGAAC TATTTTATAA 120 

GGAATAGGGA TCTTATTAAC ACTTTCAAAT GTATTTACAA GTTGTAATAC ACTAGTGCCT 180 

TCTCCCGAGC CTAGGTTATA GATATAAACA TCTGTTTTTT CAGATACTTT TTCTAAAGCT 240 

TTTATATGTC CTATTGCTAA ATCTACTACA TGGATATAAT CACGCACACC AGTACCATCA 300 

AGCGTATCAT AATCATTTCC GAACACACTT AGCTCTGATA GCTTACCTAC CGCTACTTGT 360 

GCAATATAAG GCATCAAGTT GTTAGGAATT CCTGAGGGAT CTTCCCCAAT CAAACCAGAC 420 

TCATGAGCAC CAATTGGATT GAAATAACGA AGCAACGCAA TACTCCATTC TGAATCTGCC 480 

ACATGAACAT CTTTTAAAAT TTGCTCAAGC ATCACTTTCG TATACCCATA AGGATTTGTC 540 

GCACTTGTTT GCATCGTCTC AATTAGAGGT GACTGATTGT TAATTCCATA TACAGTCGCA 600 

CTTGAAGAAA AGACAATCTT TTTAACATTA AATTCTGACA TCACTTCAAC AAGTGCCAAT 660 

CTACTCATAA TATTATTTTT GTAGTACATC ACAGGCTTTT GCACGGATTC TCCGACAGCT 720

TTATAACCTG CAAAATGAAT TGCAGCATCA ATCGATTCTT GTTCAAATAC CTTTCTCAAT 780 

GCTTGTTTAT CACAAACATC TAATTCGTTVA AACACGGGAC GTATTCCTGT AATTGCTTCA 840 

ATACGGTCTA GCACCAAGAT GCTAGAGTTC GAAAGGTTGT CGACAATGAT AACTTCCTTT 900 

CCTAAATTTA GTAATTCTAC TACGGTATGG CTACCAATAT AACCAGCTCC GCCTGTTACC 960 

AATATTGCCA TCTGGGTTTC CTCCTAATTA ATTCCAACCG ACTTAACAAA TCTCATAAAC 1020 

GCTTCATGCC CAGACGGTGT ATTCTTATAA ACTCCTGCAT CTTCCAGAAC TCTCGCAAi'.C 1080 

ACTTGTCCTG CTTCGTGTTG AACTACGCTA TTAACCTCTT CTTTATTAAT GCGAGGATAT 1140 

TTTTCTTTCA ATTGGTCGGC CCATTCTAAA TGATAATCCG CAATTGCATT ATCCTCTCCT 1200 

AAAAGATATT TTCCAACTTC TTCTAACTCT GGTTTCAAAC GAGGTGGTAA TATCGCAAGT 1260

CCCATCACTT CGATTAACCC GATATTTTCC TTTTTAATAT GTTGTACATC TTGATGAGGA 1320 

TGGAAAACAC CATCTGGGTA TTGTTCAGTA GTATGATTAT CTCTTAGAAC AATATCTAAT 1380 

TCGTATCTCC CGTCCACTTT ACGAGCAATA GGAGTCACCG TATGGTGTGG GACATCTTCA 1440 

GTCATAGCAA TGATGTCTAC TTCTAAATCT GAATATTCTC TCCACTTATT TAGAATTTTA 1500

GTAGCTAAAT CTAACAAGCG ATTTTTATTT TCACTTTGTA ACCTAATTAC TGACATTGGC 1560

CATTTTACAA TACCAGCATT AACATCCTCA AAGTCTTTAA AACAAAATTC ACTCTCAAAT 1620 

TTTGCTTTTT CCATTGGGAA AATATGTTTC CCTCCCTGGT AGTGGTTATG ACTAAGAATG 1680 

GAGCCTCCTG AGATAGGAAG ATCAGAATTT GAACCAGCAA AATATCCTGG CAAAATATCA 1740 

ACAATCTCCA ATAATTGTTC AAATGTTTTA GAGGTAATAG CCATTGGTAC ATGTTGACTA 1800 

TTCAAAAATA TCGCATGCTC ATTAAAGTAT GAGTAGGGAG AATACTGGAA TCCCCATACT 1860 

TCGTCACCAA GTTTCAACCG AATAATTCTA TGATTCGAAC GTGCTGGATA ATTTATTCGC 1920 

CCCTGATATC CTTCATTTTC CATACATAGT AAACATTTGG GATAATTAGT TGCTTTTACT 1980 

AATTTTTCAG CAGCAATTGT TTTTGGATCT TTTTCGGGTT TTGACAAATT TATCGTAATC 2040 

TCTAGCTCTC CGTATTTAGT TGATGCTCGA AACTCAATAT TCTTAGCAAT AGCAGAAGTT 2100 

TTAATATAAT CACTATCTTT ACTTAACTTA TAAAACTCTT CAACTGCTTC TTGAGGTGAT 2160 

ATATCATATG AACTCCAAAA AATATCATTT AATCGACTAG GTAAAGGAAC TATGAAATTC 2220 

ATTAACTCTG CTCCTAAACA TTCCTTTTCC TCGATTAAAT CTTTAATTTT ACCGTTTTTT 2280 

AAGGCGATTT CCACTAAGTA ATCTTTTATT TGTTTCAGGT CATTTTCATC GGAAATGCGA 2340

TCAATTCCCT CCTCACCTAT TAACGCTAGT ACTCTATTTT TCACATATAT TTTGTCAATT 2400 

TCATTATACA TTCCGTATTC 7ATTACTCTA TCAACAAAAT TATCAATAAT TCTTTTCATA 2460 

TATTTTTCTT TCTAATTTAT GTTCCCATAT TTTCTATACA TTATCCATTT ATAAATTGCT 2520 

TGCGTAGTAT GAGCAATTTT ATCAAGGTGA TGAATAATAT CTAAAGCACT AATTACTTCA 2580

GAAACGTTCC CATCATCTTC AAATATGTAA TTCATTATTT TCTTTTCCAT ATTTATACTA 2640

AGCTCTTCTA TCTCATTCTG TTTTTGTATA ACAACCATAT CTAAACATCC AGATTGTTCC 2700 

TCTCTATAAC AAGATATAGC CCTATTCATA TGCAGTCCGA TAACTTCATG AAGTATTTTT 2760

ATTTTTGAAA TAATTTTCTT CAAAATTTCA TTATTTTGAA GAATCTGTAG ATTTTTTAAA 2820 

ATTTCAACAA TTCTATCCCC AATACGTTCA ATGTCAGTTG ATATTTTTAT TACACTAATA 2880 

ATTCTTCTTA AGTCATATGA AACAGGATGT TGTAAACAAA TTAACTCATA TCCTTTTTTA 2940 

TCAATATTTA GAACTGACTC ATTTATGATT AAATCTTCTT TAATCAATTC TACTCGTTCT 3000 

TCATTTGATA AATATTCAAA TAACTTCTCA TATTTATCAA GCACAGATAC CCAAATGGTC 3060

TCTAAATTAT TTGATAATTC TATAATTTCA TTTTCTAAAT ATAACCTTAA CATTTAGGTA 3120 

CCTCTTCTTA ACAAAGTTCG 3140 
(2) INFORMATION FOR SEQ ID NO: 159:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 9048 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 159: 

CCGGATGATT TCCTGGTCAG ATAGGGGGAA AGTGACTTCC TCAGCAATCG CGCGTAGAGT 60 

AGGATTCCCT TCACGGATAA TATCGTTCAT ATCAATTAAG TGAGCAGCTT TTGTAATACG 120 

TTCTATTGCA GACATTTTCT CTCCTTATAT TATGTTTAGT GCAGTTAGCT ACTGCCAAAG 180 

CCCAAGTGGT ATACTTGGAA TAAGCCACTG TGGATTAGTT CATTTTCTTT CATTACCTCT 240 

ACATGATATC ACAAAATGAC AAGAATTGAA AGCATTATGG CATTTAGGAT TTATAGAAAA 300 

TAGATAGGAA GTTCAATTCA ATTGTGAAAG AAATACTTAT CTGTGATATA ATAAAAAGAA 360 

AAGGCTTGCA TAAGAAAGTA GGGAGAACGA AGATACAAAG AAGACAAAAT CGAAATCAGG 420 

GTGGTTTAGC TTTTCGTTTT ATG7AGGGCT TGGTAAACTT TTTAGGAGTT ATCGCAAGTG 480 

GAGCAATAAG GGATTTGTGG CGATACTCTT GCTAGCAGTT GGTTTATCAA TGGGCTTGGT 540 

CTTGTTGTTT GAAAGCTTCC AAGGAATCCC TTGACTAGTC AAAAACGAGA TACTATTTCT 600 

CAAGAGGGGA CTAAGCAAAA GTCTCAGGAG TAGGAAGAGG AAAAAACTGC CAGAATTATG 660 

UCCCACGGGG ATTTGCTCTA CCACGATGGA CTTTTCTTTT CAGCTAAAAA AGAAGACGGT 720 

ACCTATGACT TTCATGAAAA TTTTGAGTAT GTGACTCCTT GGCTCAAGCA AGGGGACTAA 780 

GCAGCAGATT TAGCTATTGG TGATTTTGAA GGAACCATTA ATAAGGATCA TTATTTAGCG 840 

GGTTATCTTC TCTTTAATGC TCCTGTTGAA GTTATGGATG CTATTAAGGA GGCAGGTTAT 900 

CATGTGCTGG ATTTAGCTCA TAATCATATT TTGGATTCGC AAATTGAGGG AGTTATTTCA 960

ACGGCCGATA TTATTGAGAA AGCTGGAATC ACTCCAATCG GAGTTTATAC GCACGAACCA 1020

CGTGATCAGG CTCCGCTGGT CATTAAGGAA GTGAATGGTA TCAAGGTTGC ATTGTTAGCC 1080 

TATTCCTATG GTTTCAATGG AATTGAGCAG TATATTTCTC AGGAAGACTA TAATCGTTAT 1140

CTTTCAGATT TAAACGAAGA TAAGATGAAG GTTGAAATTG AACGGGCAGA GAAGGAAGCA 1200 

GATATCACCA TTATCATGCT TCAGATGGGT GTTGAGTATC GATTGGAACC AACTGAAGAA 1260

CAAAAAGCTC TTTATCACAA GATGATCGAT TTGGGAGCGG ATATTATCTT TGGAGGGCAT 1320 

CCTCACGTTG TTGAACCATC TGAAACGGTT GAAAAAGATG GAGATAAGAA ACTCATTATC 1380 

TATTAAATGG GGAACTTCAT TTCCAATCAA CGAATTGAAT CTATGGGAGA TGAAGAGAAT 1440 

GCTAAGTGGA CTGAACGTGG TGTTCTCATG GATGTCACCA TCAAGAAGAA GGATGGAAAA 1500

ACAACTATCG GAACAGCTAA AGCTCATCCT ACTTGGGTCA ATCGAACACC AAAGGGAACC 1560

TTTTCACCAG AAGGATATCC CTTGTATCAT TACCAAACTT ATATTTTGGA AGATTTTATA 1620 

GAGGATGGCA GTCATCGTGA CCAGTTAGAT GAAGCGACTA AGGAACGAAT TGATACAGCC 1680 

TATAAAGAAA TGAATGAACA TGTGGGATTG AAGTGGTATT AGCTTGAATC CAGAGGAAAG 1740 

TAAATGATGA TTAAGGTAAT TGCGACAGAT ATGGATGGGA CCTTGCTGGA TGCTAGAGGT 1800 

CAGCTTGATC TCCCACGATT GGAAAAGATT TTAGATCAGT TGGATCAAAG GGGCATTCGT 1860 

TTTGTCATTG CGACGGGCAA TGAAATTCAC CGCATGAGAC AACTACTGAG TCCCTTGGTG 1920 

GATCGAGTGG TTCTGGTTGT TGCTAATGGC GCTCGTATTT TTGAAAACAA TGAATTGAIT 1980 

CAGGCTCAGA CATGGGATGA CGCCATTGTC AACAAGGCTT TGACTCATTT CAAGGGTCGA 2040 

GCGTGTCAGG ACCAGTTTGT TGTAACGGGG ATGAAGGGTG ATTTTGTCAA GGAAGGTACG 2100 

ATTTTTACAG ATCTTGAAAG TTTTATGACT CCAGAAATGA TTGAAAAATT CTACCAACGG 2160 

ATGCAATTTG TGGATGAATT AACATCTGAC CTCTTTGGTG GTGTGCTCAA GATGAGCATG 2220 

GTTGTTGGTG AGGAACGTTT GAGTTCGGTT TTGGAAGAAA TCAATGCTCT CTTTGATGGC 2280 

CGTGTCCGAG CTGTATCCAG TGGCTATGGT TGCATTGATA TCCTCCAAGC TGGGATTCAT 2340 

AAAGCATGGG GCTTGGAGGA ATTACTCAAG CGCTGGGACT TGAAATCCCA AGAAATCATG 2400 

GCTTTTGGTG ATAGTGAAAA TGATGTTGAA ATGCTTGAAA TGGCTGGAAT TGCCTATGCG 2460

ATGGAAAATG CTGATGAGAA AGCCAAAGCT GTGGCGACTG CTCTAGCACC AGCCAACAGC 2520 

CAAGGAGGAG TTTATCAAGT CTTGGAAAAC TGGTTAGAAA AAGGAGAATG AAGTGGCAGT 2580 

ACAGTTATTA GAAAATTGGC TCCTAAAGGA ACAAGAAAAA ATTCAAACTA AGTATCGTCA 2640

CCTAAATCAC ATTTCTGTTG TAGAACCAAA CATTCTTTTT ATTGGGGATT CCATTGTCGA 2700 

GTATTATCCT CTACAGGAGC TATTTGGGAC TTCAAAGACG ATTGTCAATC GAGGAATTCG 2760 

TGGCTATCAG ACAGGACTGT TACTAGAGAA CCTTGATGCT CATCTATATG GTGGAGCAGT 2820 

AGATAAAATT TTTCTTCTGA TTGGGACAAA TGATATCGGA AAGGATGTTC CTGTGAATGA 2880 

GGCTCTCAAT AATCTCGAAG CTATCATTCA ATCCGTTGCT CGCGATTATC CATTGACAGA 2940 

GATTAAATTG CTTTCCATTT TGCCTGTCAA TGAGAGAGAG GAGTACCAGC AGGCAGTCTA 3000 

TATCCGCTCG AATGAAAAAA TTCAGAACTG GAATCAAGCC TATCAAGAGC TTGCATCTGC 3060 

CTATATGCAG GTGGAATTTG TGCCAGTATT TGATTGTTTG ACAGACCAAG CAGGCCAACT 3120 

CAAAAAAGAA TATACAACTG ATGGACTGCA CCTCAGTATT GCTGGTTATC AGGCTTTGTC 3180 

AAAATCCTTG AAAGACTATC TTTACTAAAT AGCTAAATAA TGTTAAATTT GAGCATAATA 3240

TCTTGTAAAA AATTCTAAAA TCCTTTAAAA TAAAAAGTGA CGGAGGAATT TATGAATGTA 3300

AATCAGATTG TACGGATTAT TCCTACTTTA AAAGCTAATA ATAGAAAATT AAATGAAACA 3360 

TTTTATATTG AAACCCTTGG AATGAAGGCC TTGTTAGAAG AATCGGCCTT TCTGTCACTA 3420

GGTGACCAAA CGGGTCTTGA AAAGCTGGTT TTAGAAGAAG CTCCCAGTAT GCGTACTCGT 3480

AAGGTAGAGG GAAGAAAAAA ACTAGCTAGA TTGATTGTCA AGGTGGAAAA TCCCTTAGAA 3540

ATTGAAGGAA TCTTATCTAA AACAGATTCG ATTCATCGAT TATATAAAGG TCAAAATGGC 3600

TACGCTTTTG AAATTTTCTC ACCAGAAGAT GATTTGATTT TGATTCATGC GGAAGATGAC 3660

ATAGCAAGTC TAGTAGAAGT AGGAGAAAAG CCTGAATTTC AAACAGATTT GGCATCAATT 3720 

TCTTTAAGTA AATTTGAGAT TTCTATGGAA TTACATCTCC C7UVCTGATAT CGAAAGTTTC 3780 

TTGGAATCAT CTGAAATTGG GGCATCCCTT GATTTTATTC CAGCTCAGGG GCAGGATTTG 3840

ACTGTGGACA ATACGGTTAC CTGGGACTTA TCTATGCTCA AGTTCTTGGT CAATGAATTA 3900

GACATAGCAA GTCTTCGCCA GAAGTTTGAG TCTACTGAAT ATTTTATTCC TAAGTCTGAA 3960 

AAATTCTTCC TTGGTAAAGA TAGAAATAAT GTTGAATTGT GGTTTGAAGA AGTATGAAGT 4020 

GGACCAAGAT TATTAAAAAA ATAGAAGAAC AAATCGAGGC AGGGATTTAT CCCGGAGCCT 4080 

CTTTTGCGTA TTTTAAGGAC AATCAATGGA CAGAGTTCTA TTTAGGCCAG AGTGACCCAG 4140 

AGCATGGCTT GCAGACTGAG GCAGGACTAG TTTATGACCT AGCTAGTGTC AGCAAGGTTG 4200 

TTGGGGTTGG (T^.CACTTTGT ACCTTCTTGT GGGAAATAOu TCAATTAGAT ATTGATAGAC 4260

TGGTAATAGA TTTTTTACCT GAGAGTGATT ATCCAGACAT CACTATTCGC CAGCTCTTGA 4320

CTCATGCAAC AGACCTTGAT CCTTTTATTC CTAATCGTGA TCTTTTAACA GCCCCTGAAT 4380

TAAAGGAAGC GATGTTTCAT CTCAACAGAC GAAGTCAGCC AGCCTTTCTT TATTCGGATG 4440

TCCATTTTTT GCTGTTGGGC TTTATTTTGG AAAGAATTTT TAATCAAGAT TTGGATGTGA 4500

TTTTAAAGGA TCAAGTCTGG AAACCTTGGG GAATGACGGA AACTAAGTTT GGGCCAGTTG 4560

AGCTTGCTGT TCCAACAGTT AGAGGTGTAG AGGCAGGCAT AGTGCATGAT CCCAAGGCTC 4620

GTCTCCTGGG TAGACATGCT GGGAGTGCTG GTTTATTTTC GACTATAAAG GATTTACAAA 4680 

TCTTTTTAGA ACACTATTTA GCAGATGATT TTGCAAGAGA CTTAAATCAA AATTTTTCTC 4740 

CTTTGGATGA CAAGGAACGT TCTTTAGCAT GGAATTTGGA AGGAGATTGG CTAGACCATA 4800

CGGGCTATAC AGGTACCTTT ATCATGTGGA ATCGTCAGAA GCAAGAAGCC ACTATTTTCC 4860 

TATCGAATCG TACCTATGAA AAGGACGAGA GAGCTCAATG GATATTAGAC CGCAATCAAG 4920 

TGATGAACTT GATTCGCAAA GAAGAGTAAG GAGAGACATG TCAAATAGTT TAAAAGGGAC 4980 

TTTACTAACA GTTGTGGCTG GTATTGCTTG GGGGTTGTCA GGAACGAGTG GCCAATACCT 5040

AATGGCACAC GGAATTTCGG CTCTGGTCTT GACTAACTTG CGTCTTTTAA TCGCTGGTGG 5100

AATTCTCATG CTCTTGGCTT ATGCTACTGC AAAGGATAAA ATACTGGTCT TTTTAAAGGA 5160

TAGAAAGAGT TTGCTGTCTC TTCTTATTTT TGCTCTGATT GGTCTTTTTC TCAACCAATT 5220

CGCCTATCTG TCTGCTATTC AGGAGACCAA TGCGGGAACA GCGACGGTGC TTCAGTATGT 5280

TTGTCCTGTC GGAATTTTAA TTTATAGCTG TATCAAGGAT AGGGTGGCAC CGACACTGGG 5340 

AGAGATAGTT TCCATCATAT TCGCCATCGG AGGAACCTTC CTGATCGCAA CACATGGGCA 5400 

GTTGGACCAG TTATCCATGA CACCTGCTGG TCTGTTCTGG GGTCTCTTTT CTGCCTTGAC 5460 

TTATGCTCTG TATATCATTT TACCCATAGC CTTGATTAAA AAGTGGGGGA GCAGCTTGGT 5520 

CATTGGTGTG GGAATGGTCA TAGCAGGTTT GGTCGCCCTT CCTTTTACAG GGGTTCTACA 5580 

GGCCGATATC CCGACTAGTC TTGATTTTCT CCTTGCGTTT GCAGGCATTA TCCTTATCGG 5640 

GACTGTCTTT GCCTATACAG CTTTCCTTAA AGGAGCCAGT CTGATAGGAC CGGTCAAGTC 5700 

AAGCTTGTTG GCTTCAATTG AGCCAATATC GGCGATTTTC TTTGCCTTCT TAATAATGAA 5760 

TGAACAATTT TATCCCATTG ATTTTCTTGG TATGGCAATG ATATTGTTTG CTGTAACTTT 5820 

GATTTCTTTG AAAGATTTAT TCTTAGAAAA ATAAAAAAGA CTCTTTGTCC GTGACAGAGA 5880 

GTTTTTGCGT GGTAATCTAA TTATTTTCAA GATAAAATTC AAAGCGTTCG CCTACATATT 5940 

GACTTTTTAC GTATTCAAAA GCAGTACCAT CTTCTAGGTA GGAAACCTGG GTCAATCCAA 6000 

GAATAGCATG TCCTTTTTCA ACTTCCAAAT AGTGGGCAAT CTTTTCTTTA GCAAGGCGAG 6060 

CATAGATGGT CTGTTGAGAT TTGCCGATAC GATAGCCATG TTTTTGCAAG GTTTGGAAGA 6120

AATGACTGGT GATTTCTTCT TTTTTAAAGT CCTTAATGAA TTTTTCAGGA ATAGAAGCAA 6180 

CTTCATAAAC TAGGGGAACT TGGTCGGCAT AGCGGACCCG CTCCATTCGG ATAATATTGT 6240 

CCGTTGGAAA AATTCCTAGC TTGGCAACTT CTTGCTCATT GGGAATGGTT TTTTTGTAGG 6300 

AAATGAGGTG GCTAGAGGGA ACTTTACCTT GGGATTTGAC AATTTCAGTA AAACTGGTTG 6360

TCCCTCGCAT CTTTTCTTGT ACTCGAGTAC TGGAAACAAA GGTGCCGCTT CCTACACGGC 6420 

GCTCTAAGAC GCCTTCTTCG ACTAATAGAG ATACGGCTTG GCGGAGGGTC ATGCGACTGA 6480 

CCGCAAACTG CTCAGCTAAA TCTCTTTCAC TGGGAAGCCT CTCACCAATA GCCCAACGGT 6540 

ACTCGTCAAT ATCCTTTTTT ATCTGATCAT GGATTTTTAT ATAAGCAGGT AGCATATTTT 6600 

TCACTTCATT TCTATCTTTT CTCTATTGTA CCCCAATAAA CTAGAAAAAG TCAAACTTCG 6660 

CCTTGTTTAG TTGGTAATTC GCCCTTATTT GTGATAGAAT ATTGAGAAAA GATATTTCTT 6720 

TTGAGAAAGG AAAAAGATGA GCAACATTTC AACTGATTTG CAAGATGTAG AAAAAATCAT 6780

CGTATTGGAC TATGGTAGCC AGTACAACCA GCTGATTTCA CGCCGTATCC GTGAGATTGG 6840

TGTTTTTTCA GAACTAAAAA GCCATAAAAT TTCAGCTGCT GAAGTTCGTG AAGTCAATCC 6900

TGTAGGAATT ATTCTATCAG GTGGTCCAAA TTCTGTATAT GAAGATGGTT CATTTGATAT 6960

TGACCCAGAA ATCTTCGAAC TCGGAATTCC AATTTTGGGA ATCTGTTATG GTATGCAGTT 7020

ATTGACCCAT AAACTTGGAG GAAAAGTTGT TCCTGCAGGT GATGCTGGAA ATCGTGAATA 7080

CGGTCAATCA ACCCTAACTC ACACACCATC AGCGCTTTTT GAATCAACAC CTGATGAACA 7140

GACTGTTTTG ATGAGCCATG GTGATGCGGT TACTGAGATT CCTGCTGACT TTGTTCGTAC 7200

AGGTACATCA GCTGACTGCC CATACGCAGC CATCGAAAAC CCAGATAAAC ACATTTACGG 7260

TATCCAATTC CACCCAGAAG TTCGTCATTC TGTATACGGA AATGATATCC TTCGTAACTT 7320

TGCCCTTAAC ATTTGTAAGG CTA7VAGGTGA CTGGTCAATG GATAATTTCA TTGACATGCA 7380

GATCAAAAAA ATTCGTGAAA CCGTCGGTGA TAAACGTGTC CTTCTTGGTC TATCAGGTGG 7440

TGTTGACTCA TCTGTCGTTG GGGTTCTTCT CCAAAAAGCG ATTGGCGATC AATTGATCTG 7500

TATCTTCGTA GACCACGGTC TTCTTCGTAA AGGCGAAGCT GATCAAGTTA TGGACATGCT 7560

CGGTGGTAAG TTTGGTTTGA ATATCGTCAA AGCAGACGCT GCTAAACGTT TCCTTGACAA 7620

ACTTGCTGGC GTTTCTGACC CTGAACAAAA ACGTAAAATC ATCGGTAACG AGTTTGTCTA 7680

TGTATTCGAT GACGAAGCAA GCAAGCTCAA AGATGTGAAA TTCCTTGCTC AAGGTACTTT 7740

ATATArAGA.T ^TTATCGAGT CTGGTACGGA TACAGCTCaa AL'TATCAAGT CACACCACAA 7800

CGTGGtGGTC TTCCAGAAGA TATGCAGTTT GAATTGATTG AACCACTCAA TACTCTTTAC 7860

AAGGATGAAG TTCGTGCTCT TGGTACAGAG CTTGGTATGC CAGACCATAT CGTATGGCGC 7920

CAACCATTCC CAGGACCAGG ACTTGCTATC CGTGTCATGG GTGAAATCAC TGAAGAGAAA 7980

CTTGAAACCG TTCGTGAATC AGACGCTATT CTTCGTGAAG AAATCGCTAA AGCTGGACTT 8040

GACCGCGATA TTTGGCAATA CTTCACTGTT AACACAGGCG TTCGTTCAGT CGGTGTTATG 8100

GGTGACGGTC GTACGTATGA CTACACGATT GCAATCCGTG CTATCACTTC TATCGATGGT 8160

ATGACTGCTG ATTTTGCCAA AATTCCATGG GAAGTACTTC AAAAAATCTC AGTACGTATC 8220

GTAAATGAAG TGGATCATGT TAACCGTATC GTCTACGATA TTACAAGTAA ACCACCTGCA 8280

ACAGTTGAGT GGGAATAATC GCAAAAAAAT TAAAAGCTTT GTAAAATCAA CGGTTACAGA 8340

GGATTAAAAA CTGTAACTGG GATTAAAACG GGAACATTTG CTAAAAAGAA TAAATTGAAT 8400

AATAGTTCCA AGTGGTTTAC ATTTGGACAA AAAATTAGAC CGTAGTTTTC AAGCTGCGGT 8460

CTTTTGATAT ATATAATGAG AATTAATGGC TCTTTGTCAA CTGTAGTGGG TTGAAGTCAG 8520

CTAAGCTCGA GAAAGGACAA ATTTTGTCCT TTCTTTTTTG ATATTCAGAG CGATAAAAAT 8580

CCGTTTTTTG AAGTTTTCAA AGTTCCGAAA ACCAAAGGCA TTGCGCTTGA TAAGTTTGAT 8640

GAGATTATTG GTCGCTTCCA ATTTGGCGTT AGAATAGTGT AGTTGAAGGG CGTTGACGAT 8700 

TTTCTCTTTG TCCTTTAGAA AGGTTTTAAA GACAGTCTGA AAAAGAGGAT GAACCTGCTT 8760

TAGATTGTCC TCAATGAGTC CGAAAAATTT CTCCGGTTCC TTATTCTGAA AGTGAAACAG 8820 

CAAGAGTTGA TAGAGCTGAT AGTGATGTTT CAAGTCTTGT GAATAGCTCA AAAGCTTGTT 8880

TAAAATCTCT TTATTGGTTA AATGCATACG AAAAGTAGGG CGATAAAAAT GTTTATCGCT 8940 

GAGTTTACGA CTATCCTGTT GTATGAGCTT CCAGTAGCGC TTGATAGCCT TGTATTCATG 9000 

AGACTTTCGA TCCAATTGAT TCATGATTTG AACACGCACA CGACTCGG 9048 



(2) INFORMATION FOR SEQ ID NO: 160: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10399 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160: 

GTACCTTTAT TGATGAATGG ACTGTTTAAA TCAGTAGCAC GCCAACCAGA TATGCTTTCT 60 

GAGTTTCGTA GTTTGATGTT TTTAGGTGTT GCCTTTATTG AAGGAACTTT CTTTGTAACT 120 

CTTGTCTTCT CATTTATTAT CAAATAAATA CATGGAACGA GAAGAAAAGG GAGGATTTTA 180 

GATGGAAGAA AGTATTAATC CAATCATCTC TATTGGTCCT GTTATCTTCA ATCTGACTAT 240

GTTAGCCATG ACTTTGTTGA TTGTGGGAGT TATTTTTGTC TTTATTTATT GGGCAAGCCG 300 

CAATATGACC TTGAAACCCA AAGGAAAGCA AAATGTACTT GAGTATGTCT ATGACTTTGT 360

TATTGGATTT ACAGAACCTA ACATTGGTTC GCGCTACATG AAAGATTACT CACTCTTTTT 420 

CCTTTGTTTA TTCCTTTTCA TGGTGATTGC CAATAACCTT GGCTTAATGA CAAAGCTTCA 480 

AACGATCGAT GGGACTAACT GGTGGAGTTC GCCAACCGCT AATTTACAGT ATGACTTAAC 540 

CTTATCTTTT CTTGTCATTT TGTTGACACA TATAGAAAGC GTTCGTCGTC GTGGATTTAA 600 

AAAAAGTATA AAATCTTTTA TGAGTCCTGT TTTTGTCATA CCGATGAATA TCTTGGAAGA 660 

ATTTACAAAC TTCTTATCTT TGGCTTTGCG GATTTTTGGG AATATCTTTG CAGGAGAGGT 720 

CATGACGAGT TTGTTACTTC TTCTTTCCCA CCAAGCTATT TATTGGTATC CAGTAGCCTT 780 

TGGAGCTAAT TTGGCTTGGA CTGCATTTTC TGTCTTTATT TCCTGCATCC AAGCTTATGT 840 

TTTTACTCTT TTGACATCTG TGTATTTAGG GAATAAGATT AATATTGAAG AGGAATAGAA 900

AGGAGTAACT GATGCACGTA ACAGTAGGTG AATTAATTGG TAATTTTATT TTAATCACTG 960

GCTCTTTTAT TCTTTTGCTA GTCTTGATTA AAAAATTTGC ATGGTCTAAT ATTACAGGCA 1020 

TTTTCGAAGA AAGAGCTGAA AAAATTGCTT CAGATATTGA CAGAGCTGAA GAAGCCCGTC 1080 

AAAAAGCAGA AGTATTGGCT CAAAAACGCG AAGATGAATT GGCTGGTAGC CGTAAAGAAG 1140 

CTAAGACAAT CATTGAAAAT GCAAAGGAAA CAGCTGAGCA AAGTAAGGCT AATATCTTAG 1200 

CAGATGCTAA ACTAGAAGCA GGACACTTAA AAGAAAAAGC CAATCAAGAA ATTGCTCAAA 1260

ATAAAGTAGA AGCTTTACAG AGTGTTAAGG GTGAGGTCGC AGATTTGACC ATCAGCTTAG 1320 

CTGGTAAAAT CATCTCACAA AACCTTGACA GTCATGCCCA TAAAGCACTC ATTGATCAGT 1380 

ATATCGATCA GCTAGGAGAA GCTTAATGGA CAAGAAAACA GTAAAGGTAA TTGAAAAATA 1440 

CAGCATGCCT TTTGTCCAAT TGGTACTTGA AAAAGGAGAA GAAGACCGTA TCTTTTCAGA 1500 

CTTGACTCAA ATCAAGCAAG TTGTTGAAAA AACAGGTCTG CCTTCTTTTT TAAAACAAGT 1560 

GGCAGTAGAC GAGTCGGATA AGGAAAAAAC AATTGCTTTT TTCCAAGATT CTGTGTCGCC 1620

TTTATTACAA AACTTTATCC AGGTTCTGGC CTACAATCAC AGAGCAAATC TTTTTTATGA 1680 

TGTGCTTGTA GATTGCTTGA ACCGACTTGA AAAAGAAACA AATCGATTTG AAGTGACGAT 1740 

TACGTCTGCT CATCCTCTAA CTGATGAACA GAAGACTCGT TTGCTCCCTT TGATTGAGAA 1800 

AAAAATGTCT CTGAAAGTAA GGAGTGTAAA AGAACAAATC GATGAAAGTC TCATTGGTGG 1860 

TTTTGTCATT tT-TGCCaaTC ACAAGACAAT TGATGTGAGT ATTAAACAAC AACTTAAAGT 1920

TGTTAAAGAA AATTTGAAAT AGAAAGTGGT GTTCTTTTGG CAATTAACGC ACAAGAAATC 1980 

AGCGCTTTAA TTAAGCAACA AATTGAAAAT TTCAAACCCA ATTTTGATGT GACTGAAACA 2040 

GGTGTTGTAA CCTATATCGG GGACGGTATC GCGCGTGCTC ACGGCCTTGA AAATGTCATG 2100 

AGTGGAGAGT TGTTGAATTT TGAAAACGGC TCTTATGGTA TGGCTCAAAA CTTGGAGTCA 2160 

ACAGACGTTG GTATTATCAT CCTAGGTGAC TTTACAGATA TCCGTGAAGG CGATACAATC 2220

CGCCGTACAG GGAAAATCAT GGAAGTCCCT GTAGGTGAAA GTCTGATTGG TCGTGTTGTG 2280 

GATCCGCTTG GTCGTCCAGT TGACGGTCTT GGAGAAATCC ACACTGATAA AACTCGTCCA 2340 

GTAGAAGCAC CAGCTCCTGG TGTTATGCAA CGTAAGTCTG TTTCAGAACC ATTGCAAACT 2400 

GGTTTGAAAG CTATTGACGC CCTTGTACCG ATTGGTCGTG GTCAACGTGA GTTGATTATC 2460

GGTGACCGTC AGACAGGGAA AACAACCATT GCGATTGATA CAATCTTGAA CCAAAAAGAT 2520 

CAAGATATGA TCTGTATCTA CGTCGCGATT GGACAAAAAG AATCAACAGT TCGTACGCAA 2580 

GTAGAAACAC TTCGTCAGTA CGGTGCCTTG GACTACACAA TCGTTGTGAC AGCCTCTGCT 2640 

TCACAACCAT CTCCATTGCT CTTCCTAGCT CCTTATGCTG GGGTTGCTAT GGCGGAAGAA 2700

TTTATGTATC AAGGTAAGCA TGTTTTGATT GTATACGATG ATCTATCAAA ACAAGCGGTA 2760

GCTTATCGTG AACTGTCGCT CTTGCTTCGT CGTCCTCCAG GTCGTGAAGC CTTCCCAGGG 2820 

GATGTTTTCT ATCTCCACAG CCGTTTGCTT GAGCGCTCAG CTAAAGTTTC TGATGAACTT 2880 

GGTGGTGGAT CAATTACAGC CCTACCATTT ATCGAGACAC AAGCAGGAGA TATCTCAGCC 2940 

TATATCGCAA CCAACGTGAT TTCTATCACT GATGGACAAA TCTTCCTTGG CGATGGCCTC 3000 

TTCAATGCAG GTATTCGTCC AGCCATCGAT GCGGGTTCAT CTGTATCTCG TGTAGGTGGT 3060 

TCTGCACAAA TCAAAGCCAT GAAGAAGGTT GCTGGTACAC TTCGTATCGA CCTTGCTTCA 3120 

TACCGTGAGT TGGAAGCCTT TACTAAGTTT GGTTCTGACT TGGACGCAGC AACACAGGCT 3180 

AAGTTGAACC GTGGACGTCG TACCGTTGAG GTCTTGAAAC AACCTGTTCA CAAACCATTA 3240 

CCTGTTGAGA AACAAGTAAC CATTCTTTAT GCTTTGACAC ATGGTTTCTT GGATACTGTT 3300

CCAGTAGATG ATATTGTTCG TTTCGAGGAA GAGTTCCATG CCTTCTTTGA TGCTCAACAT 3360 

CCAGAGATTT TGGAAACCAT TCGTGATACA AAAGACTTGC CAGAAGAAGC AGTCTTGGAT 3420 

GCTGCGATTA CAGAGTTTCT CAATCAATCT AGCTTCCAAT AAGAATAGAG GTGTCAGATG 3480 

GCAGTATCTC TAAATGATAT TAAAACAAAA ATCGCCTCAA CAAAAAATAC GAGTCAAATC 3540 

ACTAATGCCA TGCAAATGGT ATCGGCTGCT AAGCTAGGTC GTTCTGAAGA AGCTGCTCGC 3600 

AACTTCCAAG TTTACGCTCA GAAAGTGCGT AAACTTTTGA CAGATATCCT TCATGGTAAT 3660 

GGAGCTGGTG CTTCAACTAA TCCGATGTTG ATTAGCCGTT CTGTGAAGAA GACAGGCTAT 3720 

ATCGTTATCA CTTCAGACCG CGGTTTGGTT GGAGGTTATA ATTCCTCTAT TTTGAAAGCT 3780 

GTTATGGAGT TGAAAGAAGA ATACCACCCA GACGGTAAAG GTTTTGAAAT GATCTGTATC 3840

GGTGGGATGG GAGCTGATTT CTTTAAGGCT CGCGGTATTC AACCACTTTA TGAATTACGT 3900 

GGCTTGTCAG ACCAACCTAG CTTTGATCAA GTTCGTAAGA TTATTTCAAA AACTGTTGAA 3960

ATGTACCAAA ATGAACTCTT TGATGAGCTT TATGTTTGCT ACAACCACCA TGTCAATACG 4020 

CTAACCAGTC AAATGCGTGT GGAACAAATG CTTCCGATTG TTGACTTGGA TCCAAATGAA 4080 

GCGGATGAAG AGTACAGCTT GACTTTTGAA TTGGAAACCA GCCGAGAAGA AATTCTGGAG 4140 

CAGTTGTTGC CTCAGTTTGC AGAAAGTATG ATTTACGGTG CCATTATCGA TGCCAAGACA 4200 

GCTGAGAATG CTGCGGGCAT GACAGCCATG CAAACAGCGA CAGATAATGC TAAGAAAGTC 4260 

ATCAATGATT TGACAATTCA GTATAACCGT GCCAGACAGG CGGCGATTAC ACAAGAAATT 4320 

ACAGAAATCG TAGCAGGTGC TAGTGCCTTA GAATAGGCTC TAGTCCAGCT CGTATGAAAA 4380 

TGAACTTAGG ACCTAGTTGA GCTAGGAACC GACAGTATCT TATATAGAAT AGGAGAAGGA 4440

GATGAGTTCA GGTAAAATTG CTCAGGTTAT CGGTCCCGTT GTAGACGTTT TGTTTGCAGC 4500

AGGGGAAAAA CTTCCTGAGA TTAACAATGC ACTTGTCGTC TACAAAAATG ACGAAAGAAA 4560

AACAAAAATC GTCCTTGAAG TAGCCTTGGA GTTAGGAGAT GGTATGGTTC GTACTATCGC 4620

CATGGAATCA ACAGATGGGT TGACTCGTGG AATGGAAGTA TTGGACACAG GTCGTCCAAT 4680

CTCTGTACCA GTAGGTAAAG AAACTTTGGG ACGTGTCTTC AACGTTTTGG GAGATACCAT 4740

TGACTTGGAA GCTCCTTTTA CAGAAGACGC AGAGCGTCAG CCAATTCATA AAAAAGCTCC 4800

AACTTTTGAT GAGTTGTCTA CCTCTTCTGA AATCCTTGAA ACAGGGATCA AGGTTATTGA 4860

CCTTCTTGCC CCTTACCTTA AAGGTGGTAA AGTTGGACTT TTCGGTGGTG CCGGAGTTGG 4920

TAAAACTGTC TTAATCCAAG TVATTGATTCA CAACATTGCC CAAGAGCACG GTGGTATTTC 4980

AGTATTTGCT GGTGTTGGGG AACGTACTCG TGAGGGGAAT GACCTTTACT GGGAAATGAA 5040

AGAATCAGGC GTTATCGAGA AAACAGCCAT GGTCTTTGGT CAGATGAATG AGCCACCAGG 5100

AGCACGTATG CGTGTTGCCC TTACTGGTTT GACAATCGCT GAATACTTCC GTGATGTGGA 5160

AGGCCAAGAC GTGCTTCTCT TTATCGATAA TATCTTCCGT TTCACTCAGG CTGGTTCAGA 5220

AGTATCTGCC CTTTTGGGTC GTATGCCATC AGCCGTTGGT TACCAACCAA CACTTGCTAC 5280

GGAAATGGGT CAATTGCAAG AACGTATCAC ATCAACCAAG AAGGGTTCTPG TAACCTCTAT 5340

CCAGGCTATC TATGTGCCAG CGGATGACTA TACTGACCCA GCGCCAGCAA CAGCCTTCGC 5400

TCACTTGGAT                       TAAGTTGGTA CAAl"i't;GGTA TCTACCCAGC 5460

CGTTGACCCA CTTGCTTCAA GCTCACGTGC CTTGGCACCT GAAATCGTTG GAGAAGAGCA 5520

CTATGCAGTT GCTGCTGAAG TAAAACGTGT CCTTCAACGT TACCATGAAT TGCAAGATAT 5580

CATTGCTATC CTTGGTATGG ATGAGCTTTC TGATGAAGAA AAGACCTTGG TTGCTCGCGC 5640

CCGTCGTATC CAGTTCTTCT TGTCACAAAA CTTCAACGTT GCGGAACAAT TTACTGGTCA 5700

GCCAGGTTCT TATGTTCCAG TTGCTGAAAC TGTACGTGGC TTTAAGGAAA TCCTTGATGG 5760

TAAATACGAC CACTTGCCAG AAGATGCCTT CCGTGGTGTA GGTTCTATCG AAGATGTGAT 5820

TGCAAAAGCT GAAAAAATGG GATTTTAAGA GGTGATCTAT GGCTCAGTTA ACTGTCCAGA 5880

TCGTGACACC AGATGGTCTC GTCTATGATC ACCATGCCAG CTATGTATCG GTTCGAACTC 5940

TGGATGGTGA GATGGGGATC TTGCCACGAC ATGAAAATAT GATTGCGGTT TTAGCAGTTG 6000

ATGAAGTAAA GGTAAAACGT ATCGATGATA AAGATCACGT GAACTGGATT GCAGTAAACG 6060

GTGGCGTTAT TGAAATTGCC AATGATATGA TCACAATCGT CGCTGACTCT GCAGAACGTG 6120

CTCGTGATAT CGATATCAGT CGTGCAGAAC GTGCCAAACT TCGTGCAGAA CGTGCAATTG 6180

AAGAAGCACA AGACAAACAT TTGATTGACC AAGAACGTCG TGCTAAGATT GCTTTGCAAC 6240

GTGCTATTAA CCGTATTAAT GTCGGAAATA GACTATAAGA AAAAATGAAC TTGAAAATAC 6300

CAAGTTCATT TTTTATGGTG TTTTAAGGAG CAAAACGGAT GCAGACTGCT TCGGGAACAT 6360

GGAAGTCGTT GGAGAGTTCT GCTAGACGAC CATTGTCACA ATTACGTTTA AAGACAGTTG 6420

CATTGTCAGA GTCTTGATGG ACAACAATGA GAAATTTTTG GTCGGGTGTC AAATCAAAAT 6480

CACGTGGAGT CTGACCATGC GTTGGAACGA TTTCTAATAA CTCTAAGCTA CCGTCCGCAA 6540 

GGATGGTATA TACTGCGATA GAATCATGGC CACGGTTAGA AGCGTAGAGG TATTTACCGT 6600 

CTTTAGAGAG ATGAATAGCA GCGGTTCCAT TAAAGCCTTC GTAAGCTTCC GGTAAAGTTG 6660 

AAATGACCTG CATACGTTCA AATTCGCCAA CGCCATCGTA GATTAAAACT TCGATAGTAC 6720 

TATTGAGTTC ACAAATGAGA TAAGCGATTT TATAGTGGTT ATGGAAAATG ATATGGCGTG 6780 

AGCCTGCTCC TGGCTTGCTG TGATAGGTAT AGAGCTTAGA TAATTTTCCT TCTTGATCGA 6840 

GGTCATAGGT GATGACTTGG TCAGTTCCCA AGTCGCAGGT CACTAGATAG TGGTCAGGTG 6900 

TTAAATCTGT ATAGTGAACA TGGGGGGAAG CTTGATTTTC ATGTGGACCT TGGCCACTGT 6960

GTTGATCCAT ATCACTAAGT AGAAGACTAC CATCTTCCTG GCGTTTATAA ACAAGGACTT 7020 

GTCCCTTGTG ATAGTTAGCT GCGTAAACCA AATCACGCTT TTCATCGACA GCAACATAAC 7080 

AGTGGGGAGC TCCTTCTTCA ACAACATGAT TTAACACAGT CCCGTCAGTT TGATAGGCTG 7140 

CAATTCCCCC CTTATCGTCT TGGCTACCAA CAGTGTATAA ATGTTGGTGC TGGTCAAAGG 7200 

CAAGGTAGGT TGGACTTGGC TCAGCTGCAA AAAGTTCTAG ATTTGAAAGC TGACCAGTTT 7260

CTGTATCAAA GTCTGCCTTG TAAATCCCTT GAGAAGTACG ACGTGTATAA GTTCCAAAAT 7320 

AAACAGTTTC TTTCATTACT ATACCTCTGT GTAAAGATAA GACTATTATA TCACAAAAAC 7380

AAGTAAATTA AAGATATCCA ATTAGATGTA AGCACTTTAA AAAAGAGTTA TTTTGTTTCA 7440 

AAAATGGTAT AATGAGAGAA CAATAGAAAG GAAGTATTTA TGGAGCAAAA AGAGAAACAT 7500 

TTTAGCCTAT CTTGGTTTTT CAAGTGGTTT TTAGATAACA AGGCAATTAC GGTATTTTTA 7560 

GTAACCTTAT TATTGGGACT GAATCTTTTT ATTTTAAGTA AGATTAGTTT TCTATTTTCA 7620 

CCTGTTTTAG ACTTTTTAGC AGTTGTGATG TTGCCAGTCA TTTTGTCTGG TTTGTTATAT 7680 

TATTTGTTGA ATCCTATTGT TGATTGGATG GAGAAGCATA AGGTTAATCG TGTTATAGCT 7740 

ATCACTATTG TCTTTGTTAT CATCGCTCTC TTTATCATTT GGGGCTTGGC AGTCGCCATT 7800 

CCAAATCTGC AACGTCAGGT TTTGACCTTT GCAAGAAACG TTCCTGTTTA CTTAGAAGAT 7860 

ATAGATAGGA TTGTTAATGG ATTGGTAGCC CAGCACCTGC CAGATGATTT CAGACCTCAA 7920 

TTAGAGCAAG TTTTGACCAA TTTTTCTAGC CAGGCTACAG TTTTGGCAAG TAAGGTTTCA 7980

TCTCAGGCAG TCAACTGGGT GAGTGCCTTT ATTAGCGGGG CTTCTCAAGT GATTGTTGCC 8040

TTGATTATCG TTCCTTTCAT GCTCTTTTAT CTCTTGCGTG ATGGGAAAGG CTTGCGTAAC 8100

TATTTGACCC AATTCATTCC AAGAAAATTG AAGGAACCTG TTGGACAAGT TTTATCAGAT 8160

GTGAATCAAC AGTTGTCCAA CTATGTTCGA GGGCAAGTGA CAGTGGCTAT TATTGTAGCA 8220

GTAATGTTTA TCATCTTCTT CAAGATTATT GGTCTACGCT ATGCGGTTAC GCTGGGGGTT 8280

ACTGCTGGTA TTTTAAATCT GGTCCCTTAT CTTGGTAGCT TTCTAGCCAT GCTTCCTGCT 8340

CTAGTATTGG GTTTGATTGC TGGTCCAGTC ATGCTTTTGA AAGTAGTGAT TGTCTTTATC 8400

GTAGAACAAA CTATTGAAGG CCGTTTTGTC TCTCCATTGA TTTTGGGAAG TCAATTAAAC 8460

ATCCACCCTA TTAATGTTCT CTTTGTTTTG TTAACTTCAG GATCTATGTT TGGTATCTGG 8520

GGAGTTTTAC TTGGTATTCC GGTTTATGCC TCTGCTAAGG TTGTCATTTC AGCCATTTTC 8580

GAATGGTATA AGGTAGTCAG TGGTCTATAT GAATTAGAGG GTGAGGAAGT CAAGAGTGAA 8640

CAATAGTCAA CAGATGTTAC AGGCTTTGGA GGAGCAAGAT TTAACTAAGG CTGAGCATTA 8700

TTTCGCCAAA GCTTTAGAAA ATGATTCAAG TGATCTTCTG TATGAATTGG CAACTTATCT 8760

TGAAGGGATT GGTTTCTATC CTCAGGCCAA GGAAATTTAC CTGAAAATTG TAGAGGATTT 8820

TCCAGAGGTT CATCTTAATC TAGCTGCAAT TGCTAGCGAG GATGGTCAAA TAGAAGAAGC 8880

CTTTACCTAT CTTGAGGAAA TCCAAGCTGA CAGTGACTGG TATGTCTCGT CTTTGGCTCT 8940

GAAGGCAGAC CITTACCAGC T^nn ^»ACGTTT GACAGATGTG GCACOTUAUA AATTATTGGA 9000

GGCCTTGACC TACTCAGAGG ATTCTCTCTT GATATTGGGT TTGGCAGAGT TGGATAGTGA 9060

GTTGGAAAAT TACCAAGCGG CTATTCAAGC CTATGCCCAG TTAGATAATC GCTCGATTTA 9120

TGAGCAAACG GGCATTTCCA CCTATCAACG AATTGGCTTT GCCTATGCTC AGTTAGGGAA 9180

ATTTGAAACG GCTACTGAGT TTTTAGAAAA AGCCCTGGAG TTAGAATACG ATGACTTAAC 9240

AGCTTTTGAG TTGGCCAGTC TTTATTTTGA TCAAGAAGAA TATCAAAAAG CCACCCTCTA 9300

CTTTAAGCAG CTTGATACCA TTTCTCCTGA CTTTGAAGGC TATGAGTATG GGTACAGTCA 9360

GGCTTTACAT AAGGAACATC AAGTTCAAGA AGCCCTGCGT ATCGCTAAGC AAGGATTAGA 9420

GAAAAATCCC TTTGAAACTC GCCTCTTGCT AGCTGCTTCA CAATTTTCTT ATGAATTGCA 9480

TGATGCTAGT GGTGCAGAAA ATTATCTCCT TACTGCAAAA GAAGACGCTG AGGATACAGA 9540

AGAAATCTTG CTTCGTTTAG CCACTATTTA TCTGGAGCAG GAGCGTTATG AGGATATTCT 9600

AGAATTGCAG AGTGAGGAGC CAGAAAATCT TTTGACCAAG TGGATGATTG CTCGTTCTTA 9660

TCAAGAAATG GACGATTTGG ATACTGCTTA TGAGTATTAT CAAGAGTTGA CAGGAGATTT 9720

GAAGGACAAT CCAGAATTTC TGGAACACTA TATCTATCTC TTGCGTGAAT TGGGACATTT 9780



WO 98/18931



PCT/US97/19588 



GCGACCGATC GCAATCTTTA TTTACTTGTT TCTGATAATG GAATCGGTAT TTCGACAGAA 780 

GATAAAAAGA AAATTTTTGA CCGTTTTTAT CGAGTAGACA AGGCTAGAAC CCGGCAAAAA 840 

GGTGGTTTTG GTTTAGGATT ATCCCTAGCC AAGCAAATTG TAGATGCTCT AAAAGGAACT 900 

GTTACTGTCA AAGATAATAA ACCCAAGGGA ACAATCTTTG AAGTGAAGAT TGCCATTCAG 960 

ACACCATCTA AAAAGAAAAA ATAAAAATAT CGCTCCAATT GGGGCGATAT TTTGGATTTA 1020 

TCTTCTACGT TTTCGTTTGA TAATAGACCG TTGAACTTTT AAAACAAGTA AGCTGAATCC 1080 

GATTGCTGCG GCAAAGGCAA GAGCAGTTGA TAATTTTAAT GCTAAAAAGA TAAAACTAAA 1140 

GATAGCAATA CAGATACAAA AAACAGCGAT ATTAATAAAA AATAGGATTT CCTTGAGATT 1200 

GGCATCAGAT TGCGCTTCAG GTGTATAAGC TTGGTAATGA GGAAGCTGCT GGTTTAATTC 1260 

TTCTTGATAG TCTACCTCAT AGGATTGTAA TTTTCTTACG GGCATGATTC TCTCCTTAAC 1320 

AGTACATACC TATTTTATCA TTTTTTCGGC AGAGAATTAT TACAGAAAGG TTACAAAAAG 1380 

AATAAAGTCC CTTTTCATTT TCAAAGCATG GCTGATTTTG GAGAAATGTG GTATAATTTT 1440 

TCTTATGGAA AAGATTGTCA TTACAGCAAC TGCTGAAAGT ATTGAACAAG TTGAACAACT 1500 

ACTCGAAGCT GGCGTAGACC GTATCTATGT CGGTGAGAAA GATTTTGGTC TTCGTCTGCC 1560 

AACGACCTTT AGTTATGACC AATTACGTGA AATCGCTAAG TTGGTTCATG ATGCTGGTAA 1620 

GGAATTGATC GTTGCGGTCA ATGCTCTCAT GCACCAAGAT ATGATGGACC GTATCAAGCC 1680 

TTt^;ttaAAC TTCTTGGAAG AAATCAAGAC AGACTATATT ACGATTGGGG ATGCAGGCGT 1740 

CTTTTACGTA GTTAACCGCG ATGGTTATTC ATTTAAGACC ATCTACGATG CTTCAACCAT 1800 

GGTAACTAGC AGTCGTCAGA TTAACTTCTG GGGACAAAAG GCTGGCGCAT CTGAGGCTGT 1860 

TTTGGCGCGT GAAATTCCAT CAGCTGAACT TTTCAAAATG CCAGAGATTT TGGAAATTCC 1920 

TGCTGAAGTT TTGGTTTACG GTGCTAGCGT CATCCATCAT TCTAAACGTC CACTCTTGCA 1980 

AAACTACTAT AACTTTACAC ATATCGATGA TGAAAAGACG CATAAACGTG ACCTCTTCTT 2040 

GGCTGAGCCA AGTGATCCAG AGAGCCACTA TTCCATTTTT GAAGATAATC ATGGGACCCA 2100 

TATCTTTGCC AACAATGACC TTGATTTGAT GATCAAATTA ACAGAATTGG TGGAGCATGG 2160 

CTTTACTCGC TGGAAACTAG AAGGGCTCTA CACTCCTGGT CAGAACTTTG TTGAGATTGC 2220 

AAAACTCTTT ATCCAAGCGC GTAGCTTGAT TCAAGAGGGC AACTTTAGTC ATGCTCAAGC 2280

CTTCTTGCTG GATGAAGAAG TTCGTAAACT TCACCCTAAA AACCGTTTCC TTGATACAGG 2340

ATTTTATGAC TACGATCCTG ACATGGTTAG ATAAAATACA TGATTCGTTG AGAGAAGGAA 2400 

GATGCAAACA TTTCTTCTCT CAATTTTTCG TATTTCTTCA CTATTTTACA AAAATCAGCA 2460 

GGCTAGAATG CTCTATTCGA TGGGATTTTT AAGAAAAGTA GTGTTCTTGA GTTTGAAAAT 2520 







TATCCTATGT TTGCAGGTGC CAAATGGCCC TTTTTTTGGT ATAATTTTTT ATAATGAAAA 2580

CGATTGGTAA TCGCTATGTT GTGGTGGATT TAGAGGCAAC TAGCACAGGT AGTAAGGCTA 2640

AAATTATCCA AGTGGGAATT GTCGTGATTG AGGACGGAGA AATCGTCGAT CACTATACGA 2700

CGGATGTCAA TCCACATGAA CCCTTGGATG CTCATATCAA AGAACTGACA GGATTGACAG 2760

ACCAACGTCT GGCGCAAGCA CCTGATTTTT CGCAAGTTGC CAGAAAAATA TTTGACTTGG 2820

TGGAGGATGG GATTTTTGTA GCCCATAATG TTCAGTTTGA TGCTAATCTC TTGGCGGAAA 2880

ATTTATTTTT TGAAGGCTAT GAGCTAAGAA ACCCTCGTGT TGATACGGTC GAATTGGCCC 2940

AGGTCTTTTT CCCTGAACTG GAAAAATATA GCTTGCCGAT TTTGTGTCGA GAATTAGGAA 3000

TTCCTCTTAA ACACGCACAC ACAGCCCTTT CAGATGCCCA AGCTACAGCA GAATTACTTC 3060

TTTTTTTACG GAAAAAGATG ACCCAGCTTC CTAAAGGTCT CTTGGAACGC TTGCTGGAAA 3120

TGGCTGACGC TCTCCTATAT GAGTCCTACC TGGTTATTGA GGAAACTTAT CGCAACCAAT 3180

CTATCCTGAG TTCTCCAGAC TTGGTCCAAG TTCAAGGTCT ATATTTTAAG AAAACGGAAG 3240

CTTCTCTGGA GCCACGAAAA CTATCTCAAG ACTTTTCTAA AAATATTTCT CTGTTGAACC 3300

TTGAAGTGAG GGAGGAACAA GAAAGTTTTG CTAAAGAGGT TGGCTTGCTA TTGAAAGATG 3360

AACCTGTCTC TCTGATTCAA GCGCCGACAG GGATTGGGAA AACCTATGGC TATCTCTTAC 3420

CCGCTTTATC TCAATCCAAA GAGCGACAAA TTGTTCTTAG TGTTCCGACA AAGATTCTTC 3480

AAAATCAAAT CATGGAAGAA GAAGGTAAAC GCCTCAAGGA AGTGTTCCAT ACAGATATTC 3540

ATAGCTTAAA GGGACCACAA AATTATCTGA AGTTGGATGC CTTTTATCAT TCCTTGCAGG 3600

AAAATGATGA AAATCGCTTA TTTAGACGCT TTAAAATGCA AGTCTTGGTC TGGCTTACTG 3660

AGACAGAGAC AGGAGATTTG GATGAAATCG GGCAACTCTA CCGTTACCAA CATTTTCTAG 3720

CAGACCTTCG TCATGATGGG AATTTATCAT CCCAGAGCTT ATTTGTGACG GAAGATTTTT 3780

GGAAACGTAG TCAAGAAAGG GCAGAGACTT GCAAGCTTTT AGTGACTAAT CATGCCTATC 3840

TCGTAACCAG ACTTGAAGAT AATCCTGAAT TTGTCAGTGA CCGTTTACTG ATTATTGATG 3900

AAGTCCAAAA GATTTTGTTA GCTCTAGAAA ATCTGCTTCA AGAGACCTAC GATATACAAT 3960

CTATTATCGA TTTAATTGAT AAGGCTTTAG TAGGAGAAGA AAACAGGGTT CAACAACGGA 4020

TACTAGAAAG TATTCGCTTT GAGTGTCTCT ACTTGATAGA ACAATTTCAG TCTGGCAAAT 4080

CTAGGAAAAA TATCTTAGAT TCTCTGGACA ATCTCCATCA GTATTTTTCA GAATTGGAAG 4140

TAGAAGACTT TGATGAGCTG GTTCGCTATT TTACAGCTGA AGGTGATTAC TGGCTTGAAG 4200

TAACTGAAAC GAGTCAAAAG AAAATTCAGA TTTCTTCTAC AAAATCAGGC CGTACTCTTC 4260

TGTCCTCTTT ACTTCCTGAG AGTTGCCAAG TCTTGGGAGT ATCGGCTACT CTTGAGATTA 4320

GTCAGAGGGT TTCTTTGGCA GACCTTTTAG GCTATCCTGA AGCTAAATTT GTCAAGATTG 4380 

AATCTCGGGG AAAACAGGAA CAAGAAGTGG TCATGGTCAA AGATTTCCCT CTGGTAACAG 4440 

AAACCTCCTT AGAAGTCTAT GCCAGAGAGG TAGCTGCTTT ACTAGTGGAA ATTCAAGCTT 4500

TCCAGCAACC GATTTTGGTT CTCTTTACCG CTAAAGACAT GCTTCTAGCA GTATCGGATT 4560 

TACTTACAGT TAGCCACTTG GCCCAGTATA AAAATGGGGA TGTTCATCAG CTAAAGAAAC 4620

GCTTTGAAAA AGGTGAACAA CAAATCTTGC TTGGTGCAGC AAGTTTCTGG GAGGGAGTTG 4680 

ATTTTTCAAG CCATCCTTCT GTGATTCAAG TTGTACCGAG GCTTCCTTTC CAAAATCCTC 4740 

AAGAACCCTT GACGAAAAAG ATTAATCAAG AACTGAATCA AGAAGGGAAA AATGCCTTTT 4800 

ATGATTATCA ATTGCCAATG GCCATTATTC GTTTAAAACA GGCTTTGGGA AGAAGTATGA 4860 

GACGTGAATA CCAACGTTCC TTAACTCTTA TTTTGGATAG GAGAATCGTC GGAAAACGAT 4920 

ACGGCAAACA AATAGTAGCA TCTCTAGCAG AAGAAGCGAC TGTTAAAACC ATCTCTCGAT 4980

CCGAAGTTGA CGAGGCTATT GATAGATTTT TTAATGAGCT TTGATAAATA GTATTGTATG 5040 

AAAGTATAAG GTTAGTATAT ATGAAACGTT CTCTCGACTC AAGAGTCGAT TACAGTTTGC 5100

TCTTGCCAGT ATTTTTTCTA CTGGTCATCG GTGTGGTGGC TATCTATATA GCCGTTAGTC 5160 

ATGATTATCC CAATAATATT CTGCCCATTT TAGGGCAGCA GGTCGCCTGG ATTGCCTTGG 5220 

GGCTTGTGAT TGGTTTTGTG GTCATGCTCT TTAATACAGA ATTTCTTTGG AAGGTGACCC 5280 

CCTTTCTATA TATTTTAGGC TTGGGACTTA TGATCTTGCC GATTGTATTT TATAATCCAA 5340 

GCTTAGTTGC ATCAACGGGT GCCAAAAACT GGGTATCAAT AAATGGAATT ACCCTATTCC 5400 

AACCGTCAGA ATTTATGAAG ATATCCTATA TCCTCATGTT GGCTCGTGTC ATTGTCCAAT 5460

TTACAAAGAA ACATAAGGAA TGGAGACGCA CGGTTCCGCT GGACTTTTTG TTAATTTTCT 5520 

GGATGATTCT CTTTACCATT CCAGTCCTAG TTCTTTTAGC ACTTCAAAGT GACTTGGGGA 5580 

CGGCTTTGGT TTTTGTAGCC ATTTTCTCAG GAATCGTTTT ATTATCAGGG GTTTCTTGGA 5640 

AAATTATTAT CCCAGTATTT GTGACTGCTG TAACAGGAGT TGCTGGTTTC TTAGCTATCT 5700

TTATTAGCAA GGACGGACGA GCTTTTCTTC ACCAGATTGG AATGCCGACC TACCAAATTA 5760

ATCGGATTTT GGCTTGGCTC AATCCCTTTG AGTTTGCCCA AACAACGACT TACCAGCAGG 5820 

CTCAAGGGCA GATTGCCATT GGGAGTGGTG GCTTATTTGG TCAGGGATTT AATGCTTCGA 5880 

ATCTGCTTAT CCCAGTTCGA GAGTCAGATA TGATTTTTAC GGTTATTGCA GAAGATTTTG 5940 

GCTTTATTGG CTCTGTCCTG GTTATTGCCC TCTATCTCAT GTTGATTTAC CGTATGTTGA 6000 

AGATTACTCT TAAATCAAAT TU^CCAGTTCT ACACTTATAT TTCCACAGGT TTGATTATGA 6060

TGTTGCTCTT CCACATCTTT GAGAATATCG GTGCTGTGAC TGGACTACTT CCTTTGACGG 6120

GGATTCCCTT GCCTTTCATT TCGCAAGGGG GATCAGCTAT TATCAGTAAT CTGATTGGTG 6180 

TTGGTTTGCT TTTATCGATG AGTTACCAGA CTAATCTAGC TGAAGAAAAG AGCGGAAAAG 6240 

TCCCATTCAA ACGGAAAAAG GTTGTATTAA AACAAATTAA ATAAGGAGAA AATCATGGTA 6300 

AAAGTAGCAG TTATATTAGC TCAGGGCTTT GAAGAAATTG AAGCCTTGAC AGTTGTAGAT 6360 

GTCTTGCGTC GAGCCAATAT CACATGTGAT ATGGTTGGTT TTGAAGAGCA AGTAACGGGT 6420 

TCGCATGCAA TCCAAGTAAG AGCAGATCAT GTCTTTGATG GAGATTTATC AGACTATGAT 6480 

ATGATTGTTC TTCCTGGAGG TATGCCTGGT TCTGCACATT TACGTGATAA TCAGACCTTG 6540 

ATTCAAGAAT TGCAAAGCTT CGAGCAAGAA GGGAAGAAAC TAGCAGCCAT TTGTGCGGCA 6600 

CCAATTGCCC TCAATCAAGC AGAGATATTG AAAAATAAGC GATACACTTG TTATGACGGC 6660 

GTTCAAGAGC AAATCCTTGA TGGTCACTAC GTCAAGGAAA CAGTAGTGGT AGATGGTCAG 6720 

TTGACAACCA GTCGGGGTCC TTCAACAGCC CTTGCCTTTG CCTACGAGTT GGTGGAGCAA 6780 

CTAGGAGGGG ACGCAGAGAG TTTACGAACA GGAATGCTCT ATCGAGATGT CTTTGGTAAA 6840 

AATCAGTAAA ACGGGAGTTA TTCTCTCGTT TTTTATGTGG AAAACTCAGG GAAATCATCG 6900 

CTTTTTTCAT AAAAAAATGC TATAATGAAG GGTATGAAAT ATCACGATTA CATCTGGGAT 6960 

TTAGGTGGAA CTTTACTGGA TAATTATGAA ACTTCAACAG CTGCATTTGT TGAAACATTG 7020 

GCACTGTATG GTATCACACA AGACCATGAC AGTGTCTATC AAGCTTTAAA GGTTTCTACT 7080 

CCTTTTGCGA TTGAGACATT CGCTCCCAAT TTAGAGAATT TTTTAGAAAA GTACAAGGAA 7140

AATGAAGCCA GAGAGCTTGA ACACCCGATT TTATTTGAAG GAGTTTCTGA CCTATTGGAA 7200 

GACATTTCAA ATCAAGGTGG CCGTCATTTT TTGGTCTCTC ATCGAAATGA TCAGGTTTTG 7260 

GAAATTTTAG AAAAAACCTC TATAGCAGCT TATTTTACAG AAGTGGTGAC TTCTAGCTCA 7320 

GGCTTTAAGA GAAAGCCAAA TCCCGAATCC ATGCTTTATT TAAGAGAAAA GTATCAGATT 7380 

AGCTCTGGTC TTGTCATTGG TGATCGGCCG ATTGATATCG AAGCAGGTCA AGCTGCAGGA 7440 

CTTGATACCC ACTTGTTTAC CAGTATCGTG AATTTAAGAC AAGTATTAGA CATATAAGAA 7500 

AAAGGAATAA GATGACAGAA GAAATCAAAA ATCTGCAGGC ACAGGATTAT GATGCCAGTC 7560 

AAATTCAAGT TTTAGAGGGC TTAGAGGCTG TTCGTATGCG TCCAGGGATG TACATTGGAT 7620 

CAACCTCAAA AGAAGGTCTT CACCATCTAG TCTGGGAAAT TGTTGATAAC TCAATTGACG 7680 

AGGCCTTGGC AGGATTTGCC AGCCATATTC AAGTTTTTAT TGAGCCAGAT GATTCGATTA 7740 

CTGTTGTGGA TGATGGGCGT GGTATCCCAG TCGATATTCA GGAAAAAACA GGCCGTCCTG 7800

CTGTTGAGAC CGTCTTTACA GTCCTTCACG CTGGAGGAAA GTTCGGCGGT GGTGGATACA 7860

AGGTTTCAGG TGGTCTTCAC GGGGTGGGGT CGTCAGTAGT TAATGCCCTT TCCACTCAAT 7920 

TAGACGITCA TGTTCACAAA AATGGTAAGA TTCATTACCA AGT^TACCGT CGTGGTCATG 7980 

TTGTCGCAGA TCTTGAAATA GTTGGAGATA CGGATAAAAC AGGAACAACT GTTCACTTCA 8040 

CACCGGACCC AAAAATCTTC ACTGAAACAA CAATCTTTGA TTTTGATAAA TTAAATAAAC 8100 

GGATTCAAGA GTTGGCCTTT CTAAATCGCG GTCTTCAAAT TTCAATTACA GATAAGCGCC 8160 

AAGGTTTGGA ACAAACCAAG CATTATCATT ATGAAGGTGG GATTGCTAGT TACGTTGAAT 8220

ATATCAACGA GAACAAGGAT GTAATCTTTG ATACACCAAT CTATACAGAC GGTGAGATGG 8280 

ATGATATCAC AGTTGAGGTA GCCATGCAGT ACACAACTGG TTACCATGAA AATGTCATGA 8340 

GTTTCGCCAA TAATATTCAT ACCCATGAAG GTGGAACACA TGAACAAGGT TTCCGTACAG 8400 

CCTTGACACG TGTTATCAAC GATTATGCTC GTAAAAATAA GTTACTGAAA GACAATGAAG 8460

ATAATTTAAC AGGGGAAGAT GTTCGCGAAG GCTTAACTGC AGTTATCTCA GTTAAACACC 8520 

CAAATCCACA GTTTGAAGGA CAAACCAAGA CCAAATTGGG AAATAGCGAA GTGGTCAAGA 8580 

TTACCAATCG CCTCTTCAGT GAAGCTTTCT CCGATTTCCT CATGGAAAAT CCACAGATTG 8640 

CCAAACGTAT CGTAGAAAAA GGAATTTTGG CTGCCAAGGC TCGTGTGGCT GCCAAGCGTG 8700 

CGCGTGAAGT CACACGTAAA AAATCTGGTT TGGAAATTTC CAACCTTCCA GGGAAACTAG 8760 

CAGACTGTTC TTCTAATAAC CCTGCTGAAA CAGAACTCTT CATCGTCGAA GGAGACTCAG 8820

CTGGTGGATC AGCCAAATCT GGTCGTAACC GTGAGTTTCA GGCTATCCTT CCAATTCGCG 8880 

GTAAGATTTT GAACGTTGAA AAAGCAAGTA TGGATAAGAT TCTAGCCAAC GAAGAAATTC 8940 

GTAGTCTTTT CACAGCCATG GGAACAGGAT TTGGCGCAGA ATTTGATGTT TCGAAAGCCC 9000 

GTTACCAAAA ACTCGTTTTG ATGACCGATG CCGATGTCGA TGGAGCCCAC ATTCGTACCC 9060 

TTCTTTTAAC CTTGATTTAT CGTTATATGA AACCAATCCT AGAAGCTGGT TATGTTTATA 9120 

TTGCCCAACC ACCAATCTAT GGTGTCAAGG TTGGAAGCGA GATTAAAGAA TATATCCAGC 9180 

CGGGTGCAGA TCAAGAAATC AAACTCCAAG AAGCTTTAGC CCGTTATAGT GAAGGTCGTA 9240 

CCAAACCGAC TATTCAGCGT TATAAGGGGC TAGGTGAAAT GGACGATCAT CAGCTGTGGG 9300 

AAACAACCAT GGATCCCGAA CATCGCTTGA TGGCTAGAGT TTCTGTAGAT GATGTGCAGA 9360 

AGCAGATAAA ATCTTTGATA TGTTGATGGG GATCGAGTTG TCCTCGTCG 9409 
(2) INFORMATION FOR SEQ ID NO: 162: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6415 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 162: 

CCTGGGAAAG TCTTGAAAAT TATGATAGAA TGGTGGAAGG AAAAATTCAG GAGAGTAGTA 60 

GTGACTCAAA ATGTTGAAAG TCTTCTCGTA TCCATTGTAA TCAGTGCATA CAATGAAGAA 120 

AAATATCTGC CTGGTCTAAT TGAAGACTTA AAAAATCAAA CCTATCCTAA AGAGGATATT 180 

GAAATTCTAT TTATAAATGC TATGTCCACA GATGGGACCA CAGCTATCAT TCAGCAATTT 240 

ATAAAGGAAG ATACAGAGTT TAACTCAATT AGATTGTATA ACAATCCTAA GAAAAATCAA 300 

GCTAGTGGTT TTAACCTGGG AGTTAAACAT TCTGTAGGGG ACCTTATTTT AAAAATTGAT 360 

GCTCATTCAA AAGTTACTGA GACTTTTGTA ATGAACAATG TGGCTATTAT TCAACAAGGT 420 

GAATTTGTCT GTGGGGGGCC TAGACCGACG ATTGTCGAAG GAAAAGGAAA ATGGGCAGAG 480 

ACCTTGCATC TTGTTGAGGA AAATATGTTT GGCAGTAGCA TTGCCAATTA TCGAAATAGT 540 

TCTGAGGATA GATATGTTTC TTCTATTTTT CATGGAATGT ATAAACGAGA GGTTTTCCAG 600 

AAGGTTGGTT TAGTAAATGA GCAACTTGGC CGAACTGAAG ATAATGATAT TCATTATAGA 660 

ATTCGAGAAT ATGGTTATAA AATCCGCTAT AGCCCAAGTA TTCTATCTTA TCAGTATATT 720 

CGACCAACAT TCAAGAAAAT GCTGCATCAA AAGTATTCAA ATGGTTTGTG GATTGGCTTfi 780 

ACAAGTCATG TTCAGTTTAA GTGTTTATCA TTATTTCACT ATGTTCCTTG TTTATTTGTT 840 

TTGAGTCTTG TGTTTAGTCT AGCATTGTTA CCGATCACAT TCGTATTCAT AACTTTACTA 900 

TTAGGTGCCT ATTTTCTACT TTTGTCATTA CTCACTTTGC TGACTTTATT AAAACATAAA 960 

AATGGATTTC TAATTGTGAT GCCCTTTATT TTATTTTCCA TTCACTTTGC TTATGGCCTT 1020 

GGGACGATTG TAGGTTTAAT TAGAGGATTT AAATGGAAGA AGGAGTACAA GAGAACAATA 1080 

ATTTATTTGG ATAAAATAAG CCAAATAAAT CAAAATATGC TATAATAACA ATATAGTAAA 1140 

ACTCTTTTAA GGAGGAGTAG ATTTCTATGA ATAAAAAACT AACAGATTAT GTGATTGATC 1200 

TGGTGGAAAT TTTAAATAAA CAACAAAAGC AGGTTTTCTG GGGAATATTT GATATTTTCA 1260 

GTATGGTGGT TTCCATCATT GTATCTTATA TTTTATTTTA TGGGCTGATT AATCCAGCAC 1320 

CTGTTGACTA CATTATCTAT ACGAGTTTGG CCTTCCTGTT CTATCAATTG ATGATTGGTT 1380 

TTTGGGGGTT GAACGCGAGC ATTAGTCGTT ACAGCAAGAT TACGGATTTC ATGAAAATCT 1440 

TTTTTGGTGT GACTGCTAGC AGTGTCTTGT CATATAGTAT CTGTTATGCC TTCTTGCCAC 1500 

TCTTCTCCAT CCGTTTCATC ATTCTCTTTA TCTTGTTGAG TACCTTCTTG ATTTTATTGC 1560 
CACGGATTAC TTGGCAGTTA ATCTACTCCA GACGCAAAAA AGGTAGTGGT GATGGAGAAC 1620

ACCGTCGGAC CTTCTTGATT GGTGCCGGTG ATGGTGGGGC TCTTTTTATG GATAGTTACC 1680

AACATCCAAC CAGTGAATTA GAACTGGTCG GTATTTTGGA TAAGGATTCT AAGAAAAAGG 1740

GTCAAAAACT TGGTGGTATT CCTGTTTTGG GCTCTTATGA CAATCTGCCT GAATTAGCCA 1800

AACGCCATCA AATCGAGCGT GTCATCGTTG CGATTCCGTC GCTGGATCCG TCAGAATATG 1860

AGCGTATCTT GCAGATGTGT AATAAGCTGG GTGTCAAATG TTACAAGATG CCTAAGGTTG 1920

AAACTGTTGT TCAGGGCCTT CACCAAGCAG GTACTGGCTT CCAAAAAATT GATATTACGG 1980

ACCTTTTGGG TCGTCAGGAA ATCCGTCTTG ACGAATCGCG TCTGGGTGCA GAACTGACAG 2040

GTAAGACCAT CTTAGTCACA GGAGCTGGAG GTTCAATCGG TTCTGAAATC TGTCGTCAAG 2100

TTAGTCGCTT CAATCCTGAA CGCATTGTCT TGCTCGGTCA TGGGGAAAAC TCAATCTACC 2160

TTGTTTATCA TGAATTGATT CGTAAGTTCC AAGGGATTGA TTATGTACCT GTGATTGCGG 2220

ACATTCAAGA CTATGATCGT TTGTTGCAAG TCTTTGAGCA GTACAAACCT GCTATTGTTT 2280

ATCATGCGGC AGCCCACAAG CATGTTCCTA TGATGGAGCG CAATCCAAAA GAAGCCTTCA 2340

AAAACAATAT CCGTGGAACT TACAATGTTG CTAAGGCTGT TGATGAAGCT AAAGTGTCTA 2400

AGATGGTTAT GATTTCGACA GATAAGGCAG TCAATCCACC AAATGTTATG GGAGCAACCA 2460

AGCGCGTGGC GGAGTTGATT GTCACTGGCT TTAACCAACG TAGCCAATCA ACCTACTGTG 2520

CAGTTCGTTT TGGGAATGTT CTTGGTAGCC GTOGTAGTCT CATTCCAGTC TTT-OAACGTC 2580

AGATTGCTGA AGGTGGGCCT GTAACGGTGA CAGACTTCCG TATGACCCGT TACTTTATGA 2640

CCATTCCAGA AGCTAGCCGT CTGGTTATCC ATGCTGGTGC TTATGCCAAA GATGGGGAAG 2700

TCTTTATCCT TGATATGGGC AAACCAGTCA AGATTTATGA CTTGGCCAAG AAGATGGTGC 2760

TTCTAAGTGG CCACACTGAA AGTGAAATTC CAATCGTTGA AGTTGGAATC CGCCCAGGTG 2820

AAAAACTCTA CGAAGAACTC TTGGTATCAA CCGAACTCGT TGATAATCAA GTTATGGATA 2880

AGATTTTCGT TGGTAAGGTT AATGTCATGC CTTTAGAATC CATCAATCAA AAGATTGGAG 2940

AGTTCCGCAC TCTCAGTGGA GATGAGTTGA AGCAAGCTAT TATCGCCTTT GCTAATCAAA 3000

CAACCCACAT TGAATAAAAA AGAAAAACGC ATAGTATCAA GTTACACAAC CTTGGTAATA 3060

TGCGTTTTAT TATGTAGAGA CTTATACTCT TCGAAAATCT CTTCAAACCA CGTCAACGTC 3120

GCCTTGCCGT ATATGGTTAC TGACTtCGTC AGTTCTATCC ACAACCTCAA AACAGTGTTT 3180

TGAGytGACT TCGTCAGTTC TATCCACAAC CTCAAAACAG TGTTTTGAGc TGACtTCGTC 3240

AGTTCTATCC ACAACCTCAA AACAGTGTTT TGAGCTGAcT TCGTCAGTTC CATCCACAAC 3300

CTTAAAACAG TGTTTTGAGy TGACnTTCGT CAGTTCCATC TACAACCTTA AAACAGTGTT 3360
TTGAGCTGCC CGCAGCTAGT TTCCTAGTTT GCTCTTTGAT TTTCATTGAG TATTACTTCA 3420

TTTTCTTCTG AAATGGAATT GTTACCCAGT CTATGCTATT GAAAATACGC CAAAACTTCT 3480 

AAGGGTTTGT GAGCGATATA ATCAGGTTGA TAGTTTAGTA GATCTGCTTG CTCTCCAAAT 3540 

CCCCAAGTGA TGGCCAATTT CTGAATACCT GTTTCTCGAG CTCCCAGCAT ATCAAACTTG 3600

GTATCTCCGA TGATGATGGC TTGTTCTGGT GCTAGTTGAT GTGTCTGCAA GGCTTGGTGA 3660

ATGACATCTG CCTTATGGGG TGCTTCAGGG CTAGAACCAT AAATGCCATC AAAGAAATGA 3720 

TGGATTTCCA AGTTTTTTGC CATGTCTTGA GCAGTAGATG TATCCTTTGT CGTGGTGATG 3780 

TAGAGTGGAT AACTGCTCGA TAACTCCTCA AGCAAGTCTA TAATCTGAGG AAAGAGTTGA 3840 

GCTTCATAGA TGCCTTTTGC CTTATAGTAA GAACGATATA TCTGCACGGC TTCAGAAATT 3900

TGGTCTTTGG ACAGGCAGGT CGCAAAACTA CTTTCGAGAG GTGGTCCCAT AAAACCACGA 3960

ATAGTTTTGG CATCAGGGCT AGGCACCCCC AGCTCTTTAA AGGTATAGGT AAAGGCATTG 4020 

TGAATCCCGA TAGAACTATC AACGAGGGTT CCATCCAAAT CGAAAAAAAT CGCTGTGATA 4080 

GAGGTCATGG TTTCTCCTAT TTGATAAGCT TATTCTCCGA 7VAATTTCTTT TTGGAGGCGA 4140

CGACCAGTAG GGGTGGTAGC GAGTCCACCT TCAGCTGTTT CACGAAAGGC AGTTGGCATG 4200 

CTTGCTCCTA CTTGGTACAT GGCATCGATC ACTTCATCCA CAGGGATTTT AGATTCGATA 4260 

CCTGCCAAGG CCATGTCTGC TGCGATGAAA GCAAAGCTAG CTCCCATGGC ATTACGTTTG 4320

ACACAGGGAA CTTCGACCAA ACCTGCAACA GGGTCACAGA TGAGGCCTAG CATATTTTTA 4380

ATGACAAAGG CAATAGCTTG ACTGGCCTGA TAAGGTGTTC CACCTGCAGC CAGAGTCAAG 4440 

GCGGCAGCAC TCATAGCAGA GGCTGAACCA ACTTCAGCTT GACACCCACC CTCAGCACCT 4500 

GAGATGGAGG CATTGTTTGC GATGACTAGT CCAAAGGCAC CAGCAGCAAA GAGGAAATCC 4560 

AATTGTTGCT CGTGGCTGAG GTCTAATTTT TCAATAGCAG CAGTGAGAAC GGATGGCAGA 4620

CAGCCAGCAC TTCCAGCGGT TGGAGTGGCA CAGACCAAGC CCATTTTGGC ATTGTGTTCA 4680

TTGACTGCGA TGGCATTTCG GGCAGCCGAG AGAATCGTAT AATCTGACAG AGTTTTTCCG 4740

TTTTCGATGT AGTGATCCAA TTTGGCAGCA TCTCCACCTG TCAGGCCACT ACGAGATTTA 4800 

TTTTCATTGA GGCCAAGTTG GACAGAGGCT TTCATAACTT CCAGATTGCG TTCCATGAGA 4860

AGGAAGACTT CTTCACGTTC GCGACCGGTC AATTCAAACT CTGTTGTAAT CATGAGTTCT 4920 

GCGACATTTC CTTGAAAGTC CAGATCTGCT TGCTCGACCA ATTCTTTGAT AGAATAAAAC 4980 

ATGCTTCCTC CTATTTAAAG AAATTGACAT TGTGGAGATG AGGGATTTTT CGAATTTCTT 5040 

CGATAGCCTC ATCACAGTTG CGACTGTCAA CTTCGATAAT CATAATGGCT TTTTCACCAG 5100 
CTTTTTCACG AGTGACATTC ATCTGGGCGA TATTGATACC ATAGCGGGAA AGCGCCTCTG 5160

TAACAAGGGC AATCATACCT GGAATATCTT GATGAACGAT GATGATAGTC GGTGTATTCA 5220

TATTGAGAGA GACGGCAAAA CCATTGAGTT CGGTTACCTG AATATTTCCT CCACCGATAG 5280

AAATACCAGT CACGCTGATG GTCTTGTGGG CATTTTTAAC AGTAATTTTA GTGGTGTTAG 5340

GGTGAGGGGC ATTGCTGTCT TTCTGAATGG TCCAGACAAT CTTGATACCA CGCTTGTGGG 5400

CAATTTCCAG ACTATTTGGA ATTTCAGGAT CATCTGTATC CATTCCTAAA ATACCTGCAA 5460

CAAGGGCTAG GTCTGTTCCG TGACCACGAT AGGTCTTGGC AAATGAGTTA AAAAGTTGGA 5520

ATTCAACTTC TGTCGGAGTA TCATCAAAAA TGG7VAGAGAC AATCTTCCCA ATACGAACAG 5580

CACCAGCGGT ATGGCTACTA GATGGGCCAA TCATAACTGG TCCGATGATA TCAAAGACAG 5640

ATTGAAAACG AAGTGATTTC ATCAGTTTCC CCTTATAAAA ATTCTTATCT CTATTATATC 5700

AAAGAATGAG GGGCTTGGCT TTAATTGTGG ATGAAAACCT TTCTAATACC TCAAATAGCA 5760

TAAAAATAGT ATCTTTTATG ACAAAAAACA CCTTATTTAG GGAAATAAAA AATAATTTTG 5820

TAATATTTCT ACATAAAAGT GTCAAGAAAC GGTAATATTT AAAGGGTATG ATAGAACTAT 5880

AGAAAGAAGG AGAATTTTCG AATATGAAAT CAATAACTAA AAAGATTAAA GCAACTCTTG 5940

CAGGAGTAGC TGCCTTGTTT GCAGTATTTG CTCCATCATT TGTATCTGCT CAAGAATCAT 6000

CAACTTACAC TGTTAAAGAA GGTGATACAC TTTCAGAAAT CGCTGAAACT CACAACACAA 6060

CAGTTGAAAA a rrrr\Q q C AG AA TTGATAACA'i' TCATTTGATT TATGTTGATC 6120

AAGAGTTGGT TATCGATGGC CCTGTAGCGC CTGTTGCAAC ACCAGCGCCA GCTACTTATG 6180

CGGCACCAGC CGCTCAAGAT GAAACTGTTT CAGCTCCAGT AGCAGAAACT CCAGTAGTAA 6240

GTGAAACAGT TGTTTCAACT GTAAGCGGAT CTGAAGCAGA AGCCAAAGAA TGGATCGCTC 6300

AAAAAGAATC AGGTGGTAGT ATACAGCTAC AAATGGACGT TATATCGGAC GTTACCAATT 6360

AACAGATTCA TACCTGAACG GTGACTACTC AGCTGAAAAC CAAGAACGGG TACCG 6415



(2) INFORMATION FOR SEQ ID NO: 163: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8494 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 163: 
TACCCCTTTC GAATTTTGGC AAAAATTCGG TAAGGCTTTG ATGGTAGTTA TCGCGGTTAT 60 
GCCGGCTGCT GGTTTGATGA TTTCAATCGG TAAGTCTATC GTGATGATTA ACCCAACCTT 120 
TGCACCACTT GTCATCACAG GTGGAATTCT TGAGCAAATC GGTTGGGGGG TTATCGGTAA 180

CCTTCACATT TTGTTTGCCC TAGCCATTGG AGGAAGCTGG GCTAAAGAAC GTGCTGGTGG 240 

TGCTTTCGCC GCTGGTCTTG CCTTCATCTT GATTAACCGT ATCACTGGTA CAATCTTTGG 300 

TGTATCAGGC GATATGTTGA AAAATCCAGA TGCTATGGTA ACTACTTTCT TTGGTGGTTC 360 

AATCAAAGTT GCTGATTACT TTATCAGTGT TCTTGAAGCT CCAGCCTTGA ACATGGGGGT 420 

ATTCGTAGGG ATTATCTCAG GTTTTGTAGG GGCAACTGCT TACAACAAAT ACTACAACTT 480

CCGTAAACTT CCTGATGCAC TTTCATTCTT CAACGGGAAA CGTTTCGTAC CATTTGTAGT 540 

TATTCTTCGT TCAGCAATCG CTGCAATTCT ACTTGCTGCT TTCTGGCCAG TAGTTCAAAC 600 

AGGTATCAAT AACTTCGGTA TCTGGATTGC CAACTCACAA GAAACTGCTC CAATTCTTGC 660 

ACCATTCTTG TATGGTACTT TGGAACGTTT GCTCTTGCCA TTTGGTCTTC ACCACATGTT 720 

GACTATCCCA ATGAACTACA CAGCTCTTGG TGGTACTTAT GACATTTTAA CTGGTGCAGC 780 

TAAAGGTACT CAAGTATTCG GTCAAGACCC ACTATGGCTT GCATGGGTAA CAGACCTTGT 840 

AAACCTTAAA GGTACTGATG CTAGTCAATA TCAACACTTG TTAGATACAG TACATCCAGC 900 

TCGTTTCAAA GTTGGACAAA TGATCGGTTC ATTCGGTATC TTGATGGGTG TGATTGTTGC 960

TATCTACCGT AATGTTGATG CTGACAAGAA ACATAAATAC AAAGGTATGA TGATTGCAAC 1020 

AGCTCTTGCA ACATTCTTGA CAGGGGTTAC TGAACCAATC GAATACATGT TCATGTTCAT 1080 

CGCAACACCT ATGTATCTTG TTTACTCACT TGTTCAAGGT GCTGCCTTCG CTATGGCTGA 1140 

CGTCGTAAAC CTACGTATGC ACTCATTCGG TTCAATCGAG TTCTTGACTC GTACACCTAT 1200 

TGCAATCAGT GCTGGTATTG GTATGGATAT CGTTAACTTC GTTTGGGTAA CTGTTCTCTT 1260

TGCTGTAATC ATGTACTTTA TCGCAAACTT CATGATTCAA AAATTCAACT ACGCAACTCC 1320 

AGGGCGCAAC GGAAACTACG AAACTGCTGA AGGTTCAGAA GAAACCAGCA GCGAAGTGAA 1380 

AGTTGCAGCA GGCTCTCAAG CTGTAAACAT TATCAACCTT CTTGGTGGAC GTGTAAACAT 1440 

CGTTGATGTT GATGCATGTA TGACTCGTCT TCGTGTAACT GTTAAAGATG CAGATAAAGT 1500 

AGGAAATGCA GAGCAATGGA AAGCAGAAGG AGCTATGGGT CTTGTCATGA AAGGACAAGG 1560 

GGTTCAAGCT ATCTACGGTC CAAAAGCTGA CATTTTGAAA TCTGATATCC AAGATATCCT 1620 

TGATTCAGGT GAAATCATTC CTGAAACTCT TCCAAGCCAA ATGACTGAAG CACAACAAAA 1680 

CACTGTTCAC TTCAAAGATC TTACTGAGGA AGTTTACTCA GTAGCAGACG GTCAAGTTGT 1740 

TGCTTTGGAA CAAGTAAAGG ATCCAGTATT TGCTCAAAAA ATGATGGGTG ATGGATTTGC 1800 

AGTAGAACCT GCAAATGGAA ACATTGTATC TCCAGTTTCA GGTACTGTGT CAAGCATCTT 1860 
CCCAACAAAA CATGCTTTTG GTATTGTGAC GGAAGCAGGT CTTGAAGTAT TGGTTCACAT 1920

TGGTTTGGAC ACAGTAAGTC TTGAAGGTAA ACCATTTACA GTTCATGTTG CTGAAGGACA 1980 

AAAAGTTGCA GCAGGAGATC TCCTTGTCAC AGCTGACTTG GATGCTATCC GTGCAGCAGG 2040 

ACGTGAAACT TCAACAGTAG TTGTCTTCAC AAATGGTGAT GCAATTAAAT CAGTTAAGTT 2100 

AGAAAAAACA GGTTCTCTTG CAGCTAAAAC AGCAGTTGCT AAAGTAGAAT TGTAATATAC 2160 

TTGAGGTTGG AAGCTGTATT CCAACCTCTT ATTTTGGGAG AAAAGAATGA AATTTTTAAC 2220 

ACTCAATACT CACAGTTGGA TGGAGAAAGA AGCAGAGGAA AAATTCCAGA TTTTGCTTGA 2280 

AGATATTCTT GAAAAGGACT ATGATTTGAT TTGTTTTCAA GAAATCAATC AGGAGATGAC 2340 

CTCGTCAGAG GTGGAGGTTA ATGACCTTTA TCAAGCTTTG CCAGCAGCTG AGCCTATTCA 2400 

CCAAGACCAT TATGTTAGAC TCTTGGTTGA AAAGTTGTCT GAGCAAGGGA AAAATTACTA 2460 

CTGGACCTGG GCCTATAACC ATATCGGCTA TAACCGCTAC CACGAAGGTG TGGCTATCTT 2520 

GTCTAAAACA CCTATTGAAG CCAGAGAAAT TTTGGTTTCA GATGTGGATG ATCCAACAGA 2580 

CTATCATACT CGCCGTGTTG CCCTAGCTGA AACTGTAGTC GATGGCAAGG AGCTAGCAGT 2640 

TGCCAGTGTT CATCTCTCTT GGTGGGATAA AGGTTTCCAA GAAGAATGGG CACGATTTGA 2700 

GGCTGTCTTG AAAAAATTGA ACAAGCCACT TTTACTAGCT GGAGATTTCA ACAATCCGGC 2760 

TGGACAGGAA GGTTACCAAG CTATTTTAGC TAGTCCATTA GGCTTACAAG ACGCATTTGA 2820 

AGT'l'UCTCAA GAGAAAAGTG GTAGCTATAC TGTTCCGCCT GAAATTGATG GCTGGAAAGG 2880 

GAACACTGAA CCCCTTCGAA TCGATTATGT CTTTACTACC AAAGAGTTAG CGGTGGAAAA 2940 

TTTACATGTC GTATTTGATG GTAACAAGAG TCCACAAGTG AGTGATCACT ATGGCTTGAA 3000 

TGCTATATTA AACTGGAAAT AATAACTGAA AAGAGGTTGG AACTATAAAA TTCCAGCCTT 3060 

TTCTTACTAG AGAAGCTACT GGAAATAGCC TAAATAAGTG AGACTACTGT AATGGAATAA 3120 

AATATGGTAT AATTGATAAG GTAGATAGAA TCGAGGATGT TATGTCATTT ACGAAATTTC 3180 

AATTTAAAAA CTATATTAGA GAAGCCTTGA AGGAGTTAAA ATTTACAACT CCAACAGAGG 3240 

TGCAAGACAA GTTGATTCCT ATTGTTTTGG CAGGTCGTGA CCTAGTAGGA GAATCAAAAA 3300

CAGGTTCAGG TAAGACTCAT ACTTTCTTGT TACCGATTTT CCAGCAATTA GATGAAGCTA 3360 

GCGATAGTGT ACAAGCAGTG ATTACTGCAC CGAGTCGTGA GTTGGCTACT CAAATTTACC 3420 

AAGTAGCGCG TCAGATTTCA GCTCACTCAG ATGTCGAAGT TCGTGTGGTT AATTATGTGG 3480 

GTGGTACGGA TAAGGCTCGC CAGATTGAGA AATTGGCAAG CAATCAGCCT CATATTGTTA 3540

TTGGAACACC AGGCCGTATC TACGACTTGG TTAAATCTGG TGATTTAGCT ATTCATAAAG 3600 

CCAAGACATT TGTTGTTGAT GAAGCAGATA TGACCTTGGA TATGGGATTC TTGGAAACTG 3660 
TTGATAAGAT TGCTGGCAGT CTTCCAAAAG ACTTGCAATT CATGGTCTTC TCAGCGACTA 3720

TCCCACAAAA ACTGCAACCA TTCTTGAAAA AATACTTATC AAATCCTGTT ATGGAGAAAA 3780 

TTAAGACCAA AACGGTTATT TCTGACACCA TTGATAATTG GTTGATTTCG ACCAAGGGAC 3840 

ATGATAAGAA TGCTCAAATT TACCAGTTGA CTCAGTTGAT GCAGCCGTAT TTGGCAATGA 3900 

TTTTTGTTAA CACTAAAACG CGTGCTGATG AATTGCATTC ATATCTGACT GCTCAAGGCT 3960 

TGAAGGTTGC AAAAATCCAT GGCGATATTG CCCCTCGTGA ACGCAAGCGA ATCATGAATC 4020 

AGGTGCAAAA TCTGGATTTT GAGTATATTG TCGCAACAGA TTTGGCAGCG CGTGGGATTG 4080 

ACATTGAAGG TGTCAGCCAT GTCATCAATG ATGCCATTCC GCAAGACTTA TCTTTTTTTG 4140 

TTCATCGTGT TGGTCGTACT GGACGAAATG GCCTACCAGG TACAGCTATT ACCCTTTATC 4200 

AGCCAAGTGA TGACTCGGAT ATCCGTGAGT TGGAGAAATT GGGAATCAAG TTTAGTCCTA 4260 

AGATGGTCAA AGACGGGGAA TTTCAAGATA CCTATGACCG TGATCGTCGT GCCAACCGTG 4320 

AGAAAAAACA AGATAAACTT GATATCGAAA TGATTGGTTT GGTTAAAAAG AAAAAGAAAA 4380

AAGTCAAACC GGGTTATAAG AAGAAAATTC AATGGGCGGT TGATGAAAAG CGCCGTAAAA 4440 

CCAAGCGTGC TGAAAATCGC GCTCGCGGTC GTGCAGAGCG TAAAGCTAAA CGCCAAACAT 4500 

TTTAATAGAA ATTGTTGGAG TATTGAGCTC CAACTTTTTT ATTTATGAGA ACGAACTATC 4560 

TAAACCGAAA CAfTTACATTA AAGACTGCAA ATTGCGATTA AAT'J'.TGGTAT AATGATAAAG 4620

TTATATAGTC CCGATAAGAT GGTAGGTATT TATTACGAAG AGTTTTCCTA TCAGTACTTT 4680 

GTAACTCTAT AACAATATTT TTTAAGGGGG GACATTTTTA TGTCAGAGCG TAAATTATTC 4740 

ACGTCTGAAT CTGTATCTGA GGGGCATCCG GATAAGATTG CAGACCAAAT TTCAGATGCG 4800 

ATTTTGGATG CTATTTTAGC AAAGGATCCA GAGGCGCACG TTGCTGCTGA AACAGCTGTA 4860 

TATACTGGTT CTGTCCACGT TTTTGGTGAA ATTTCTACAA ATGCCTATGT GGATATTAAC 4920 

CGTGTGGTTC GTGATACCAT TGCAGAGATT GGTTATACCA ATACAGAATA TGGATTTTCT 4980 

GCTGAGACGG TGGGAGTACA CCCATCTTTG GTGGAACAAT CTCCTGACAT CGCTCAAGGT 5040 

GTTAACGAAG CCTTGGAGGT TCGTGGAAAT GCTGATCAAG ATCCACTGGA CTTGATTGGA 5100 

GCAGGTGACC AAGGGCTCAT GTTTGGATTT GCAGTAGATG AAACAGAAGA GCTTATGCCA 5160 

TTGCCAATTG CACTCAGTCA TAAATTGGTT CGTCGTCTGG CAGAACTTCG TAAGTCTGGA 5220 

GAAATTAGCT ATCTCCGTCC AGATGCAAAA TCACAAGTTA CAGTTGAGTA CGATGAAAAT 5280 

GACCGTCCGG TACGTGTAGA TACAGTCGTT ATTTCTACTC AGCATGATCC AGAGGCCACT 5340 

AATGAACAAA TCCATCAAGA TGTGATTGAC AAGGTCATCA AAGAAGTTAT TCCATCTTCT 5400 
TATCTTGATG ATAAGACAAA ATTCTTTATC AATCCGACAG GTCGTTTTGT AATCGGTGGT 5460

CCTCAAGGGG ACTCAGGTTT GACTGGTCGT AAGATTATTG TAGATACTTA TGGTGGCTAC 5520 

TCTCGTCATG GTGGTGGTGC CTTCTCTGGT AAAGATGCGA CTAAGGTGGA TCGTTCAGCC 5580 

TCTTATGCGG CTCGCTATAT TGCCAAGAAT ATCGTTGCAG CAGACCTTGC TAAGAAGGCA 5640 

GAAGTGCAGT TGGCCTATGC TATCGGTGTT GCGCAACCTG TTTCTGTTCG TATCGATACT 5700 

TTCGGTACAG GAACAGTAGC TGAAAGTCAA CTTGAAAAAG CGGCTCGTCA AATCTTTGAC 5760 

CTTCGCCCTG CAGGGATTAT CCAAATGCTG GACCTCAAGC GTCCAATTTA CCGTCAAACA 5820 

TCGGCTTACG GTCACATGGG ACGTACAGAT ATTGATCTTC CATGGGAACG TTTGGATAAG 5880 

GTAGATGCTT TGAAAGAAGC AGTAAAATAA GATTTTAAGA GGGGAACGTC CTCTCTTTTT 5940 

TATAGTTTTT AACTATACTG GGATACTGTT CTGAAAATCC ATTTTGCGAA AGTAGAGATT 6000 

TACATGTATA GTAGATTGAA ACTAGAATAG TACACCTCAA CTTCTAAAAC ATTGTTAGCA 6060 

ATCAATTTGA CTGTCCTGAT CGATTTCTCC TGTTCTTGTT TCATTTTACT ATATTTCTTT 6120 

AAAAATGATA AAGGTTAAGA TTTCTCCTCG TAATAGATAA TCTTGGGGAT ATTTCAATCC 6180 

AAAGTTTTAT TCGTTATCAC TTGACTATTG CAAGGTTTTC TAGAGCAACA GAGTCATGGA 6240 

ATGGACTCAT GGTTGAGATT TCTCCTTGTT GCTTGGACTT CATTCAAAAG TCTGTTACCC 6300 

AAGCCTTGTT CAAACTTCTA ATACACTAGC TGTTTCCATA GCATGACTTC TGTACTAGAC 6360

T-rrL-TTTTCC GAATAAATAG ATAGAACCAC AGAATCTAGT AAACCTAGAA TTAAAATTAT 6420 

GGTATAATAT TAGCAATAAA AGAAATCTGG AGGATTAGAA TCATGGTATC AACGAAAACA 6480 

CAAATTGCTG GTTTTGAGTT TGACAATTGC TTGATGAATG CAGCAGGTGT GGCTTGTATG 6540 

ACGATAGAGG AGTTAGAAGA GGTCAAAAAC TCAGCGGCAG GAACCTTTGT TACTAAGACA 6600 

GCGACCTTGG ACTTCCGTCA GGGGAATCCT GAGCCACGCT ACCAAGATGT TCCACTTGGT 6660 

TCCATCAACT CTATGGGCTT GCCAAATAAT GGCTTAGACT ATTATTTGGA TTATCTTTTA 6720 

GATTTGCAGG AAAAAGAGTC GAACCGAACT TTCTTCTTAT CTCTGGTCGG CATGTCTCCA 6780 

GAGGAAACCC ATACTATTTT GAAAAAAGTC CAAGAGAGTG ATTTTCGTGG TCTGACTGAG 6840 

CTAAATCTTT CCTGTCCAAA TGTTCCAGGT AAACCTCAGA TTGCCTATGA TTTTGAGACA 6900 

ACAGACCGGA TTTTGGCAGA AGTGTTTGCT TACTTCACCA AACCTCTTGG AATTAAATTG 6960 

CCACCTTATT TTGATATTGT TCACTTTGAC CAAGCGGCAG CTATTTTCAA CAAATATCCG 7020 

CTCAAGTTTG TCAACTGCGT TAACTCTATC GGAAACGGCC TCTATATAGA AGACGAATCT 7080 

GTCGTTATTC GGCCTAAGAA TGGTTTTGGT GGAATTGGTG GAGAATACAT CAAACCGACT 7140 

GCTTTAGCCA ATGTTCACGC CTTTTATCAA CGTTTAAATC CTCAAATCCA AATTATCGGA 7200 
ACAGGTGGCG TTCTGACTGG TCGAGATGCC TTTGAACACA TCCTCTGTGG AGCAAGTATG 7260

GTGCAGGTGG GAACGACCCT TCACAAAGAA GGCGTCAGTG CTTTTGACCG CATTACCAAT 7320 

GAACTGAAAG CAATCATGGT GGAAAAAGGC TACGAGAGCT TAGAAGATTT CCGTGGGAAA 7380 

TTGCGCTATA TTGACTAAAT TAAATCGAAA AATCTGAAGA AAGGAGAGAC GATGCTAGCC 7440 

ATTGAAGAAA GTCAGAAGTT GACTTTATCA AATTTACCGA GCCTGAGCCT ATTTACAGGG 7500 

ACAGATCAGG GTCAGTTTGA AGTGATGAAG AGTCAAATGT TGAAACAGAT TGGGTATGAT 7560 

TCTGCTGACC TCAACTTTGC CTACTTTGAT ATGAAAGAAG TAGTTTACAA GGATGTGGAA 7620 

CTGGAGTTGG TCAGCCTTCC TTTCTTTGCG GATGAGAAAA TCGTGATATT AGATTATTTT 7680 

ATGGATATCA CGACTGCTAA GAAACGCTTT TTGACAGATG ATGAGCTTAA GTCATTTGAG 7740 

GAATACCTTG ACAATCCTTC TCCAACAACC AAGTTGATAA TCTTTGCAGA AGGAAAGCTG 7800 

GATAGCAAAA GACGGTTAGT CAAATTACTT AAGCGTGATG CCAAGGCCTT CGATGCAGTA 7860 

GAAGTAAAAG AACAAGAATT GCGCCAGTAC TTCCAAAAGT GGAGTCAGAA ACAAGGTCTG 7920 

CAGTTTACCA ATCATTCTTT TGA/^TCTC CTCATCAAGT CGGGGTTTCA ATTTAGCGAA 7980 

ATCCAGAAAA ATCTTCTCTT TTTACAGTCC TATAAGGCGA ATTCTGTTAT TGAGGAAGAG 8040 

GATATTGTTA ACGCAATTCC CAAGACTTGC AGGACAATAT TTTTGATTTA ACTCAGTTTA 8100 

TTCTGACTAA AAAGATGGAT CAGGCGCGCG ATTTGGTGAG AGACTTGACC TTGCAAGGGG 8160 

AAGATGAAAT CAAACTGATT GCAGTCATGC TGGGACAATT TCGGACTTTT ACTCAGGTGA 8220 

AGATTTTGGC GGAGTCTGGC CAAACAGAAT CGCAGATTGC AAGTAGTTTA GGTAGTTATC 8280 

TGGGACGTAA CCCAAATCCT TATCAAATCA AGTTTGCATT AAGAGATTCG AGAGGACTTT 8340

CTTTGAGCTT TTTGAAGCAA GCTATTTCCT ATTTGATTGA GACAGACTAT CAGATl^AAGA 8400 

CAGGTCTTTA TGAAAAAGGT TTCCTTTTTG AAAAGGCACT CTTACAGATT GCTAGTCAGG 8460

TCAATTGACA TTTGTTGAAA CTACTAACCC GCGG 8494 
(2) INFORMATION FOR SEQ ID NO: 164: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9707 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164: 

CCGGTCAGTT CGTTCAGTAC AAGGAATCAT AATGAACGAT CAATCAGAAA AAAAGACTAG 60 
AAAGAAGACT GTATGGATAA TCGACCAATT GGTTTTTTGG ATTCGGGTGT CGGGGGCTTG 120

ACCGTTGTGC GCGAGCTCAT GCGCCAGCTT CCCCATGAAG AAATCGTCTA TATTGGAGAT 180 

TCGGCGCGGG CGCCCTATGG CCCCCGTCCT GCTGAGCAAA TTCGTGAATA TACTTGGCAG 240 

CTGGTCAACT TTCTCTTGAC CAAGGATGTC AAAATGATTG TCATTGCTTG TAACACTGCG 300 

ACTGCGGTCG TCTGGGAAGA AATCAAGGCT CAACTAGATA TTCCTGTCTT GGGTGTAATT 360 

TTGCCAGGAG CTTCGGCAGC CATCAAGTCC AGTCAAGGTG GGAAAATCGG AGTGATTGGA 420 

ACGCCCATGA CGGTACAATC AGACATATAC CGTCAGAAAA TCCATGATCT GGATCCCGAC 480 

TTACAGGTGG AGAGCTTGGC CTGTCCCAAG TTTGCTCCCT TGGTTGAGTC AGGTGCCCTG 540 

TCAACCAGTG TTACCAAGAA GGTGGTCTAT GAAACCCTGC GTCCCTTGGT TGGAAAGGTG 600 

GATAGCCTGA TTTTGGGCTG TACTCATTAT CCACTCCTTC GCCCTATTAT CCAAAATGTG 660 

ATGGGGCCAA AGGTTCAGCT CATCGATAGT GGGGCAGAGT GCGTACGGGA TATCTCAGTC 720 

TTACTCAATT ATTTTGAAAT CAATCGTGGT CGCGATGCTG GACCACTCCA TCACCGTTTT 780 

TACACAACAG CCAGTAGCCA AAGTTTTGCA CAAATTGGTG AAGAATGGCT GGAAAAAGAG 840 

ATTCATGTGG AGCATGTAGA ATTATGACAA ATAAAATTTA TGAATATAAG GATGACCAGG 900 

ACTGGTATGT TGGGTCTTAT AGTATTTTTG GTGGCGTTAA CAGTTTGAGC GACTATAAGA 960 

CAGATTTTCC TCTGTTTGAA TTCTCCAAAA TATTTGGAGA TGAAGAGTAT GGTTTCCCGC 1020 

TTTCAGTTAC TGTTTTACGC TATGGTTCTA TCTACCGTTT GTTCTCCTTT GTGGTAGACA 1080 

TGCTTAATCA AGAAATGGGA CGAAACTTGG AAGTTATTCA ACGTCATGGG GCCCTGCTCT 1140 

TGGTTGAAAA TGGGCAACTC TTGTATGTAG AATTGCCTAA AGAAGGGGTC AATGTTCATG 1200 

ATTTCTTTGA GACAAGCAAG GTCAGAGAAA CCTTGTTGAT TGCGACTCGT AACGAAGGTA 1260 

AAACCAAGGA ATTCCGAGCT ATCTTTGATA AGTTAGGCTA CGATGTGGAA AATCTTAATG 1320 

ACTACCCTGA CCTGCCTGAA GTAGCAGAAA CAGGTATGAC CTTTGAAGAA AATGCCCGCC 1380 

TTAAGGCAGA AACCATTTCT CAATTAACGG GCAAGATGGT TTTGGCAGAT GATTCTGG'i'C 1440 

TCAAAGTCGA TGTCCTTGGT GGCTTACCAG GCGTCTGGTC AGCTCGTTTC GCAGGTGTGG 1500 

GAGCAACTGA CCGTGAAAAT AATGCCAAAC TCTTGCACGA ATTGGCCATG GTCTTTGAAC 1560 

TCAAGGACCG CTCGGCTCAG TTCCACACAA CCCTAGTCGT AGCCAGCCCA AATAAGGAAA 1620 

GTTTAGTTGT TGAAGCAGAC TGGTCAGGTT ATATTAACTT TGAACCTAAG GGTGAAAATG 1680 

GCTTTGGCTA TGATCCCCTC TTCCTTGTAG GAGAAACAGG TGAGTCATCA GCTGAATTAA 1740 

CCCTGGAAGA AAAAAATAGT CAATCTCACC GTGCCTTAGC CGTTAAGAAA CTTTTGGAGG 1800 

TATTTCCATC ATGGCAAAGC AAACCATCAT TGTAATGAGC GATTCCCATG GCGATAGCTT 1860 
GATTGTGGAA GAAGTCCGTG ATCGCTATGT GGGCAAAGTC GATGCTGTTT TTCATAACGG 1920

CGATTCTGAA CTACGTCCGG ATTCTCCACT TTGGGAGGGC ATCCGCGTTG TTAAAGGGAA 1980 

CATGGACTTC TACGCCGGCT ACCCAGAACG TCTGGTGACT GAGCTTGGTT CGACCAAGAT 2040 
TATCCAAACT CATGGTCACT TGTTTGACAT CAATTTCAAC TTTCAAAAGT TGGACTACTG 2100

GGCTCAGGAG GAAGAGGCCG CTATCTGCCT CTATGGTCAC TTGCATGTGC CAAGTGCTTG 2160 

GTTGGAAGGC AAGATCCTCT TTCTAAATCC AGGTTCTATC AGTCAACCAC GAGGTACCAT 2220 

CAGAGAATGT CTCTATGCTC GTGTGGAGAT TGATGATAGT TACTTCAAAG TGGACTTTTT 2280 

GACACGAGAT CACGAGGTGT ATCCAGGTTT GTCCAAGGAG TTTAGCCGAT GATTGCCAAG 2340 

GAGTTTGAGA CTTTCTTGTT GGGGCAGGAG GAAACTTTTT TGACCCCTGC TAAAAATCTA 2400 

GCTGTGTTGA TTGATACCCA CAATGCGGAT CATGCGACCC TCTTGCTCAG TCAGATGACC 2460 

TATACCCGTG TTCCCGTTGT GACAGATGAA AAACAGTTTG TTGGGACGAT TGGACTCAGA 2520 

GATATTATGG CTTATCAGAT GGAGCATGAC TTGAGCCAAG AAATCATGGC GGATACGGAT 2580 

ATCGTTCATA TGACAAAAAC GGACGTAGCG GTTGTTTCGC CTGATTTCAC CATTACGGAG 2640

GTCTTGCACA AGCTAGTAGA TGAGTCCTTC TTACCGGTTG TGGATGCAGA GGGTATTTTC 2700 

CAAGGGATTA TTACGCGCAA GTCCATCCTC AAGGCCGTTA ATGCCCTCTT GCATGACTTT 2760 

AGTAAGGAAT ATGAGATTCG ATGCCAATGA GAGACAGGAT TTCAGCCTTT TTAGAGGAAA 2820

AGCAGGGCTT GTCTGTCAAT TCCAAGCAGT CCTATAAGTA TGATTTGGAG CAATTTTTAG 2880 

ACATGGTAGG TGAGCGGATT TCTGAGACCA GTCTCAAGAT TTACCAAGCC CAGCTAGCCA 2940 

ATCTAAAAAT CAGCGCCCAG AAGCGAAAGA TTTCGGCCTG TAACCAATTT CTATACTTTC 3000 

TCTATCAAAA AGGAGAGGTG GACAGCTTTT ACCGCTTGGA ATTAGCCAAA CAAGCTGAAA 3060 

AGAAGACGGA AAAGCCAGAG ATTCTATACC TAGACTCTTT TTGGCAGGAA AGCGACCATC 3120 

CAGAGGGCCG CTTGCTAGCG CTCTTAATCC TAGAAATGGG GCTCTTGCCC AGTGAGATTT 3180 

TAGCCATCAA GGTTGCGGAC ATCAATCTGG ATTTTCAGGT GTTGCGAATC AGCAAGGCTT 3240 

CCCAACAGAG GATTGTCACC ATTCCCACGG CCTTGCTTTC AGAATTGGAA CCCTTGATGG 3300 

GGCAGACCTA TCTTTTTGAA AGAGGAGAGA AACCCTATTC TCGTCAGTGG GCCTTTCGTC 3360

AGTTAGAATC TTTTGTCAAG GAGAAAGGTT TTCCATCCTT ATCA6CTCAA GTCTTACGTG 3420 

AACAGTTTAT TCTAAGACAA ATAGAAAACA AGGTCGATTT GTACGAAATT GCAAAAAAAT 3480 

TAGGATTAAA AACAGTCCTG ACCTTAGAAA AATATAGATA ATGGATATTA AATTAAAAGA 3540 

TTTTGAAGGA CCCCTGGACT TGCTCTTGCA TCTGGTTTCT AAGTACCAGA TGGATATCTA 3600 
CGATGTGCCC ATTACX3GAAG TCATCGAACA GTATCTAGCC TATGTCTCAA CCCTGCAGGC 3660

CATGCGTCTG GAAGTGACGG GTGAGTACAT GGTCATGGCT AGTCAGCTCA TGCTGATTAA 3720 

GAGTCGTAAA CTCCTTCCGA AGGTAGCAGA AGTGACAGAC TTGGGGGATG ACCTGGAGCA 3780 

GGACCTCCTC TCTCAAATCG AAGAATATCG CAAGTTCAAG CTCTTGGGTG AGCACTTGGA 3840 

AGCCAAGCAC CAAGAACGGG CCCAGTATTA TTCCAAAGCG CCGACAGAGT TGATTTACGA 3900 

AGATGCGGAG CTTGTGCATG ACAAGACGAC CATTGACCTC TTTTTGACTT TTTCAAATAT 3960 

CCTAGCCAAG AAAAAAGAGG AGTTTGCACA AAATCACACG ACGATCTTGC GGGATGAGTA 4020 

TAAGATTGAG GACATGATGA TTATCGTGAA AGAGTCCTTG ATTGGACGAG ATCAATTGCG 4080 

CTTGCAGGAT TTGTTCAAGG AAGCCCAGAA TGTCCAAGAG GTCATCACCC TCTTTTTGGC 4140

AACCCTAGAG TTAATCAAAA CCCAGGAGTT GATCCTCGTG CAAGAGGAGA GTTTTGGAGA 4200 

TATCTATCTC ATGGAAAAGA AGGAAGAAAG TCAAGTGCCT CAAAGCTAGA CTTGATAGAG 4260 

AGGAAAGATG AGTACTTTAG CAAAAATAGA AGCGCTCTTG TTTGTAGCGG GTGAAGATGG 4320

GATTCGGGTC CGCCAGTTAG CTGAACTCCT CTCTCTGCCA CCGACAGGCA TCCAGCAAAG 4380

TTTAGGAAAA TTAGCCCAGA AGTATGAAAA GGACCCAGAT TCCAGTTTGG CTTTGATTGA 4440 

GACAAGTGGT GCTTATAGAT TGGTGACCAA GCCTCAATTT GCAGAGATTT TGAAGGAATA 4500

CTCTAAGGCG CCTATCAACC AGAGCTTGTC TCGGGCTGCC CTTGAGACCT TGTCCATTAT 4560 

tgcctacaaa CAGCCGATTA CGCGGATAGA AATTGATGCC ATCCGTGGAG TTAACTCGAG 4620

TGGAGCCTTG GCAAAGTTGC AGGCTTTTGA CCTGATAAAG GAAGACGGGA AAAAGGAAGT 4680 

ATTGGGGCGC CCCAACCTCT ATGTGACTAC GGATTATTTC CTAGATTACA TGGGGATAAA 4740 

CCATTTAGAA GAATTACCAG TGATTGATGA GCTTGAGATT CAAGCCCAAG AAAGCCAATT 4800 

ATTTGGTGAA AGGATAGAAG AAGATGAGAA TCAATAAGTA TATTGCCCAC GCAGGTGTGG 4860 

CCAGTAGGAG AAAAGCAGAA GAGCTGATTA AGCAAGGCTT GGTGACGGTT AACGGCCAAG 4920 

TGGTGCGTGA ACTAGCAACC ACTATCAAGT CAGGCGACAA GGTCGAAGTT GAAGGTCAAC 4980 

CTATCTACAA CGAAGAAAAG GTCTACTATC TGCTTAACAA ACCACGCGGT GTGATTTCCA 5040 

GTGTGACAGA TGATAAGGGT CGCAAGACGG TTGTCGACCT CTTGCCCAAT GTCAAAGAGC 5100 

GTATTTACCC TGTGGGTCGT TTGGACTGGG ATACATCAGG TGTCTTGATT TTGACCAATG 5160 

ATGGGGACTT TACAGACGAG ATGATTCACC CTCGTAATGA GATTGACAAG GTTTATGTCG 5220 

CGCGTGTTAA AGGTGTGGCC AATAAGGACA ATCTCCGCCC CTTGACCCGT GGTCTTGAGA 5280 

TTGATGGTAA GAAAACCAAG CCAGCTGTTT ATGAAATTCT CAAAGTGGAC CCAGTCAAAA 5340 

ATCGCTCTGT GGTGCAGTTG ACCATCCATG AAGGGCGTAA CCATCAGGTT AAAAAGATGT 5400 
TTGAAGCTGT TGGTCTCCAA GTAGATAAGT TGTCTCGGAC TCGTTTCGGA CACCTAGACT 5460

TGACAGGACT CCGTCCAGGA GAATCCCGTC GTCTTAATAA AAAAGAAATC AGCCAACTAC 5520 

ACACCATGGC TGTAACTAAG AAATAATGAA ACGAATTTTA ATAGCGCCTG TGCGCTTTTA 5580 

CCAACGTTTT ATCTCACCAG TCTTTCCACC CTCTTGTCGC TTTGAGCTGA CTTGTTCCAA 5640 

CTACATGATT CAGGCTATTG AAAAACATGG GTTTAAGGGG GTATTGATGG GCTTGGCTCG 5700 

GATTTTACGT TGTCATCCCT GGTCGAAAAC AGGTAAGGAC CCCGTTCCAG ACCGCTTTTC 5760 

CCTTAAACGA AATCAAGAAG GGGAATGAGG TGGGGTAAAT AGATTTCAAA ATGATAAAAA 5820 

CGCATCCTAT CAGGTTTGAG TGAACTTGAT AGGATGCGTT TTAGAATGTC AAAATTTTAT 5880 

ACTCTTCGAA AATCTCTTCA AACCGCGTCA GCTTTCATCT GCAACCTCAA AACAGTGTTT 5940 

TGAGCAACCT GCGGCTAGTT TCCTAGTTTG CTCTTTGATT TTCATTGAGT ATTAAATTGA 6000 

GTTTGAAGTG GCTTATTTCA AAGCTTTTTG TATGTCTTCA ATCATGAGTT TTGTTGATTC 6060 

AAGTCCGCCT CCGCTTAGAT ACCAGAGGTC TGGTGTTAGT TGGATAATCT TACCATTTTT 6120 

AGCAGCAGGT GTTTCAGCGA TAAGGGCATT TTCTAGGACA CCGTCGTTGC TAGAGTTGTC 6180 

CCCACCGATG GCAAGGGTAC GGTTGATGAC AAAGAGGATG TCAGGGTTGA TTTCTTTGAC 6240 

ACTTTCAAAG CTGACTTCTT GTCCGTGGCG TGAGTCTTCA AATTTTGTAT CAGTTGGTTT 6300 

GAATTTrAAG GTTTGGTACA AGATJiGACAA ACGAGATTTG GCACCAAAGG CTGCCA'l-'l-I'T 6360 

TCCTTCATTA AGGAGGATCG CAAGGGCTTT TTTGTCAGAG CTTTCATTTT TAGTAGCGAC 6420 

TTCTTGGATG CTCTTGTCTA GCTTGGTCAA TTCTTCCTTG GCTTTCTGTG TACCAGTTTC 6480 

GCCGAAGGCA CTTGCTAAGG ATTCGATATT AGCCTTGGTA GAAGTCCAGT AGTCGTCCTT 6540 

GCTTGCTTGG AAGAGAACGG TTGGGGCGAT TTCTTTGAAT TTGTCTACGA ATTTTTGTGT 6600 

ACGTGGCGAA GCGATAATCA AATCAGGCTC AAGGGCGGCG ATAGCTTCTA AATCAGGTTC 6660 

TTTCATAGAA CCAACATTTT TGACAGTTCC CACTAGGTCT TTTAGATAAG TCGGAACAGT 6720 

TTTTGTAGGC ATTCCGACGA TATTTTTTTC AAATCCTAAA GCGCGAATAG TATCCGCAGC 6780 

GCCGAGGTCA AAGGTCACAA TCTTTTCAGG AACTTTGGAA AGTTTGACCT CGTCCAGTGA 6840 

ACTTTTAATG GTTACCTCTG TTGGAGCAGA GCTACTGGTC TCTGTCTGAC TAGTGCTTGA 6900 

GTTTGTACTA CATGCACCAA GTAGGAGCAA GAAGCTGGCC ACTAGGGCAG TGAAATAAAG 6960 

TTTAAGGGAT GTTTTCATAA TTTCTCCTTT TTAAAATGTG ATAACGATTT AGGGAGTCTC 7020 

TTAATCTTAT TGACTAAGAG ACTGAAGGTT CTCTAACTTG AGCTTTTATG TTACTAGCTA 7080 

TAGATACAGA TCTTTTTGTC ATTGATATCA GCTAGCGTGA TGGGAATCTC ATAAAGTTGA 7140 
CTGAGCAGGT CAGCCTGCAT GATTTGATCG GTTCTTCCCT TGCTAAAGAC CTGGCCGTCC 7200

TTGAAGGCGA CAATTTCATC TGCATACTGA CTGGCCATGT TGATATCGTG GAGGACGATG 7260 

ATAATGGTCT TGCCGAGTTC CTCCACCAGT CGTCGAAGAA TCTGCATCAT GCTGACGCTT 7320 

TGCTTGATAT CGAGATTGTT GAGTGGTTCG TCCAGCAAGA TAAAGTCCGT ATCCTGGGCC 7380 

AGTACCATAG CGATAAAGAC GCGCTGGAGT TGCCCCCCTG ACAGGCTATT GATGTAGCGG 7440 

TCTTTTAAGT TGGTCAGTTC TAAATAGTTC AGAGTTTCTC GGATTTTTTC CCAGTCTTCT 7500 

GATCTAAGTC GACCTCGGCT GTAGGGAAAA CGTCCAAAAC TGACCAGTTC TTCAACAGTC 7560

AATTTGGCTT GGTAATTGAT TTTCTGTTTT AGGATGGTTA GTTCTTGGGC CAGTTCTTGC 7620 

GAATTCCAGC TCTCGATTTC ACGTCCTTTG ATACTGAGAA CTCCCTGATC TTTCTTGGTT 7680

AGCCTGCTCA TGATGGAGAG GAGAGTCGAT TTTCCAGCAC CATTTGGACC AATAAAGGCT 7740 

GTCAGTTTTT GAGGACTGAC TTCAAGCGAA ATGCCTTGCA AAATATCCTG TTTTTGAATG 7800 

GATTTGTCAA TGTTTTCCAG TTTCACTGAC GAGACCTCCT ATATAGTAAG ATAAAGAATA 7860

AGAAGCCACC CACACTCTCA ATGATCATAC TGATACGAAT TTCCAGTGCA AAGACTCGTT 7920 

CAATCAAGGC TTGCCCCAAG GTTAAGCTAA TAAATCCAAC CAGAATGGCC ACTATAAAGA 7980 

GTAACTTGTG CTGATAGTCT TTGACAATCA GGTAGGTGAG GTTGGCCAGT ATAAAGCCGA 8040 

AGAAGGCCAT AGGTCCTACC AAGGCAGTGG CCGTTGAGGT CAAAAGCACG ATTCCCCAGA 8100 

GGAGCTCT?T CTGTTCTxTT tCAaCATCGA GTCCCAATAT CTGAGCCGTT TCTCTTTGCA 8160 

GGTGCAAGAC ATCTAGAACG ACTGCTTTTC GAAAGAAAAA GATTGTCAAA GCGAGGATGA 8220 

TCAGAGAACC GATGGCTAGG ATGGAAGTGT TGAGATGTTG AAAGGAGGCA AAAAGACTAT 8280 

TTTGCAGTTT ATCGTATTCG TTTGGATCCA TTAGGACTTG AAGGAAGGTG CTGATATTTC 8340 

GAAAGAGACT TCTGAGCGCT AGACAGATCA GCAGGACGAA GACCAGGTCT TGCTTCATCA 8400 

GTGTCTTCAA GTAACCTTGT AAGGCGAGAA AGAAGAGGGA CTGGACAAGA AGTAAGACTA 8460 

GGAATTCTAA GATAGGGGAT TTGCCAAGTT GAAGAAACTT GCTTTCAAAA ACCAGTAGTA 8520 

GGGTTTGTAG TAGGACGTAG AAGGATTCAA TTCCCAAAAT ACTAGGCGTC AGGAAGCGAT 8580

TTTCCGTCAG GGTTTGAAAA CTAATGGTCG AAATCCCAGT CGCGATGGCT ACCAAGAGAT 8640 

AAACGATGAT CTTTTGGGAA CGCAACTTCC AAGCAAAGGC TGACAAGTGA GTGATGGGCC 8700 

AAAAGTAGAG AAGACAAGCT CCGATGGCAA GAATAATGAG AATCCAGAAG AGCTTGGTAT 8760 

GTTTGCTTTT AGTCTGCATC TTTTCGTCCC CCTCTCCAGA GAAGTAGGAT AAAGACGAGA 8820 

CTACCGATGA TTCCTAGCAA GAGACTGACA GACAACTCAT AGGGCCTAAT CAGAACTCGG 8880 

GATAGGATAT CGCAAGCCAG AACTAGATTG GCACCAACCA GTGCGACCAT GAGTTTGGTT 8940 
TGACTTAGAT TATCTCCATA GCGCTTGCGA ACAAGATTGG GAACGATAAC TCCGAGAAAT 9000

GGTAGGCCAC CCACGGTAAT CATGGTGACG CTTGTCGTTA GCGCCACCAG AAAGAGGGCC 9060 

AGTTTTTCAA GTAGGGAGTA GGAAATCCCC AAACTCTCGC TGGTTTCTTT CCCTAGATTC 9120 

ATGATGGTGA AGGTTTGGGA TAATTTCCAA ACGGTTATCA GGATGATGAG GCCTAAGAAG 9180 

AGCCACTCAT ACTGATGGGT CTGAATCATG GAGAAGGAGC CCTGGGTCCA GGCAGTCATA 9240 

CTCTGAACCA GATTGAAACG ATAGGCGATA ACTTCTGTGA CTGAGCCGAT AATCCCGCTA 9300 

TAGATGATCC CAATCAGAGG CAACATCCAC CTTTCCTTTA CAGTAAAAAT GGTCATAAAG 9360 

GCTAGGAAGA AGAGGGTGAA TACGATGGAT GAAACAAAAG CGAAGAGCAT CTTGTGGGTC 9420 

AGACTAGCCG ATGGAAAGAC AAAAAGGCTC AGCACCATTC CCAGTTTGGC GGCTTCAGTC 9480 

GTTCCAACTG TACTCGGTGC AGCAAACTGA TTTTGGGTAA TAGTCTGCAT GAGAAGGCCT 9540 

GCCATACTCA TACTAGAGGC AGTCAGGAGA ATACTGATAG TTCTTGGGAG ACGGGACTCT 9600 

TGAAAGAGGA GCCAGGTCTG CTGGTCGAAA TCAAATAGCT TTCCCCATGA AAAATCACTG 9660 

GTCCCAATGC TAATAGAGAG AAAGACTAGG AGTAGAAGTA AGCCAGG 9707 
(2) INFORMATION FOR SEQ ID NO: 165: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59X0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 165: 

CCGCAATTAT GCTTGAAAAG GAGTATACTT ATAAGTAACG CAAACGTTTG CGTCTGAAAA 60 

ATACGCAACG TTCCATTATT TTAACACACG AGGTGCTATT ATGAAAAAAC GTCAAAGTGG 120 

TGTGTTGATG CACATCTCTT CTCTTCCAGG AGCTTACGGA ATCGGATCAT TTGGTCAAAG 180 

TGCTTACGAC TTCGTTGATT TCTTGGTCCG TACAAAACAA CGTTACTGGC AAATCCTTCC 240 

ATTAGGAGCA ACTAGTTACG GGGATTCTCC TTACCAATCT TTCTCAGCCT TCGCAGGAAA 300 

CACTCATTTT ATCGATTTAG ATATCTTGGT GGAGCAAGGT TTGTTGGAAG CAAGTGACCT 360
TGAAGGAGTT GACTTTGGTA GCGATGCGTC TGAAGTTGAC TATGCTAAAA TCTACTATGC 420

ACGTCGTCCT CTTTTAGAAA AAGCGGTGAA ACGTTTCTTT GAAGTCGGAG ATGTTAAAGA 480 

TTTTGAGAAA TTTGCTCAAG ACAACCAATC ATGGCTTGAG CTCTTTGCTG AGTATATGGC 540 

TATCAAAGAG TATTTTGACA ATCTTGCTTG GACTGAATGG CCAGATGCAG ATGCTCGTGC 600 
TCGTAAAGCT TCAGCACTTG AAAGCTATCG TGAGCAATTG GCAGACAAGT TGGTTTACCA 660

CCGTGTGACT CAATACTTCT TCTTCCAACA ATGGTTGAAA TTGAAAGCTT ACGCTAACGA 720 

CAACCACATC GAAATCGTTG GGGACATGCC AATCTACGTA GCGGAAGATT CAAGTGATAT 780 

GTGGGCAAAT CCACATCTCT TCAAAACAGA TGTCAATGGT AAGGCTACTT GTATCGCAGG 840 

ATGCCCACCA GATGAGTTTT CTGTAACTGG TCAGCTTTGG GGTAATCCAA TCTATGACTG 900 

GGAAGCAATG GACAAAGACG GCTACAAATG GTGGATTGAA CGCTTGCGTG AAAGCTTCAA 960 

AATCTACGAT ATCGTTCGTA TCGACCACTT CCGTGGCTTC GAATCTTACT GGGAAATCCC 1020 

TGCTGGTTCC GATACAGCAG CACCTGGTGA GTGGGTGAAA GGTCCAGGTT ACAAGCTTTT 1080 

TGCAGCCGTT AAGGAAGAAC TTGGTGAGCT AAACATCATC GCAGAAGACC TTGGCTTCAT 1140 

GACAGATGAA GTGATCGAAT TGCGTGAACG TACTGGCTTC CCAGGAATGA AGATTCTTCA 1200 

ATTTGCCTTC AACCCAGAAG ACGAAAGCAT TGATAGCCCA CACTTGGCAC CTGCTAACTC 1260

AGTTATGTAC ACAGGAACAC ACGATAACAA TACGGTTCTT GGTTGGTACC GTAATGAGAT 1320 

TGATGATGCG ACTCGTGAGT ACATGGCTCG TTACACGAAC CGTAAAGAAT ACGAAACAGT 1380 

GGTACACGCT ATGCTTCGTA CAGTATTTTC ATCAGTTAGC TTTATGGCAA TTGCAACTAT 1440 

GCAAGATTTA CTAGAATTGG ATGAGGCAGC TCGTATGAAC TTCCCATCTA CCCTTGGTGG 1500 

AAACTGGTCT TGGCGTATGA CTGAAGATCA ATTGACACCA GCTGTCGAGG AAGGTTTGCT 1560 

TGACTTGACA ACAATTTATC GCCGAATTAA TGAAAATTTG GTAGATTTAA ArtAAATAAGA 1620

CAATAATCAG GAGACAACTA AACATGTTAT CACTACAAGA ATTTGTACAA AATCGTTACA 1680 

ATAAAACCAT TGCAGAATGT AGCAATGAAG AGCTTTACCT TGCTCTTCTT AACTACAGCA 1740 

AGCTTGCAAG CAGCCAAAAA CCAGTCAACA CTGGTAAGAA AAAAGTTTAC TACATCTCAG 1800

CTGAGTTCTT GATTGGTAAA CTCTTGTCAA ACAACTTGAT TAACCTTGGT CTTTACGACG 1860

ATGTTAAAAA AGAACTTGCA GCTGCAGGTA AAGACTTGAT CGAAGTTGAA GAAGTTGAAT 1920 

TGGAACCATC TCTTGGTAAT GGTGGTTTGG GACGTTTGGC TGCCTGCTTT ATCGACTCAA 1980 

TTGCTACTCT TGGTTTGAAT GGTGACGGTG TTGGTCTTAA CTACCACTTT GGTCTTTTCC 2040 

AACAAGTTCT TAAAAACAAC CAACAAGAAA CAATTCCAAA TGCATGGTTG ACAGAGCAAA 2100 

ACTGGTTGGT TCGCTCAAGC CGTAGCTACC AAGTACCATT TGCAGACTTT ACTTTGACAT 2160 

CAACTCTTTA CGATATTGAT GTTACTGGTT ATGAAACAGC GACTAAAAAC CGCTTGCGTT 2220 

TGTTTGACTT GGATTCAGTT GATTCTTCTA TTATTAAAGA TGGTATCAAC TTTGACAAGA 2280 

CAGATATCGC TCGCAACTTA ACTCTCTTCC TTTACCCAGA TGATAGTGAC CGTCAAGGTG 2340 

AATTGCTCCG TATCTTCCAA CAATACTTCA TGGTTTCAAA CGGTGCGCAA TTGATCATCG 2400 
ACGAAGCAAT CGAAAAAGGA AGCAACTTGC ATGACCTTGC TGACTACGCA GTTGTCCAAA 2460

TCAACGATAC TCACCCATCA ATGGTGATTC CTGAATTGAT TCGTCTTTTG ACTGCACGTG 2520 

GTATCGATCT TGACGAAGCA ATCTCAATTG TTCGTAGCAT GACTGCCTAC ACTAACCACA 2580 

CAATCCTTGC TGAAGCGCTT GAAAAATGGC CTCTTGAATT CTTGCAAGAA GTGGTTCCTC 2640

ACTTGGTACC AATCATCGAA GAATTGGACC GTCGTGTGAA GGCAGAGTAC AAAGATCCAG 2700 

CTGTTCAAAT CATCGATGAG AGCGGACGTG TTCACATGGC TCACATGGAT ATCCACTACG 2760 

GATACAGTGT TAACGGGGTT GCAGCACTCC ATACTGAAAT CTTGAAAAAT TCTGAGTTGA 2820 

AAGCCTTCTA CGACCTTTAC CCAGAAAAGT TCAACAACAA AACAAACGGT ATCACTTTCC 2880 

GTCGTTGGCT TATGCATGCT AACCCAAGAT TGTCTCACTA CTTGGATGAG ATTCTTGGAG 2940

ATGGTTGGCA CCATGAAGCA GATGAGCTTG AAAAACTTTT GTCTTATGAA GACAAAGCAG 3000 

TTGTCAAAGA AAAATTGGAA AGCATCAAGG CTCACAACAA ACGTAAATTG GCTCGTCACT 3060 

TGAAAGAACA CCAAGGTGTG GAAATCAATC CAAATTCTAT CTTTGATATC CAAATCAAAC 3120 

GTCTTCACGA GTACAAACGC CAACAAATGA ACGCTTTGTA CGTGATCCAC AAATACCTTG 3180 

ACATCAAAGC TGGTAACATC CCTGCTCGTC CAATCACAAT CTTCTTTGGT GGTAAAGCAG 3240 

CTCCAGCCTA CACAATCGCT CAAGACATTA TCCATTTAAT CCTTTGCATG TCAGAAGTTA 3300 

TTGCTAACGA TCCAGCAGTA GCTCCACACT TGCAAGTAGT TATGGTTGAA AACTACAACG 3360

TxACtGCALJC AAGTTTCCTT ATCCCAGCAT GTGATATCTC AGAACAAATC TCACTTGCTT 3420 

CTAAAGAAGC TTCAGGTACT GGTAACATGA AATTCATGTT GAACGGAGCT TTGACACTTG 3480 

GTACTATGGA CGGTGCTAAC GTGGAAATCG CTGAGTTGGT TGGAGAAGAA AACATCTACA 3540 

TCTTCGGTGA AGATTCAGAA ACTGTTATCG ACCTTTACGC AAAAGCAGCT TACAAATCAA 3600

GCGAATTCTA CGCTCGTGAA GCTATCAAAC CATTGGTTGA CTTCATCGTT AGTGATGCAG 3660 

TTCTTGCAGC TGGAAACAAA GAGCGCTTGG AACGTTTTTA CAATGAATTG ATCAACAAAG 3720 

ACTGGTTCAT GACTCTTCTT GATTTGGAAG ACTACATCAA AGTCAAAGAG CAAATGCTTG 3780 

CTGACTACGA AGACCGTGAC GCATGGTTGG ATAAAGTCAT CGTTAACATT TCTAAAGCAG 3840 

GATTCTTCTC ATCTGACCGT ACAATCGCTC AGTATAACGA AGACATCTGG CACTTGAACT 3900 

AATACTCTTC GAAAATCTCT TCAAACCACG TCAGCTTTAT CTGCAACCTC AAAGCAGTGC 3960 

TTTGAGCAAC TGCGGCTAGC TTCCTAGTTT GCTCTTTGAT TTTCATTGAG TATAAGATAC 4020 

AAATTTATAC TAATACATTT TGTAAAAAAG CGAGTTTCGA TTGAAATTCG CTTTTTTAAT 4080 

GATGTAGATT TGGGTCAATC TTGTCTAAAA ATAGGGAAAT CCTAGATACA GTGAAGGCTT 4140

TAAATGCTGG TTTTTACTGT CCTCAGCCTT ATATTTTTTC GTAGTTGGTT ACCTCATATC 4200

TATTATATTC GCTTACATAA AGTATTATAA TATAATTGTA GGAAAGAAGG TGTTTTTATG 4260 

ATATACACAC TTAAATTGGT GTTGTTTATT ACCTTTCTTG TAATAAGCTT GTTACCTGAT 4320 

AAGATTTTTG GAAAAAATAA AAAAATTTGG AAAATAGTTT TTGCAATATT GACGGCAGTG 4380 

GCAGCATTGT CATTTATGTA CTAAGTTATT TTAAGAATGT AGGGAAATAA ACCCTACATT 4440 

CTTTTTAGTT TTTTCTGTTT TCTAAATTCT ATTTATCCAA GCGATTCAAC ATTTCTTGCT 4500 

TCTTCGCTTC AAGTTCTGCA CGCTTTTCTT CGATTTCGGC ATGTTTTTTC TCGAGTTCAG 4560 

AACAACTTGC ACCATTGCTA AATTCTTTTC GCCATCAGGA GATAGGGTGA GTCGACATGT 4620 

CTATTACTCA CCCAAAGCAG TCCTACAAAG CAGGAATTTT CTGTTACTTT TTTGGAAATA 4680 

GTAACGTTTA TACAGCTTTG ACACTTCGTA TCAAAGCGCC AAACACACTC CGAGGGGTTT 4740 

ACAGAAAGCA GAAAAGGAAT GATCTGGTAT AAGATCATTC CTTTTCyCTC TTTTTCTTTA 4800 

AGTAATTATA TACAATGTAC GACGAAGTCG TCATTGCAAT GCTGATCCAC CACCTAAAGG 4860

GAACTTTAAA CAACATTGAT AAGATAAAGA ATATAAACAA CGAAAATACG TTATACCCAA 4920 

TTAATTTTAT TGTATATCTC ATGATTAAAA GTTAATCCTT CCGTTGTTAG GAATGGCATC 4980 

ATTTTTATCC CATAATTGTG CTAAATAAGT CCCCGGTGAT AATAAATTCA TAGCGAATTC 5040 

TAAAGCAACA TCATTTACAA ACCAACTACC TAGATATCTA GAAATTGCTG AACGAATAGC 5100 

ACTTTTTGCT GCATGTTTTC CTTTTACTTT AATTAGATTT GCAAGGCCTG PAGTACTTCC 5160

TCCTAATGCT AAAGCTATTG CAGTATCTAA TAGAGCACCC ATTTGATTAA CTGTAATACC 5220 

TTGCCAAACT GCTCTAAATG GAGAGTATGT AGGTGGGATT GTATAATCGC CTTGTAATTG 5280 

TCGGTTAATT ACTTCTTTGA TCCATTGTTG TGAGACGTCT GGATGAAAAG ATTGGATTTC 5340 

GTTTGCAAGT GTATTGATTT GTTCTTCTGT TAGAGAAGTG ACAGGTTGAA GTTCCATATT 5400 

TGTTTCAATT TGTGATACTT GTTCAGAAGC GTATACAGCT GAAACACTTG GAATCGCTGA 5460

TACAATTAAC ACAATTGACG TCAAAAAAAC CGAAATAAAT TTCATTAATT TGTTCATGAG 5520 

CTTTTCTCCT TTTTATTTGC ATCTGCTTAC ATTTTATCAT ATACTGTTAT TATAGTCAAA 5580

AAAATATGCT ATTATGTTAA AAAAATATTT TTCAAAATAT AAATGGACGG ATTTATTTTG 5640 

GATTTTATTT GTTATTTTGA CCTGCCTCTA TATTGGTAAC CATGATTTGT TTACTCTCAA 5700 

TCATCAAGAA TTCTCTTTTC GTGGTAGCGT TTGGGGTCTG GTACTGGCCT TATATCACTT 5760 

ACTATTCATT GATAAGTTTG TTATATCGAA TCGAAAATAA AGATTAGAGC TATGCTTGAC 5820 

TGTGTACTTT TAGGATTTAT TTTGGAGGAA GATTTTGTCT CTATTATTTA TTATTTTAAA 5880 

TTTATTTATT TTGTATAAGA TCTATTCTTT 5910

(2) INFORMATION FOR SEQ ID NO: 166:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5406 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 166:

CTCATTTTTT CAACTGTCCA lagactaatt TAACCTCAGT 60

ATCCGTTACT TCTGGAACCT CTATCATAGC A i LA i AAA 1 \^ 1 \J\3 1 1 Lj I t.A AAAGGTCTGC 120

TAAGGGACAA CCCATAGTTG TCAAAGTCAT o I L AA 1 L 1 L. i 13 1 1 1 0(_Ct_TG TGTCACCGTC 180

AAAACGAATC TCATAGATCA AACCAAGATT Vj A(_ AA TA TLtj ATTCCCAACT CAGGGTCGAT 240

AAGGCTGTTA AAATCCGTGT III OA 1 \j 1 1 r TCAATTTGCT CTTCTGTATA 300

AGCCATATTT TCCTCACTCT TAGTCTTCAA 1 AAAA i. l,At_\j aagcggtttg CTACGACTTG 360

GTTGGCGTAG TTTTCTCAAA GCCTTTGCTT r* a aT'PTT' a rr* CGAGTTACGT 420

TAAAGACTTT CCCCACATCT TCAAGTGTGC \3\^r\l ill A 1 LA 1 L 1 Ao i CCAAAACGTA 480

GACGCAGAAC ATTTTCTTCA CGGTCTGTAA r; a T* a T i n Kiivj 1 rV X L J. rvn V.3A 111 LA i LL AATTGCTCAC 540

GCAAGACGAT ACGAGTCGTA TAATCCACTG GATTTTCAAT cacttcatct TCGATAAAGT 600

CTCCAAGG'i'G GCTATCGTCC TCTTCACCGA TAGGAGTTTC aagagatact GGTTCTTGGG 660

CAATCTTCAA GATTTCACGA ACCTTATCAG GTGTCATATC cattcgttca GCAATCTGTT 720

CTGGTGTCGG ATCTTGCCCC AATTCTTGAA GGAGATTCCG ctgttcacga ACCAATTTAT 780

TGATAGTTTC AACCATGTGA ACTGGGATAC GGATGGTACG AGCTTGGTCC GCAATAGCAC 840

GAGTGATAGC CTGACGAATC CACCAAGTTG CATAAGTTGA AAACTTGAAC CCTTTAGAAT 900

AGTCAAACTT GTCAACCGCC TTCATCAAGC CCATATTTCC TTCTTGAATC AAGTCAAGGA 960

ACTGCATACC ACGACCGACA TAGCGTTTGG CAATGGAAAC AACCAAACGA AGATTGGCTT 1020

CCGCAAGACG TTGTTTGGCT TCGATATCAC CAGCTTCAAC AGCCAGTGCC AACTCTTTCT 1080

CCTCTTCATT GGTCAAGAGA GGAACGACCC CTATTTCTTT CAAGTACATA CGGACAGGGT 1140

CATTGACCTT AGCAGAAGTT GACCCAATCA AGTCCTCATC GCTGAGTTCT GGTTCTTCTT 1200

CATTGCTGAG AACACGCGCA CTTGGATTTC CTTCGTTATC TGTGATAGAA ATGCCTGCAT 1260

CCTGAATCCG TTGCAAGAGA TCTTCAATCC CATCAGCGTC CAAGGTAAAA GGAATAACCA 1320

GACTTGCATT GATTTCATCA TCTGTTGCTG TCCCTTTTTG CTTATGATTA CGGATAAATT 1380

CTGCTACCTG TACGTCAAAT GTTGTTACTT CTTTTTGTTT TGTTGCCATT ATTACTCCAT 1440

TCTTCTCTTT TGGGAAATTA AACGTTCCAA TTCTTCTAGG GCTGTATCTG TATCTCCTAC 1500 

ATGGCTAGCT TCCTGCACCT TCTTTTTGAT TCTCATATTG TCCTGATTCA AGAGAGCCTT 1560 

GTTTCGAGTC ATCTCTACTT CACTAAGTTC CTGCGGCGAT ATCTCAGCAG GCAAATCCTG 1620 

AGCTAAAACT TGGTACCAAG CTCTTTCAAC TTCCTCTGTC TGCTCTGCTA AAACTTCTGG 1680 

AGGAAGATTT CCATACTGGC CAAGCAAGTC ATATAAGACC TGAAATTCAG GTGTAGCAAA 1740 

TGCAAAGTCT TCTCGCAAAC GGTAATCGTT CAAAACAAGA GGGGATTCCA TCATCCGATA 1800 

GAGTAGATGG GCTTCTGCCC TCATAATAGC CGATAACTGC TTGGTGACAG GCATGGTGAT 1860 

TGGCGTCGGT CTGGAAATTC CTTCCATGCG ATTCTGCCTT TGCACCTGAC GACTCTCATT 1920

AACAATCTGC TCAATCTGGG TATAATCAAA GGACGCCAGA CTGTCAGCTA AAATATGAAT 1980 

ATAGCTGTTT TGAGCAGCGA TGGACTTTTC TTGAACAATC AAGGGAGCTA TTTTTTCAAG 2040

AAACTCAATC TGAGCCTGCA GATTTTCACT GTTTTCAGGT TTGTACTGAT GAATGTAGAA 2100 

CTCAATCGGA CTAATACGAG TTTTCGTTAA TAGATAGGCC AAGTCTTCTG GACCATTTTT 2160 

TTGTAGATAC TCATCAGGAT CCAAGTTATC AGGCATGCTG ACGATTTGCA CAGGCATATC 2220 

ACCAATTTCA TCCAATGCTT TCAATGTCGC GGCTTGCCCA GCCTTATCTC CATCGTAAAC 2280 

AAGAACCAAT TTCTTGGTTA ACCTTTTCAG ATGCTCAACA TGCTCTCGAC TCAAGGCTGT 2340 

TCCCATCGAC GCCACAGCAT TTTCGATTCC AGCCCGATAG GCTGCAATAA CATCCATC^J^. 2400

TCCTTCCATC AGGTAAATCT CACTAGCTTT TCCAGAAGAT CTTTTTGCCC TATCCATATG 2460 

ATATAATTCG TAACTTTTGT TAAAAATTGC AGTCGATCGG CTGTTTTTAT ACTTAGAAGT 2520 

TTGTGAATCC GTTTTTTGCC AGATACGACC TGAGAAGGCA ATGACCTTTC CTTGGTCATT 2580 

TGTCAGGGGA AACATAATGC GATTGTGAAA GGTGTCTACA AATTGATTGG CATCCGAGAG 2640 

ATAAAACAGG CCTGAATCCA GTAAATCCTC TTCACGATAC TGATCAGACA AACGTTGATA 2700 

GAGATAGTTT CGTTCTGGAG GTGCTAAACC AATCCAAAAA TGTTTAAGCA CTTCATCTGT 2760 

CAACCCCCGC TGATAAAGGT AATTTCTGGC CTCTTCGCCC ATAGTCGTTG TCATGAGAAT 2820 

AGCATGGTAA AATTTGGCTG CATCTTCGTG CATATCATAA AGAGCTTGGT GAGGTGAGGC 2880 

TGACTTCTGC TCACTATAAA GCGGTTTTTC AACCTCAATT CCAACACGCT GACCTAAGAT 2940 

TTGGACTGCT TCTATAAAGG GAACCCCTTG GTACTCCTCG ATGAACTTAA AGACATCACC 3000 

TGAGCGACCA CAACCGAAAC AGTGATAAAA CTGCTTGTCC TCTACAACAT TGAAAGATGG 3060 

TGTTTTTTCA CCATGAAAAG GACAGAGCCC TAGATAGTTC CGTCCTGCCT TTTGTAAAGA 3120 

AATCACATCT CCTATGACTT CCACAATGTT GGCATTGTTT TTGATTTCTT CAATGACTTG 3180

TTTGTCAACC ATACACAATA CCTCCATGTT ATCATAGTTT ACTTTATATA GTATACTTTA 3240

TTTCAGAAAA AAAGTAAACC ATTTCACTCA TTTTCCCTAC TTTATTCAAA GAGTTGATAA 3300 

TAATCAGAGA TTTTCATTTT TGCTTTTTCT TCTTGGTTTA AATCTTGGAT 7U\TTCGTCCT 3360 

TCTTTCATGA CAATCAAGCG ATTGCCGTAT TTGAGAGCAT CTTCCATATG ATGAGTAATC 3420 

ATAAGGGCTG TTAGCTGATC TTTCTTAACA AATTCATCTG TCAATTCCAT CAAAGCAACA 3480 

CTAGTCTTTG GATCCAGGGC AGCAGTATGC TCATCTAACA GGAGTAATTC AGGTCGCTTC 3540 

AAGGTTGCCA TCAAGAGACT CAAAGCCTGT CTTTGTCCAC CTGATAAGAA CTCAATCGGT 3600 

GTATTCAAGT GTTTCTCAAG ACCATTTCCT ACTTTTTCAA TGGTTGCCTG AAATTCATCC 3660 

TTATAGCTAG TCAAGCGTCG TGGTAACAAT CCACGCTTTT CACCACGAAA CTTGGCGATT 3720 

AAAAGATTTT CAGCGACCGT CATACGGGGA GCTGTCCCCA TCTTTGGATC TTGGAAGACA 3780 

CGAGACAGGT ACTTGGCACG CTTCTCGGGT GAAAACTTAG TGAGATCTTC ACCTAAAATA 3840 

CGGATAGTTC CACTAGTTAG TGATAAGGTC CCTGCTATAG TGTTAAAGAG AGTTGATTTT 3900 

CCAGCACCAT TTCCGCCCAA AATCGTGATA AAGTCCCGTT CAAAAATTTC TAAGGAAACA 3960 

TCATTTAAAA TAATCTTTTC TTCATCAAAG CCATTTTTAA CGATTTTGGT TGCATTTTTT 4020 

AATTCTACAA TTGCTGTCAT TTGCTTAACT TGGCTCCTTT CAAGATTGTT TGCTTAAATG 4080

TTGGAATCAT GAGGCAGACT GCTAAAATCA AGGCACTGTA TAAACGAAGG TAACTTGTAT 4140 

TAAAGCCAAG TGCGATAACT GCCCACACTA AAAATTGATA AGCGATAGAA CCTACAACGA 4200

TAGTAACCAA ACGCTCTGCC AAGCTCAAAC TCTTGAAAAT AACTTCTCCA ATAATCAAAC 4260

TTGCAAGCCC CACAACGATA ACCCCGATCC CTCGAGACAC ATCGGCATAA CCTTCTTGCT 4320 

GAGCAATGAG GGCACCTGCA AGGGCAATCA CACCATTTGA TAAGACCAAG CCCATGAGCT 4380

CCATGCGTCC AGTATGAATC CCGAAACTTC TAGCCATATC AGGATTATCC CCTGTAGCAA 4440 

TATAGGCTTG TCCGAGTTTA GTGTCCAAGA AAAAGAGCAT GAGAGCAATA ACAATACTCA 4500 

CAAAGATGAG ACCTGTCAAG AGTTGATTCA AATCCGAATC AAAAGGCAAA ACATCCTGAA 4560

TTTGCTTGGT TCCAAGCAGG CCTAAATTCG CACGTCCCAT AATCAAGAGC ATGATTGAGT 4620

GACAAGAAGT CATCACCAAA ATCCCTGAGA GCAAGGTTGG GATCTTCCCT TTTGTATAAA 4680

GAAGGCCTGC TGCCATTCCA GCCAAACAAC CTGCTCCTAC AGCAACAAGT GTCGCTAAAA 4740 

ATGGGTTCAC GCCTTTGGTT ATCAAAGTGA CAGCAACAGC TCCCCCAAGA GGGAAGGAAC 4800 

CTTCTGTCGT CATATCTGGA AAGTTTAAAA TCCTAAATGT CATAAAGATT CCCAGACCTA 4860 

GAATAGCCCA GACAAATCCT TGAGAAATAA TGGAAACAAT CATATTTTAT TTAATCCTTT 4920

CTATATTCAT CTTTTTAAAA AATGGGAAGA GTCTCCTCCT CCCTACCTTA TTTATTCGAT 4980

GACTTGTCCT GCTTCTTTGA GAACAGACTC AGGAATAGTA ATACCTAGTT CTTGTGCTAT 5040 

TTTTTTATTG ATGACTGACT TACCAGTTGA AAAGACATTG ACTGGGGTAT CGGCTGGTTT 5100 

TGCACCTTTC AAGACTTGCA CAATCATTTT ACCTGTTGCC ACACCAAGGT CATGTTGGTC 5160 

AATTACAACT GATGCCAAAC CACCTACTTC TACCATAGCT GTCGCACTGG GATAAATTGG 5220 

TTTCTTAGAA CTTTGATTGC TAGAGACAAC CGTTGGAAAT CCTGATGCAA TGGTGTTATC 5280 

AATTGGAACC CAAATAGCAT CTACCTTGCT AGTCATAACA GTGACAGTTG AGGCAATTTC 5340 

ATTTGTTGAA GGAACTGCAA ATGTTTCCAC TGTCAGACCT GCCTTTTCAG CATAAGCCTT 5400 

AAATTC 5406

(2) INFORMATION FOR SEQ ID NO: 167: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9711 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 167:

CAGCTTGCTC TTACTATTAT AGCAGATGTT ATAGCTGGAA TTATCTTGTA TTTCGTCTGC 60

AAATGGCTAG ATGGTAAGAA GTAGACCGAA TCACTAGCCT ATAAACACCC Gl-rAAATCGC 120 

TAAGATACGT CAAAAAAGCC CTTAACTATG GCACTAGTTA GGGGCTTTGG TGTTCTAATG 180 

AACCTTATAC ACTAACTACA TTCTAGCATA TAAGCCCAGA TATTTCAAGA GTTTTATTTA 240

TTGTTTAAAG TTCTGAAAGG TCTATAATGA AGTTAGCCAT CTAGTATCAA AAAACCGACT 300 

AGCTCTTATG AACTAGTCGA TTTCTCATCA ATGCGCCAAC ATTTCTTGGG CGATTTCTTG 360

GCCAGATAGG TTATCTGGGT AGTAGGTTGG CCAGTTGTCC ATTTCTTCAA AGAGGGCTTC 420 

TTGGCTTGTG CCTCCAAAGA AGATATGGAA ATGTTCTGCC TTAACTGGGG CAACATTG'iG 480 

GTCACTAAAC TGAACATACT TGAATTGTCC AGCGTCAGCA TCTGTGGCTT CAAAGAGGAA 540 

ACGCACGCCA CGATTGCCTT TCTTGTAAGT CAAAATTTTC TTACCGACAT ACTTGTAAGT 600 

GTATTTCTTG CTTTGTCCAC CTTGAACAAA TTCCATAGTA TTATCAGTAA TGTTAATCTT 660

AGTCACATCT GTATGATAGC CTTTTGTATA GTAAGCCTTG TACTCAGCCT GGGTCATCTT 720 

ACCAGTCAAC TTAGCCTTGT AGTCAAAGAC TTGGTCAAAC GTGCCGTCTT CAAGGAAAGG 780 

ATAAACTGAT TGCCAGTTAC CTGCATAGTC ACTCAAGGTG CGGTCCTTGA CAGCTGCATC 840 

CTCGAAGTAA CCATTTTGGA CTGTCTTGGT ATCCTCTGCC TTTTCAGGTT CAATTGCTGG 900

GCCTTCTTGG TCTGTTGTTT GTTTCAAAGC CTTGAGGTTT TTCTCCATCA CGGAAATGTA 960

GTTTTCTCCA GCCTTGGTGT CCTCTTCTGT CAGACTTTCT AAAGGATTGA GGACATCAGT 1020

TTTGACACCT GCTTCTTTTG AAAGTGTGTT AGCAAGGGCT TGTGAGGCAT TTCTTCAAAA 1080 

TAGATATAGG CGATTTTATT TTTCTTGACA TACTCTGTCA ATTCTGCCAA GCGAGCAGCT 1140 

GATGGCTCTG CATCTGGAGA AAGTCCTGAG ATTGCGACTT GTTTGAGTCC ATAGTCCAAG 1200 

GCAAGATAGT TAAAGGCTGC GTGTTGAGTC ACAAAGCTCT TTTGTTTTGC TTGAGACAAA 1260 

CCTTCTGCGT AAGCCTTATC CAAGGCTTGC AATTTTTCGA TATAGGCAGC TGCATTCTTC 1320 

TCAAAGGTCT CTTTTTTATC AGGATAATCT GCTGACAAGC TGTCGCGGAT GTGCTCTACT 1380 

AGTTTAATGG CACGAACTGG TGATAACCAA ACATGGGGGT CAAACTCATG GTGATGACCT 1440 

TCTTCTCCAT GGTCATGGTC TCCCTCTTCT TCCTCGCCAC CTGGCAAGAG CAACATATCG 1500 

CCTGTCGCCT TGATGGTTTT CACTTTTTTC TTATCCAAGG TATCTAGCAA TTTAGGTACC 1560 

CATGTTTCCA TGTTTTCATT TTCATAAACG AAGGTATCTG CATCTTGGAT TTTGGCAACT 1620 

GCCTTGGCAG ATGGTTCGTA TTCATGAGGT TCTGTCCCAG CACCGATTAG GAGTTCTACA 1680 

TTAGCCGTAT CTCCTGCGAC TTGCTTGGTA AATTCATAGA CAGGGTAAAA GGTTGTCACG 1740 

ATATTGAGTT TACCATCTGC CTGTTTTTGA TTGGAACAAG CCACTAAAAA CAAGGCACAT 1800 

AGACTGGCTA GTAATAAGCT AATTTTTTTC ACGTTCGTCT CCTATTTGAT AAAACGTCTT 1860 

ACTAAACTGA TTAGTATAAA GACAGTTACA AAAATAATGG TAATACTTGC ACTTGCAGGT 1920 

GTTTCTGCAT AGTAGGAAAT GTAAAGTCCT GCTACCATTC CCAAAAAGCC AATCGCACTG 1980 

GCAAGCAGCA TAACCGATTT AAAGTTTTTC CCCAGACGCA GGGCAATACT AGCTGGCAAG 2040 

ACCATAATGG TCGATACCAG AAGAGCTCCT GCTGCAGGAA TCATAAGGGC AATAGCCACC 2100 

CCTGTCACCA TGTTAAAAAG AATGGACATG GTACGAACTG GCAAGCCATC CACAAAGGCC 2160 

GTATCTTCGT CAAAAGTTAA GATATACATA GGACGAAGAA AGAGAAAGGT CAAAATCAAA 2220

ACAACCGCCG CAATGACAAA GAGGGAAATG ACCTGTTCTT CACTGATAGT CACGATCGAA 2280 

CCAAAGAGAT ATTGGTCCAA ACTCATTGAA CTCGAGCTTT TACCCTTGCT CATGACAATC 2340 

AGAGAAACAG CCAGACCTGT TGACATGAGG ATAGCTGTCC CGATTTCCAT AAAGCTCTTG 2400 

TAAACCGTAC GGAGATACTC CAGAAAGACC GCCGCAATCA AGACAATGGC AATAGTAGAA 2460

ACAGTTGGAG AAATCCCCAA AACCAGACCA AAGGCTACAC CTGAAAGTGA GACGTGGCTA 2520 

AGGGTATCAC TCATCAAACT CTGACGACGC AAGATGAGGA AGGTTCCCAA TACCGGTGAG 2580

AAAAGACTCA TAGCAATAAC CGCCAAAAAG GCGCGTTGTA TAAAGTCGTA AGATAATAAA 2640

CTAAGCATGG CCCACCTCCT GGCCATTCTC ATGAACATTG AAACAACGCC ATGGCGAGTC 2700

TTGGTTACGG ACTAGATGAA TATTGCGATC CGCATAATCC TTAACTTCTT CAGGGTCATG 2760 

GGTAATCATC AAAACAGCCT TGCCATGATG ATGGGCGCTG TGGTGCATGA GTTCGTAAAA 2820 

TTCATTTTTA CTTCCTGCAT CCATCCCCGT TGTCGGCTCG TCTAGGATAA ACACATCAGG 2880 

GTCAGAAGCA AACATACGCG CAATTACCGC TCGCTGCTTT TGTCCCCCAG ATAGAGACCC 2940

CAAGCGTTTG TCTCGATGTT CCCACATGCC AACTGAGTCC AGACTAGCCT TGATATGCTC 3000 

CTCATCATGA GCATTCAAAC GACGGAACCA GCCTTTTCTC GGATAGCGAC CCGACTTGAC 3060 

AAATTCATAG ACCGTACTTG GAAAACCAGC ATTAAAACTG GCAATTTGTT GAGGAAGATA 3120 

GGCTATTCTC AATTTCTTAC CTTGCGTATT TGTCTTTGAA ATAGCCACCT TTCCAATGCG 3180 

TGGTTGCAGA ATTCCAAGAC TAGCCTTGAT GAGCGTCGTC TTAGCCGCTC CATTTTCCCC 3240 

AGTCAAGGTA ACAAATTCCC CACTATCAAC ACAATAATTG ATATGTTCAA GAACAGGCTC 3300 

CTTATCATAA TAGAAGGACA AATCCTCTAC CGTAATATAT CTCATTATTT GATTTCTCCT 3360

ACTAAAGCAG TCAAAAACCG CTGAATCACT TTTTGTTCAT TTGGAGTAAA CTGAGTCGCC 3420 

ACTTGTTCAT AGGTTAAAAG TGTATGCTCA TGGTGATGGT GGTGCTCCTC AGCGATTGGA 3480 

CGAGCCAAGT CAGTCAACTG ATAAAAAATC ACACGCGCAT CTTTAGAATC TTTAGATGTT 3540 

TCCAACATCC CTTCCTTGAC CAAAGACTTA ATGGCCTTGG TAACTGCCGC CTGACTGACA 3600 

TTGAGACGAC GGGCCAATTC TGAATTTGTT AAAGATTCCT CTGAC^iAGAG CATAAGGATA 3660

TGCTCCTGAG TATTGGTCAG GGCCACCTCG CTAGTGCAAT GACCTATTAG GATTTCATGC 3720 

TGATTTTCCG CCTGCAAAAT CACCTCATTC AAAAAAGCAT TGATATCCTT TGCTAGCTGT 3780 

CTCATATCTG ACTCCTTTCC TTTTAGACTT CTCTTTTTTA AGAGAAAAAT ACTATTCTTT 3840 

GACATTTTGT TTACCAGTTA ATTATATCAC AAGCAAAAAA AGAGTCAAGA AAAAACGTGA 3900 

AAACTAGTTT CATTCTTGAA CTCTTCTATA TTATATTATC TATTGAAATT CTTTGACATC 3960 

TCCATCATAA GTCGCCCAAT CTTTGCTGAA AAAGCGCTCA TTCAGATGGT AAGTCGGAGC 4020 

TGGTGTGGGA TTGGATAGGA AAGGATCAAC TGCCTTGTCA AAAGCCAACC AACCCAACCA 4080

ACCAAGGTGA ATGGTGTCCT TCATAAAGAA AGGCTCCCCG CCGTCCTTAG AAAAATCTGC 4140 

TATATTGGTA AAACCTTGAC TTTCTAACTG GTAGCGAATC TTCTGCACCG TTTGTTGGTA 4200

CATATCCTCT CGTAGACCAG CATAGTTCAT CCATTTTTTA TTAACAGGTG GAATGATAAA 4260

AATCGGGTTT ACCTTAGATT TAGAAAACTG TGTTAAAACC AACTGCAAGT CATTATACTC 4320

TGGCGACTTG AGATAGGTAA AGCTTTTCTG AGAATCCTTT AATTTCTTCA AATCCTTCTT 4380 

QATCTGCTCA TTATAGAAAT AATTTTCCAT TCCCATCTCA TTATTGGAAG TATTTTTTTC 4440

AGCATCTGCT TTGACAACAT CTTCTATTGC CTGATAAGAA AACTGGTCTG GCAAGATTTT 4500
TAAATACTTA GCTACATGCT TATCGTAGTT AACATAGCCT CTAACCGAAA ACTGACCAAA 4560 
AAAGGAAGCT TGGCGTTCAT TAAAACGAGC CAATAATTCA ATCATTTCAT TGTCTGCTGT 4620 
CGACAATTCT TCTTTACTTG CCAACTTCTG AACCAGGTCC TTCATAGCTA CGTTTGGGAA 4680 
CTGTTGCAGT AAGCGAGTCG CTGCATATTG ACTAGCCTGA TCCCCAGATT GATGTTTCAG 4740 
AAAACTAGTC AACTGGTCTC CATTAAAATA CTGCTGGAAG GCTGCTGGAT CATAGCCATT 4800 
TTTACTGAAC CACTGAGGTG AGATAACATA CACAACTTGT TTATTCTCCA GCTGTGGTAA 4860 

CATCTGTTGC ATTCCAAAAT ATTGGTTAAG CGATGCAGCT CCCCCCTGTC CTAAAAGATA 4920

AGGACGGTAG GAACGATTGT ATTTCTCAGC TAATACCGCA GGATGAGCAC CGTCAAAACG 4980 

AAGCCATTCA CTAGAGCCAA AGAAGGGAAC AAAACGCACA TTTGGATCAG ATAGTGCTCT 5040 

GACTTTTTGA CTTCGCTCCT TAAAACTATC GATAGTAGTA GCCACTGCTG AACGCTTTTC 5100 

AGCTCCTAGA TTATGATGCA TCTCAGTAGG ATAAAAGAAA ATGAGCAGAA AAACCAACAA 5160 

ACCAGCGATC AAGACCGGTC CGAAGATCAT CCATAAGCGT TTAAGCATTT TGTAGCTCCA 5220 

CAATACCAGC TATGATTTTA TTAGCTGTAT TCCAGTCGTC ACGACCAAAC TCTGTTACAG 5280 

GGACACGAAT GTCAAAACGG TTCTCAATCT CCACAATCAA CTCAACCGTT CCCATACTAT 5340 

CCAAGACACC TGCATCAAAA AGATCTTCAT CCATCATGTC AGAAACATCT TCCATAAACA 5400 

ACTCATCAAT AATTTCAATA ACTTCTGATT TGATATCCAT ATTTTATTTC CTTTTATTTT 5460 

TTAAACCATA GATTATTCAA GAATCCAGAA AAGATTAAGA ATGACAACAT GACAACATGG 5520 

AAAGTGACAA CCATGCCAAG CAACTGAATC CAGCGATTCT CAGGTAGGGC AGCCTTCCCT 5580 

GCTTTTTTCC GTTCCTTATT GAGCGTTTTT TTCTTGCGAA CCCAGGCATC ATTGATGACC 5640 

AAGCCTAGTC CATGAAAGAG TCCATAGGCG ATATAGTACC AGGTCACACC ATGCCAAAAT 5700 

CCCATAATCA GCATATTTAC AATGTAGGCC ATGCTTGAGG TTACATTACG ATTTTTAAAG 5760 

ACTTTCTTTC TGGTTAACAC CATCACCATT CGCATAAAGA CAAAGTCACG GAACCAGAAG 5820 

GACAGACTCA TATGCCAGCG ATTCCAAAAC TCCTTTAAAT CCCTTGATAA AAAGGGCTTG 5880 

TTAAAGTTGA TAGGGCTACG GATTCCCATC AAGTTTGAGA TGGCCAAAGC AAACATAGAA 5940 

TAACCTGCAA AGTCAAAGAA GAGTTCCAGA CCAAAAGTAT ACATAACTGC CAAGGCATAG 6000 

AGATTAAAGA AGCCACCTGA CTGCAAGGCT AAATTCTTCA GAGGAGGTAG TAAGGTCTCT 6060 

CCTAAAACAT GAGCTAGGAT AAACTTATAC AAAAAGCCCC ACATGATATA GCGGACAGAT 6120 

TCATCCAGCA TATCCATCAA CTCATCTCGC TCAGGAATAG CCTGATAATT TTCATTAAAT 6180

CGCTTAAAGC GATCGATTGG ACCACTCGAG AAAGTTGGCA TGAAGAGAAG GAAACGGAGG 6240

AATTCCCAGA GGGTAAAATC CTTAATCACT CCATCTCTCA GCTCGATGAC AATTCCAACC 6300

GAACGAAAGG TCAGGTAAGA AATTCCCAAG AACCCAAGCA AAGACTGCGT TCCATTGATA 6360

GCTGGTTGCA CCTTGACAAA GATAATCGGA AGTAGGGACA GAAAACTAAC TAAGTAGAAG 6420

ACCCACTTGC CATCCTTGCT TTTTCGATAA TGCTTGTAGA AAAGCAGGAG CAATATTTCC 6480

CAGCAAAGGT AAATACCCAA GGCAGCTAGT TGATTGGTCT TTCCACCCAC CAACATGGTG 6540

ACAATAAAGA AGAGACTTAC CAACACTTCA TACCAGGCAA AGCGTTTCTT GAAAAAGAGA 6600

CCTATAAAGA TGGGCAAGGT TGCAGCAATC ACATAAACAA AATACTGAGG ATTGCCATAT 6660

GGCTCTAAAT GAGGAAGCTG TTGAAAAAAC TCCATCATCT CTTATTCACC TCGTTAATCA 6720

ATCCTTTGAT GTCAATCTTT CCATTTGGAG TTAGTGGCAA ACTGTCTCGG TAAAGGAATT 6780

TAGATGGCAT CATATAGGAC ATCATGATGT CTGTCAGGTC TTCCTTGATG GCCTTGGTAA 6840

TATCGATATC TCGCTCAAAC TGCTCACGAA CACCGTCTTT TAAGATGACA TAAGCCAATA 6900

GATTTTGTAC CTTGTGGTCC TTGTTATAGC GCGGTACTGC GACAGCAGAT TCGATAAAGC 6960

GAGACTTGTT GAGGTTTTGA GAGACATCTT CTAACTCAAT GCGGTAACCG TTAAACTTAA 7020

TCTGGAAGTC CATGCGTCCG CCGTAGAGAA GCAAGCCCTC ATCTGTCATG GTTCCCACAT 7080

CGCCTGTGTG ATAGGCTGGC AGATCTTCAA ACTCAAAGAA GGCTTCTGCT GTTTTTTCAG 7140

GATTGTTCAT ATAACCTTTT GAAACAGCTG GCC.C.AQ AAAC CCCTGCTCAC 7200

CATTTGGCAG TTTATTTCCT TCCTCGTCAA TGATAAAGGT TGGAGAATCA GCCTTGGTAT 7260

AGCCGATTGG TAGGCGTTTG AGAGTCGCTA ACATCTCGTC TGTCACGGCA ACTGCTGACA 7320

GAGCTACTGT CGCTTCTGTT GGGCCGTAAG CATTGATGAT ACGGGCATTT GGGAAAGGCT 7380

CGCGCAGTTT TTGAGCTGTT TTGACCGTCA ATTCTTCACC ATCAAAGTAG AAATGCGTGA 7440

TTCCAGGCAT TTTCTCACTG TTGAAGTATT CAGACAACAT GGCCATATCT GCAAAGGATG 7500

GTGTTGATGT CCAGATAGCG ATTGGCAATG AAAAGATAGC CGCAAAGAGT TGCTTAAACT 7560

CCTGAGTGAT GACTGAAGGA AGAGTGAAAA GCGTACCACC AAGTGCCAAG GTCGGTGCCC 7620

AATACATGAC AGACAAGTCA AAAGAATAAG GTGGCTGTGC CAGCATTTGC GGACGACTCG 7680

GTGTCGCAAA TTCCTTATCC GTAATCATCC AGTTTGTAAA GCTGAGGAGA TTATCATGTG 7740

AAATCTGCAC TCCCTTAGGC TTACCAGTCG TACCAGAAGT AAAGATAATG TAGTAATTAT 7800

CATCTCCCTT GACTGGATGC GTGATTTCAT AGTTATTCCC TTGGGCAAAG GCTTCTTGAA 7860

CCTGAGCTAG ATTTATCATT GGTGTAGAAA CCTGCTCCAA GGGAAAGGCT GAAATGGCAA 7920

TAATCAAGCT TGGCTCTGCT ACTTCTAAAA TAGCTGAAAC TCGCTCCAAG GCCGAATGGC 7980

TATCAATTGG AATGTAGGCA TGACCTGACT TAGTCAGCGC TACAAAGGTT GCCAACATTT 8040

CATATTCTTG GCCACCAAAA ACAACCACAG GAGACTTCTC AGGCAAGCCT AGTTGGTCAA 8100 

TGACTGCAGC CAAACTATCC GAATCAGCCT TTAAATCGCC ATAAGTGTGT TCCTGCCCCA 8160 

AAACATTATA GACAGGATAG CTAGGCTGTG TCTGAGCAAA ATGCTCAATG GTTTCAATCA 8220 

TATCTGCTAT TGGTTTATTT GACACAATAG GGATTCTCCT TCAAGTTAAA ATTCATTATA 8280 

GATAAAGCTT CCTTGACCCT GACCAAGATA GCTAAAGAAG TAAAGCAGCC CTAGAAAGAT 8340 

AAGAAAATAC AAGGCTGTCC GACCAAGAAA GAGGTACAAT TCTTTTCTCT GTTTCATCAA 8400 

GAAAAACCAT TCATTTCTGT AATTTTTCGC TAAAATAAGA GTGATTCTTA CTAGCTTATT 8460

TTTCTACCAT TGTACCACTT TATATAGTAT CTTTTCAATT GTTTACCGTA TGTTTCCAAT 8520 

AGATTTCAGC TTATTTTAAG GATTATACAG TTTTTCTATG TATATTTTCA AATAGAGTGA 8580 

TCCTGCTTCA AAACTCCATT TCAGGAGACA ATGAAGTAAA TCTTCCCATA ATAAAACACA 8640 

CAATATCAAG TTTTTTCAAC ACCTGATACT ATGCGCTTTT CTGATTTTTA AAGACTTTTT 8700 

AACCACTCTC TCATTTAAAA TAATCTCGTC TGATATAAAT TAAAATAGCT TCTATCATCA 8760 

GACAAATGGC TGATAGCCAA AAACTGATGC TAATACCAAA ACTCTCAGTA ATATAGCTCA 8820 

TTAGCAAAAC AAATACTGAA AATGCTAATG TAGAAATCAC TTCAAGAACG GAATAGACAT 8880 

TAACTAAATG ATTTTCCTCT ACTGTTTCCT GAAGAAATAC ACTTTCAGGA ACTTCTTTTA 8940 

CTTGCGATAA CATACCAACT AAAGCTGAAA ATAATAAAAA CATCTGTGCG TTTGGAAAAT 9000 

ATAGAATAGT CAGTGTCACT ATTTCCATAG CTACAAGAGG AAAAAGAATA CTTTCCCCCC 9060 

AAATCATTCA TACCTCTCTC AACTAGATGT AACTTACAAA ACCCCTGACC TCATGAGCCA 9120 

CTTTCTTCCT CCTCATGAGG TCAGTTTTAC TTTCTGCTGT TCCAGTATCG TTTTTCCTCG 9180 

CTAGATTTCC TCAAAAGGGC AGACTCCTCC CTTGGTGCGT CACACGATTT TTTCATCTCG 9240 

ACTGTTCTTT AATGCATCAT TAACGACGCT TTTCTTCTAG GTGGTTCATA AGGAACAGGA 9300 

AGATTCAGGT TGACTTTTCT AATCCTAGAA TAAAGTGCTG AAAACAATTC GGAATAGGCA 9360 

TAGAGACTAG ACAATTTGAG GAGCTGCTTG CGTCCTGTTC GAACACATTT TCCCACCACG 9420 

TGAAGAAAAA GATGGCGGAA GCGTTTGATT GTTAAAGTTT GGAAGTCACC TCCAGCTAGA 9480

TGTTTGAGAA AAAGATAGAG ATTGTAGGCG ATACAGCTCA TCATCATACG AACTTCGTTT 9540 

TTGATTAAGG TTGAACTATC CGTTTTATCG CCAAAAAATC CCTCCTTCAT CTCCTTGATG 9600 

AAATTCTCGG CTTGACCACG TCCACGATAA AGCTGAAACT GGTCTTGGcT gTTCCACTCG 9660 

TCATATTTGT AACGAGAGAA ATAACATCGT AGAACAAGTA TCCTTCTTTT C 9711

(2) INFORMATION FOR SEQ ID NO: 168:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3025 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 168:

CCCCTTTGTC AAAACTGTAA AATTAACGAC TCAACAATTC ATCTTTACAC CAATCTCAAT 60

GGAAAACAAA AACAAATTGA CCTCTGTCAA AACTGCTATA AGATTATCAA AACAGATCCT 120

AACAATAGCC TCTTCAAAGG TATGACGGAT CTGAACAATC GTGACTTCGA TCCCTTTGGT 180

GATTTCTTCA ATGATCTAAA CAATTTCAGA CCTTCTAGCA ATACTCCTCC TATTCCCCCA 240

ACCCAATCAG GTGGAGGTTA CGGTGGAAAC GGCGGTTATG GTTCCCAAAA TCGTGGATCT 300

GCTCAAACTC CGCCACCTAG CCAAGAAAAA GGCCTGCTGG AAGAATTTGG TATTAATGTA 360

ACTGAAATTG CCCGTCGTGG AGACATTGAC CCCGTTATTG GGCGCGACGA TGAGATTATC 420

CGTGTCATCG AGATTCTCAA TCGTAGAACC AAGAATAATC CTGTCCTTAT CGGTGAACCT 480

GGTGTCGGAA AAACGGCCGT TGTCGAAGGT CTAGCTCAGA AAATTGTCGA TGGCGATGTG 540

CCACATAAAC TCCAAGGTAA ACAAGTCATC CGTCTGGATG TGGTTAGCTT AGTTCAAGGA 600

ACGGGGATTC GAGGACAATT TGAAGAACGC TCATGGAAGA aat-tcgcaaa 660

CGTGAAGACA TCATCCTCTT TATCGATGAA ATCCATGAAA TTGTTGGTGC TGGTTCTGCG 720

AGTGATGGTA ATATGGACGC AGGAAATATC CTCAAGCCAG CCCTTGCTCG TGGAGAACTG 780

CAACTAGTCG GTGCTACTAC CCTCAATGAA TACCGTATCA TTGAAAAGGA TGCTGCCCTC 840

GAGCGTCGTA TGCAGCCTGT TAAAGTCGAT GAACCAACGG TGGACGAAAC AATCACTATT 900

CTCAAAGGGA TTCAAAAGAA ATACGAAGAT TACCACCACG TTCAATATAC AGATGCTGCG 960

ATTGAAGCAG CTGCAACTCT TTCCAATCGC TACATCCAAG ATCGCTTCTT GCCTGACAAG 1020

GCCATTGACC TCCTAGATGA AGCTGGTTCT AAGATGAACT TGACCTTGAA TTTTGTGGAT 1080

CCTAAAGTAA TTGATCAGCG CTTGATTGAG GCTGAAAATC TCAAGTCTCA AGCTACACGA 1140

GAAGAAGATT TTGAGAAGGC GGCCTACTTC CGCGACCAGA TTGCCAAGTA TAAGGAAATG 1200

CAAAAGAAAA AGATCACAGA CCAGGATACT CCTAGCATCA GCGAGAAAAC TATTGAGCAC 1260

ATTATCGAGC AGAAAACCAA TATCCCTGTT GGTGATTTGA AAGAGAAAGA ACAATCTCAA 1320

CTCATCCATC TAGCCGAAGA TCTCAAGTCT CATGTTATTG GTCAAGATGA TGCAGTCGAT 1380

AAGATTGCCA AGGCTATTCG CCGTAATCGT GTCGGACTTG GTACCCCTAA CCGCCCAATC 1440

GGAAGCTTCC TCTTCGTTGG GCCAACTGGT GTCGGT7VAGA CAGAACTTTC CAAACAACTG 1500

GCTATCGAAC TTTTTGGTTC TGCTGATAGT ATGATTCGCT TTGATATGAG TGAATAPATG 1560

GAAAAACATA GTGTGGCTAA GTTGGTCGGC GCTCCTCCAG GTTATGTTGG CTATGATGAG 1620

GCTGGTCAAT TAACTGAAAA AGTTCGCCAC AATCCATATT CTCTCATCCT TPTPGATPAA 1680

GTGGAAAAAG CTCACCCAGA TGTTATGCAC ATGTTTCTTC AAGTCTTGGA pp aTPPTPr'T 1740

TTGACAGACG GGCAAGGACG CACCGTTAGC TTCAAGGATG CPATPATTAT l-A 1 VsAUV_ i t,A 1800

AATGCAGGTA CAGGAAAGAC CGAAGCTAGC GTTGGATTTG AL»AALiL>At,L» 1 1860

ACCAATTCTG TCCTCGGTGA ACTCGGTT^C TTCTTTAGCC r*AfIAC;TTTAT (^AAV-L-VjI 111 1920

GATGGCATTA TCGAATTTAA GGCTCTCAGC AAGGATAACC TGTCGAGCTC 1980

ATGCTAGCAG ATGTTAACT^ GCGCCTCTCT AGCAAfAAfA i i ill vjuA Hj 1 AACTGAT 2040

AAGGTCAAGG AAAAGTTGGT TGACCTAGGT A A A Trr: r" a ^ r* ACGCCCAcTT 2100

CGTCGGACTA TTCAAGACTA Ar* A ATr" APTr* TGAAAATCCA 2160

AGCGAAAAAG ATCTCAAAGC A\Ji X X f\ A \3rW^ X AnpA Af^rir' A A ML. A I 1 1, ALiA 1 TAAATCTGCC 2220

AAAAAAGCTG AAGTTAAAAG TTCTGAAAAA ^AAAAATAA A I ^v- i A i AAAA AAGGAGTAGA 2280

AAATGAAATT TTTCTGCTTC X X X X X I X rVt^ X AA AATAAPTr; TAATTTCTTG ACAGCTTGCC 2340

CTTTGTCCAT TATGATATAT AGTAGACTGA ATCTGAAATA riTAPr: A A m A A 1 TGCTAAAA 2400

CATTTATAGA AATTAATTTT ACTTTCCCAA TCGATTTGTT P'PP A T P TT^ a T V.-1L.A1L.1 lAi i. J. V-AAi L ILiL 2460

TATAGTCAAT TGAAACAAGA ACAAGACAAA AGAGCCTCAT AAA APHTaTT* OL. AAl_ i i L»(j 1 2520

AATACCTTTT TGAGGTGCTT TTTGATATGA GCCCATGTTT TPTPAATar^r* Al HjiALTLA 2580

GGTGAGTAGG GAGGAAGAGG TAAAAGTTTA TACCCAAACT CTTCACACAA GAGTTCTAAC 2640

TTACCCATTC TATGGAATCT TGCATTATCC ATAATAATAA CCGATGGTGT GGTTAATGTT 2700

GGTAAGAGAA ACTTCTGAAA CCAAGCTTCA AAAAAGTCGC TCGTCATCGT CTCTTCGTAA 2760

GTCATTGGAG CGATTAACTC ACCATTCATT TGTTAGACCT GCAACCAAAG AAATTCTCTG 2820

ATATCTTCTT CCAGATACTT TGCCTCTTCT TAACTGACCT TTTAATGAGC GACCATATTC 2880

TCGATAAAAA TAAGTATCGA ATCCTGTTTC GTCAATCTAA ACAGGTGCTA GGTGCTTTAA 2940

ACTATTAAAA TTCTTAAGAA ATAAGGCTAC TTTTTCTGGG TCTTGTTCAT AGTAGGTGTA 3000

GTTCTTTTTT TTTTCGAGTG TAGCC 3025


(2) INFORMATION FOR SEQ ID NO: 169: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4104 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 169:

TTTAAGGTTT 


TAAAAAAAGT 


TTTCGAAAGG 


TTTCTTCTTT 


ATTTTTTAAG 


GGAGAGATAA 


60 


CGTTGATATC 


TAAATCGTGG 


TCAAAGCCGG 


CAATTTTTCC 


TTTAGATGTG 


TATTGGTGAA 


120 


TATCATAATC 


TAAATCAGTT 


TTAGGACTGC 


TCTCCAAAAA 


TCCTGAGTCT 


GAGCCGTAGA 


180 


CGGAATCCAA 


ACAGAGGTAA 


ACTTGCCTGT 


ATCAATACTG 


TGTTCTTCCA 


TGAAGTAGAC 


240 


ACCAACGTAG 


ATGCCGATGT 


TTTTAGCACC 


CAGTGATGCT 


AGTTTTGCTC 


GAAAGTTTTC 


3 00 


GACACCTTCG 


TTCATATTAG 


ACATGGTTTT 


GTCTTCCACG 


TCAAGCCAAT 


AGTAACTAGG 


360 


GCTGTAAGGA 


GAGGCAGCAT 


TGTAGAAAAC 


TTCGGCAGCC 


TTTTCCATTT 


CTTGGACACT 


420 


TTTTCCAGCT 


ACATAAGCGT 


AGACAGCAAC 


TGGGACATTC 


CGCTTTTGAA 


GTTCAGTGAT 


480 


ATGACTCTTA 


TAGGCCTTGT 


CTATTCCATT 


GATAAATGAA 


GCATCATTTT 


CTTTTGTCGT 


540 


TTGAGCACCA 


CTGTGAACAC 


GAACAATAGC 


ACCTGAAATA 


TTTTGTGAGA 


GGGCATCGTA 


600 


GTTGATTTCC 


TCAGGACGCT 


GCCAGCCAGA 


GAGGTCAATA 


ATCGGTTTGT 


CTAAGTGTTT 


660 


CAAAGCCTGT 


GCTTCAATCT 


GTGCTATATT 


GGATTTTGTT 


TTAAACGATT 


GGCTGTCATT 


720 


AAGTGGGCGA 


TTGATGATTA 


AAATGAACAT 


CATAATCCCA 


AAAAAACTAA 


ATAAAATAAG 


1 a A 


TGGATGAATT 


TGTTTTCTCA 


TATCTTATAA 


TTCTACCCTA 


AAAATCAAAA 


AAAATCAAAA 


840 


AAATGGGTTA 


AGGAAGAGAC 


TTTAGAGCAT 


TTTTTCATTC 


AAGAGTGCGG 


AATGATTTGA 


900 


AATATGGTAT 


AATAAAAGGG 


AATTTCTACA 


GAAAAGAGAA 


GATTATGTCA 


AATTTTGCCA 


960 


TTATTTTAGC 


AGCGGGTAAA 


GGGACTCGCA 


TGAAATCTGA 


TTTGCCAAAA 


GTTTTGCACA 


1020 


AGGTTGCGGG 


TATTTCTATG 


TTGGAACATG 


TTTTCCGTAG 


TGTGGGAGCT 


ATCCAACCTG 


1080 


AAAAGACAGT 


AACAGTTGTA 


GGACACAAGG 


CAGAATTGGT 


TGAGGAGGTC 


TTGGCTGGAC 


1140 


AGACAGAATT 


TGTGACTCAA 


TCTGAACAGT 


TGGGAACTGG 


TCATGCAGTT 


ATGATGACAG 


1200 


AGCCTATCTT 


AGAAGGTTTG 


TCAGGACACA 


CCTTGGTCAT 


TGCAGGAGAT 


ACTCCTTTAA 


1260 


TCACTGGTGA 


AAGCTTGAAA 


AACTTGATTG 


ATTTCCATAT 


CAATCATAAA 


AATGTGGCCA 


1320 


CTATCTTGAC 


TGCTGAAACG 


GATAATCCTT 


TTGGTTATGG 


ACGAATTGTT 


CGTAATGACA 


1380 


ATGCTGAGGT 


TCTTCGTATT 


GTTGAGCAGA 


AGGATGCTAC 


AGATTTTGAA 


AAGCAAATCA 


1440 


AGGAAATCAA 


CACTGGAACA 


TACGTCTTTG 


ACAACGAGCG 


TTTGTTTGAG 


GCTTTGAAAA 


1500 


ATATCAATAC 


CAATAACGCT 


CAAGGCGAAT 


ACTATATTAC 


AGACGTCATT 


GGTATTTTCC 


1560 
GTGAAACTGG TGAAAAAGTT GGCGCTTATA CTTTGAAAGA TTTTGATGAA AGTCTTGGGG 1620
TAAATGACCG TGTGGCGCTT GCGACAGCTG AGTCAGTTAT GCGTCGTCGC ATCAATCATA 1680 
AACACATGGT CAACGGTGTT AGCTTTGTCA ATCCAGAAGC AACTTATATC GATATTGATG 1740 
TTGAGATTGC TTCGGAAGTT CAAATCGAAG CCAATGTTAC CTTGAAAGGG CAAACGAAAA 1800 
TTGGTGCTGA GACTGTTTTG ACAAACGGTA CTTATGTAGT GGACAGCACT ATCGGAGCAG 1860 
GAGCGGTCAT TACCAATTCT ATGATTGAGG AAAGTAGTGT TGCAGACGGT GTGATAGTCG 1920 
GTCCTTATGC TCACATTCGT CCAAATTCAA GTCTGGGTGC CCAAGTTCAT ATTGGTAACT 1980 

TTGTTGAGGT GAAAGGATCT TCAATCGGTG AGAATACCAA GGCTGGTCAT TTGACTTATA 2040 

TCGGAAACTG TGAAGTGGGA AGCAACGTTA ATTTCGGTGC TGGAACTATT ACAGTCAACT 2100 

ATGACGGCAA AAACAAATAC AAGACAGTCA TTGGAAACAA TGTCTTTGTT GGTTCAAATT 2160 

CAACCATTAT TGCACCAGTA GAACTTGGTG ACAATTCCCT CGTTGGTGCT GGTTCAACTA 2220 

TTACTAAAGA CGTGCCAGCA GATGCTATTG CTATTGGTCG CGGTCGTCAG ATCAATAAAG 2280 

ACGAATATGC AACACGTCTT CCTCATCATC CTAAGAACCA GTAGGAGCCT ATCATGGAGT 2340 

TTGAAGAAAA AACGCTTAGC CGAAAAGAAA TCTATCAAGG ACCAATATTT AAACTGGTCC 2400 

AAGATCAGGT TGAATTACCA GAAGGCAAGG GAACTGCCCA ACGGGATTTG ATTTTCCACA 2460 

ATGGGGCTGT CTGTGTTTTA GCAGTAACGG ATGAACAAAA ACTTATCTTG GTCAAGCAGT 2520 

ACCGCaAaGC TATCGAGGCT GTCTCTTACG AAATTCCAGC CGGAAAATTG GAAGTAGGAG 2580 

AAAACACAGC CCCTGTGGCA GCTGCCCTTC GTGAATTAGA GGAAGAAACA GCCTATACAG 2640 

GGAAATTAGA ACTCTTGTAC GATTTTTATT CAGCTATTGG CTTTTGTAAT GAGAAGTTAA 2700 

AACTATATTT AGCAAGCGAT TTGACAAAAG TGGAAAATCC GCGTCCGCAG GATGAGGATG 2760 

AAACCTTGGA AGTCCTTGAA GTGAGCTTAG AAGAAGCGAA AGAATTAATC CAATCAGGTC 2820

ATATCTGTGA TGCCAAGACA ATTATGGCTG TTCAGTATTG GGAGTTGCAG AAAAAATAGA 2880 

GGAGGTCAGT ATGGGTAAAT CTTTATTAAC GGATGAAATG ATTGAAAGAG CTAATAGAGG 2940

CGAAAAAATT TCAGGTCCTC CTTTGCTAGA TGATAATGAG GAAACTAAGA TTTTACCAAC 3000 

CTCTTCTTCC CGTTTTGGTT ATGCCAATCC TAAGGATCAT GGTTTTAGCC AGGAAACCTT 3060 

GAAGATTCAG GTCGAACCAT CTATTCATAA AAGCCGTCGT ATTGAAAATA CCAAGAGAAA 3120 

TGTCTTCAAT TCTAAGTTGA ATAAAATCTT ATTTGCGGTC ATCTTTCTCT TGATTTTGCT 3180 

TGTTTTAGCA ATGAAACTTT TGTAATAGAA AAGGAATTGA AATGAAAATA GGAATTATTG 3240 

CTGCTATGCC AGAAGAACTG GCTTATCTGG TCCAGCATTT AGATAATGCC CAGGAGCAAG 3300 
TTGTTTTTGG GAATACCTAT CATACAGGAA CCATTGCTTC TCATGAAGTC GTTCTTGTAG 3360

AAAGTGGAAT TGGTAAGGTC ATGTCTGCTA TGAGTGTGGC GATTTTGGCT GATCATTTCC 3420

AGGTGGATGC CCTTATTAAT ACGGGTTCAG CTGGGGCAGT AGCAGAAGGT ATCGCTGTTG 3480 

GGGATGTCGT GATTGCTGAC AAATTAGCCT ATCATGACGT GGATGTCACA GCTTTTGGCT 3540 

ATGCTTATGG ACAAATGGCG CAACAACCGC TTTATTTCGA ATCAGACAAA ACCTTTGTTG 3600 

CTCAAATCCA AAAGAGTTTA TCTCAATTGG ACCAAAACTG GCATCTTGGT TTGATTGCTA 3660

CAGGAGATAG TTTTGTTGCA GGAAATGACA AGATAGAAGC GATTAAGTCC CATTTCCCAG 3720 

AAGTTTTAGC CGTGGAGATG GAGGGGGCAG CTATTGCTCA AGCAGCGCAT GCCCTCAATC 3780 

TCCCAGTCTT AGTCATCCGA GCTATGAGTG ACAATGCCAA CCATGAAGCA AACATCTTTT 3840

TTGATGAGTT TATTATCGAA GCTGGACGTC GCTCTGCCCA AGTCTTGTTG ACCTTTTTGA 3900 

AGGCTTTAGA TTAAGCGGAA ATTTGACAGT TTTTCTAGCT TATGATAAGA TTTAAGTAAA 3960

GAAAAGCTAG AAAACGTTTC AGAGGATATT ATGAGTATTG AAATGACCGT CAGTGAGATT 4020 

GCAGAGGTCT TAGGATTATC TCGCCAAGCA ATCAATAACC GTGTCAAAGA ATTACCAGAA 4080 

GAAGACACAG ATAAAAATGA CAAG 4104
(2) INFORMATION FOR SEQ ID NO: 170:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 8876 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 170:

CACGGATAGG CTCGGCTTTC ATCAGTCCTC AGGCTGATTT ACTAATAGCA ACTTTCCTCG 60 

ACAAAGTCCA CAGCGATACG TnTGGGTATC AATCCTACGC TTACGCTGAT ACCTTTGCTG 120 

GCAGGATTGG CAACGATAGA GCTTGATTGG CTTGGAGTTA CTATTGGGCA AGGATGGTAC 180

AAACCGTAAT CCATCCACTG CTTTCAACAG TTCCTTAAAA TCCCGATCCT TGTGTTGATA 240 

GCCTTTCCCT TGAAAATAGA GGTGATAATG ACAGAGTTCA TGTCGGACAA TTTTCCTAAA 300 

AACGTCCAAC CCCAGTTCCT GATAAACCTT GGGATTAAAA TCCAAATGCC CATCTTTGGG 360 

GAAAAATCGC CCACCTGTCG AACGTAGACG CCTATTCCAC TGGACATGAT GGATAAAAGG 420 

TCTGCCGAAG TCTTCTAGTG AAACCTGCTT GACGTAATCA GTCAGTTTCA TTTGGAGCTA 480 

GGAGAGACAG ATTAACTTTT TCACGTTCAG TATCAATTTT CTTAACCCAA ACGCTCACCA 540 

AATCTCCAAC TGCCACCACT TGACTAGGGT GTTTGATAAA CTTGCGACTC ATATGGGAAA 600 
TATGGATGAG ACCGTCCTCA TGAATTCCGA TATCAACAAA AGCACCGAAA TCAACAACGT 660

TACGCACCAC TCCTTCTAGC TTTTGTCCAA CCACTAAGTC CTTGATATCT AGGACATCTT 720 

GGCGAaCACA GGTGCGTCAA AGGAATCACG GAAATCTCGA CCTGGTTTGA GAAGATCTGC 780 

AATGATATCT TTAAGAGTTT CTGGACCAAG GTCTAACTCT TGCGCCATTT CCTTGACTGA 840 

AAGCGACTTG AGTTTGCTTT GGGCTTCTTC GTTTAGGTCT TTAATATCTA AACGTTTGAA 900 

GAGTTCCTTA ACTGCAGTGT AATTCTCTGG GTGAACTCCT GTATTATCAA GGATATTGCT 960 

ACTTTCAGGG ATACGAAGGA AACCAGCAGC CTGCTCAAAG GCCTTGGCTC CCAGACGAGG 1020 

AACTTTCTTG ATTTGGGCGC GTGAAGTGAT TTTTCCTTCT TCCTCGCGGT ATTTGACAAT 1080 

ATTTTCAGAG ATAGTTTTGT TGAGTCCAGC TACGTGTGAA AGAAGAGCTG GGCTAGCTGT 1140 

ATTGACATTG ACACCAACTT GGTTAACCAC TGTATCGACA ACAAAGTCCA GACTCTCAGA 1200 

TAGTTTCTTC TGACTGACAT CGTGTTGGTA TTGACCGACA CCAATTGACT TAGGATCGAT 1260 

TTTGACCAAT TCCGCAAGAG GATCTTGCAA ACGACGGGCG ATAGAAATGG CAGAGCGTTT 1320 

TTCAACGGTC AAGTCTGGAA ACTCCTGACG AGCAAGTTCG CTGGCAGAAT AGACAGAAGC 1380

ACCACTTTCA TTAACGATAA CATAGCTGAC TTCAGGGAAA TCTTTCAGAA CTTCCGCTAC 1440

AAAAGCTTCA CTTTCACGAC TGGCCGTTCC ATTTCCAATG GCAATAATCT CTACACCGTA 1500 

TTGACCAATT AAATCTGCTA AATCTTTCTT GGCTTCTTCG ATTTGACGAG CTGATGCTGG 1560

TTTAACAGGA TAAATAACCt GAGTTGTCAG CATTTTTCCT GTTGCATCCA CGACAGCTAG 1620

CTTGGCACCT GTACGAAAGG CTGGGTCAAA TCCAAGAACC ACGCGCCCTT TCAGTGGAGC 1680 

AACCAAGAGG AGATTGCGCA GATTGTCAGA AAAAAGTTGG ATAGCTCCTT CTTCAGCTTT 1740

CTCAGTTAAT TCTGTCCGAA TACGACGCTC GATAGCAGGC AAGACCTTTT TCTTAACGGA 1800 

TTGCTGAACA ACTTCATCAA TATAAGCATT TTTCACCTTG AAACGAGTAG CAAAGAAGGC 1860 

AAGAATACGG TCCGTCGCAT GTTCAAAACC GATCTTCAAG ACACCAAGTT TCTCCCCACG 1920 

ATTGAGAGCC AAGGTACGAT AGCCTTGCAT AGTTCCAACT GTCTCTGAAA AATCATAATA 1980 

AATCTGAAAA ACCTGCTTTT CATCAAGACT TTCATCCTTG GCTTGAGAAG TAAGTTTAGA 2040 

GTGTCTCAGC ACTTCCTGAT AAGTCATAGA ACGCAAGGTC ACATCTTCCG ATAAGGCTTC 2100 

GACCAAAATA TCAACTGCAC CGGTCAAGGC TTCCTTGCCA GTCGCAAATC CTTCACAGAC 2160 

GAACTTTTCA GCTTCTTTCT CTAAGTCAAC TATATTCTGC AAAATCAAGC GAGCAAGAGG 2220 

AAAGAGTCCA GCTTCACGGG CAATGGTTGC CTTGGTACGA CGCTTTTCCT TATAAGGAAG 2280 

ATAGAGTTCT TCAACGTCTG CTAATTTTTC GGCAACTAAG ATAGCTTCTT CCAATTCCTT 2340 
GGTCAACTTA


CCTTGTTCTT


GAATCTTAGC


TAAGACAGCT 


TCCTTACGGT


CATTGAGATT


2400 


TGTCAGACTT 


TTATCCAAAT 


CAATAATAGC 


CTTAATCGCC 


ACCTCATCCA 


GACTACCAGT


2460 


CATGTCCTTG 


CGATAACGCG 


CGATAAAGGG 


AATAGTCGCC 


CCTTCAGCTG 


TCAAACTTAG 


2520 


AACGGTATCA 


ATTTGCTTTA 


ACGTCACTCC 


CAAATCCTGA 


GAGATTTTTT 


CATATTTTTT 


2580 


ATCCATAAAT 


CTATTATACC 


ACAAGCTAAA 


CGTTTCAAAT 


TAACTCGTAG 


AACATTTAAA 


2640 


AAATATGTAG 


GAAATAGATT 


TATATGCTAC 


AGCGCAATAA 


CTTGCACTTA 


AAGAGCATTG 


2700 


CCACCTTTTT 


TTAACCAAGC 


CATGATATCA 


AAAGTATTTA 


ATGGATCAGA 


CATAATAGCC 


2760 


AGTTCTGGAA 


GATGTTCCTG 


ACCTGGAATA 


ACACATTGAC 


TTTTCAAATT 


TTTATATGGA 


2820 


CGATTGACTA 


AAATTAATTT 


ATTAGAATAA 


GGAAGATTAT 


CCATCTTATT 


TAAAATTTCT 


2880 


TCACTAGCTG 


AATCTTTATT 


ATCAAATTTA 


AAATAAAGAT 


TATTCCAATT 


TATGCGTTTT 


2940 


TTTCTTTTTT 


CCCACTTAGT 


TCGTGCTTCT 


TCAATACTAG 


AATAATGTAG 


AAAATGAATA 


3000 


TCTATATCTC 


CTAAGTGCCC 


CAAAGGATAA 


ACTTCATGAG 


TCCAGCTCGG 


TGAAATAAGT 


3060 


TCCTCTTCGA 


AAACAAGTTC 


TTGTTCCATA 


TAATAACGAA 


AATGCTTTGT 


AAGTTTATAA 


3120 


TAATCATCAG 


GAAGAATAAA 


TAAACCAACA 


AAAGGTGTTC 


TATATTGAAA 


ACCAAGCTGT 


3180 


TTATAAATTA 


ATCCTCCAAC 


ACAATTATTA 


CTTATAATCG 


TAAAATCTAA 


TCTATCAAGC 


3240 


TCAAGAAAAG 


GGAAAATTCC 


TTTCTCTGCA 


GCTATl'AACT 


TATGATAAAC 


AATATCAGAA 


3300 


TCTAAATATT 


CACCGTCATT 


TTTTAACCAA 


GCACTAAAAT 


TTCCCAATTC 


TTGAATATAT 


3360 


TGTTTTTTCG 


CTCTTTCTAT 


ATCATAGTTT 


TCTAAGACGG 


CGCAATCTTT 


GATTCTATTT 


3420 


TCATAATTTT 


CTAATATGAT 


TTTGTAGGAG 


TCTTTTAGAG 


GTTTAGCATC 


TATAACAGGT 


3480 


TTATAGATAT 


ATGTCGGGAA 


ATTAATATAG 


GTTGCAGTTT 


TAGAGTGAAT 


ATAAAGTCTC 


3540 


CAAATAAGGT 


TGTTTATATC 


AAATTGATTT 


ATTTTTCGTA 


AAAGCTTACT 


ATTGAATAAT 


3600 


TTTCCAAATA 


ATGAGCGATA 


TTGTTTTCTA 


ATTCGATGAT 


CTGTATCATC 


CATCTTTTGT 


3660 


AAAACTTGAA 


CATTCGTTAA 


ATTTTCTGTC 


AACCAATTAT 


CCCCCCAAAA 


AGGATAAAjvG 


3720 


TAAAATACTC 


CATCAACCAA 


ATCAGCAAAA 


TGACCAAGAA 


CAACATCAGA 


ATCGGATAAT 


3780 


TTTATCGCAT 


GATACATCTT 


TTCAAATGTC 


CAATCAAATA 


ATGAATCATT 


TGAAGATAGA 


3840 


AACGTAATAT 


AATCTCCTGT 


AATCATATCA 


GACAACTCAG 


CAAAAGAATT 


CTCATCTATA 


3900 


ATCTTAATAT 


TAAATGATAG 


ATTCATCTGT 


TGGCTAATGG 


AAGCTATCTC 


CTCTGTAGAT 


3960 


TGATTTACAA 


TAATAACTTC 


TATATCTTTT 


AATGTTTGTC 


TCTCCACTAT 


TGACAAAGAC 


4020 


TCTAATAAAC 


TATTTTTATC 


TCCTTGATGT 


AACAAAACAA 


CACTAATTGA 


GTAAGTCAGT 


4080 


TTGACTACCT 


CCCATAATTT 


TCTGATAATG 


ATTTTCTTTT 


TATTTAATTA 


TAGCACAATT 


4140 
ATGATATATA


TCAGGTAATA


TCAAGCTATA


TTATCTCTTA


GCTACTCAAT 


TTGAAATTTT 


4200 


AACTTTTCCC 


TTTTCCGCAA 


AATAATAGTA 


TAATAGAGGT 


AGAATCTAGA 


ATCGAGGTAC 


4260 


ACCTATGGCT 


GTCAAATTTA 


CAAAACGAGA 


CGACTTGGAC 


AAGATGTTTG 


AAGAGTTTGC 


4320 


TAAACTCCCT 


GATTTGAAAC 


AAGTTACTTT 


CCCTGATGAC 


AAAGAGAAAA 


AAGTCAAAGC 


4380 


AGAAAAGAAA 


AACTAGATGA 


CTGCTTTTCA 


ACAACTCCCA 


TCTAGTGTAC 


TTCAAACTGG 


4440 


AGCCATTTTT 


CTCTCCATTA 


TCATTGAAGC 


CCTTCCCTTC 


GTTCTGATAG 


GAAGCATTRT 


4500 


CTCAGGGCTG 


ATTGAAGTTT 


ATATCACACC 


TGACAAGGTT 


TATCATTTTC 


TCCCTCRAAA 


4560 


TCGTTGGGGG 


AGAATCTTTT 


TTGGGACCTT 


TGTCGGTATA 


CTTTTCCCTT 


^ X X O X 1 \J 


4620


TGGAATCGTC 


CCCATCATCA 


ATCGTTTTCT 


GGAAAAAAAG 


GTTCCAAGTT 


A f A p r; rTT'OT 




TCCTTTTCTT 


GTGACAGCAC 


CTGTTATCAA 


TCCCATTGTT 


CTTTTTGrRA 


rr'T A T TT'TTT' 

V-^lnl 1 ^ J. 




CTTTGGCAAC 


TCCTTCCATG 


TCGCCCTATT 


ACGAGCTCTG 






4800 


AATACTAGGA 


ATTTTTCTAG 


GATTTTTCTG




nil v_/iV3M/\/\u 


& A a 1\,T*r*f^rnr^iT> 
J\/v\J\ L ULi i L 1 


4860 


GGCTTGTCAT 


GAGCATGATT 


X X 1 \^ A X X 1 




A A A A A & PTTT 


I 1 L AALf 1 C I 1 


4920


TGTGCAGGCC 


ATTGATGAAT 


TTTTTGATAC 




TTTim* A fPTT /~1 

X L i x\ X X X o 


uL I I L. 1 1 


4980 


TGCTTCTATA 


ATACAGGTCT 


ACGTTCCGAC 


TCGGATTCTG 


nV^\^ X X /\X v^-rV 






TCTTTTTGCC 


ATCCTGCTCT 


TGATGATTTT 


AGCCTTTCTT 


^ X X X X ^ 1 


/^T A f2 Tn A rr* r* 

\j 1 Avj 1 (jAoVjV- 


5100


GGATGCCTTT 


ATAGGTGCTT 


CTCTTCTCTC 


GAGTTTCGGT 




X X V. 1 V30V,Vi„ X -1 


5160


TCTCGTCATT 


GGTCCAATGC 


TGGATATCAA 


AAATATTCTC 


ATGATGAAAA 


ATTACTTGAA 


5220 


AGCACGATTT 


ATCAGTCACT 


TCATAACAAT 


TGTAACTCTT 


GTCGTCTTAG 


TCTATTCTCT 


5280 


CTTGATTGGA 


GTTATCCTAT 


GATTCGATTT 


TTAGTTTTAG 


CTGGCTATTT 


TGAACTGACT 


5340 




A i l„ i 1 ^uuVj 




LAti 1 1 LA 


ACATGCACTA 


TTCCTATCTG 


5400 


GCCTATATCT 


CCATGGTGCT 


TTCTTTTATC 


TTGGCTATCG 


TTCAATTGTA 


TATCTGGATG 


5460 


AAGCAAGTCA 


AAACCCACAG 


TCATCTGAAC 


AGCCGATTAG 


CCAAGATAAC 


GAGTATTTCT 


5520 


CTTCTGGCTA 


TTCCACTTGT 


CATCGGCTTA 


ACTTTCCCAA 


CTGTTAGCTT 


GGATTCTCAG 


5580 


ACTGTTTCTG 


CTAAAGGTTA 


TCATTTCCCC 


CTATCGGAAG 


GAACGGATCT 


AGCCATTCAG 


5640 


ACAAGCGAAG 


GGACGACAAG 


CCAATATTTG 


AAACCAGATA 


CCAGTTCTTA 


TTTTTCAAAA 


5700 


TCAGCCTATG 


AAAAGGAAAT 


GCGAACGGCG 


GCGGATAAAT 


ACTTATCCCA 


AGATAGTATT 


5760 


CAGATCACTA 


ATGAAAACTA 


TATGGAAGTC 


ATGGAGGCTA 


TCTACGACTA 


TCCAGATGAG 


5820 


TTTGAGGGCA 


AGACAATCCA 


GTTTACAGGC 


TTTGTCTATA 


ACGACCCCAG 


TCATGCCAAT 


5880 
AGTCAATTTC TGTTCCGATT CGGCATTATC CACTGTATCG CAGATTCTGG TGTCTATGGA 5940
TTGCTGACCA AGGGCAATAC CCGGCAGTAT GAAAACAACA CTTGGATAAC AGCCAAAGGA 6000 
AAACTGGTCA ATCACTACCA TAAAGAACTC AAACAAAACC TTCCAACCTT GGAAATCGAC 6060 
AGCTTTACCA AAGTCGATAA ACCAGAAAAT CCCTATGTAT ATAGAGCTTT TTAAGAAAAT 6120 
CAAGATAAAA ACGAACAAGT TCTCTTCTGA ATAACAGAAA AAGAGCCTGT TCGTTTTTTG 6180 

TTATATGAAA ATTAGTGACT TGTAGATTTT CATCTTATAC CATTCCCAGC AATACAAGTA 6240 

GCTCATAGAA AATAAGCGAG CCACTCATTC ATTAGACTAG CGATTTCTTT AGGTGCTTGA 6300 

GTATAAAGCT CATGGCCAAA GTTTTCTAAA AAAATAGTAT CAAAATAGTC TGGCAATTCT 6360 

TTTAGGGCTT CCTCTCTCCA TGTAGCTTCA TTAGGATAGC GAGGACTAAT AAACAAGGTA 6420 

TCTCCCACTT CTCTCTTAAA AGCTTGTATT TTTCTCCGTA GcGGAGTATC GCTTCTATAT 6480 

TTTCATAATT TATAGCCAAC TCATATCTAT TATACTCAAC ATTCCAGTGA TAAGACTGTC 6540 

TTACAGCTTT CTCCATATTT TCTGACCAAT GCTTTGCTTC AGATTTTTCT TTAGAAGTAA 6600 

GAACATCTAA GTCCGAAACA ATTTGAGATT TGATATAATT TTTAGTTTCC TCTAACTCTG 6660 

TATCCAAAGG TAAAATCTTA TCTAAATCTA GATAGCCACC ATCCAAAAGA ATCAGTTTCT 6720 

TTACTTCTTC AAATTCCGAT GCGAAATAAC GAGCTAAATC TCCTCCAAGA GAATGGCCTA 6780 

TCAGACAGAT AGATTCTTCC TCTACAATTT CATTTTTAAA CCATGATTTC AATTCTGTTT 6840 

CATCTCGAAG ATGCTTTTCA TATGGATTTA GAAAATAGAC CTGCGAATCT AGTTCTTCAA 6900

GAAAATCCTT GCTATGATAG GCATTGCTTC CCAAACCGCC AATAAAATAT TTTTTCATTC 6960 

TCTACTTAAT ACTATGCTTA TTCATCTTTT GTTCAAAGAT AGTTGTGATA ATCTGACGCA 7020 

ATTCTTCGCG TTTTTTTTCT GGAATCTCAC CACTTGTTTG AGCTACAGCG TAGAGTTCAG 7080 

GGTATTCAAT TGAAATGCGT TTAATCGTAC GTGTTGTAGC ATGTTTTCTG ACAAAAAACG 7140 

GGATTCGCTT AATCAAGTCT TGTGGGACTA GCGCCAGAAT CTTCTCAGTA GTTTCTTTGT 7200 

CACTAATATT AGACATTGTA AGCCTTTTCT TAATCATTTC CTGTTCTTTT TCTGTAAAAT 7260

CTTTTAATTC CATTCGATTA GTCCTCCTAT TTTCTCTAAG TTAAATTATG TACTAATACA 7320 

GATGAAACTA CAAAGAATAA ACTTTAAGAA ATCTTCTCAC TGATAAGATT TTAGCATTAG 7380 

ACTTCCTGCG AAACAAAATA TGGTATAGTA GTTCTATGAA TTATGAAGCA AGTAAACAAC 7440 

TAACTGATGC ACGATTTAAA CGTCTTGTTG GTGTTCAGCG CACGACTTTT GAAGAGATAT 7500 

TAGCTGTATT AAAAACAGCT TATCAACTTA AACACGCAAA AGGTGGACGA AAACCTAAAT 7560 

TAAGCCTAGA AGACCTTCTT ATGGCCACTC TTCAATATGT GCGAGAATAC CGCACTTATG 7620 

AAGAAATTGC GGCTGATTTT GGTATTCACG AAAGCAACTT AATCCGTCGG AGCCAATGGG 7680 
TTTAAGTAAC TCTTGTTCAA AGTGGTGTTA CGATTTCAAG AACTCCTCTC AGTTCTGAGG 7740

ACACGGTAAT GATTGATAGC CATTCCCATC AATATCGTAT CTTTGGACAT AGCCAATAAA 7800 

TGTTTCATTT TTGCGTGGTT TCTGGCTATT AACGATTGAA ATAACCCACC AACTTATCAA 7860 

AAATAGAAAT AAAAATCCTA AGATTACTGT CATATCATAA CACTATTAAA GTTTAACCCA 7920 

CTTATCATTA TCCATGATAA AAGGCTTAGC CAGTCCCTCG CCTGTATAAT CCGCATACTT 7980 

GGTGCCCAAA TACTTGTAGC AATCTTCCTT ACTAGCAAAT TTAATCGCTT GGTAGGGCTC 8040 

TTCGAAAGTC AATTTCTCTA CAAATAAGAA ACCGTCATCA GCAGGTACTA AGACCCCAAC 8100 

GTGGCCTACA AACAGATACT CGCCATCCAA ATTGTCGTGC AAGACTACAG ACAGCATTCG 8160 

AGCTTTTTCA TTGAATTGAA ATTGTGAGAA GAATGCTTCC ATCTTTTCAG CGTGAACCTT 8220 

GACATCTGTA GTTGACTCAG TTGGAACTCT CGAAAATAGA ATATCAAACT CTTCCTTATC 8280 

TTGTGAATCA AAGACCTTTC CTTTATCAAT CGCATCATTA TCTAGGAAAA GCAACTGGTC 8340 

ATTCTTTTCA AGCTTTGGAA TGGTGACTGA ATTTTTCAAA AGACAATAAC TATTGATACG 8400

GCAGTTGGTC CCAACAAAAT CGCCCTTCTT TTGATTCCAG AGATGACTGA TTTTCTCAAC 8460

ATCGTATTCG GTGTGAGTAA AGGAAGTGAA ATCTCCTGAT AAGCCAGTTG AGCCGACAAT 8520 

GGTATTATAG TCATTAACGA GATTAAAAAA TGCATCAACA CTATTTGGAT CCAAGTGAGC 8580 

TGATAAGAGA GATTTGACCT CTTCTGTACT TACCTGGTTG TTTAGGTTGG TGTATGAAGC 8640 

TTTCCATGGA ACTTTCGCTG AACTGCTTTG CCTTTGATTC GTCCCCTCAG AAGTAGCATG 8700 

TTGTTGTTGA CAAGCAGCCA AGCCTAAAAA CAAGGCTGAA CAGATTCCTA ATGTGGCTAA 8760

TTTTCTTGAT TTCTTCATTT CTTTCTCCTA AATGTCTTGG ATTAAAGTTT CTTTAACTAT 8820 

TGCTTTACAG ATATTGATTA CTTTCTCATT TAATGTGTTC ATCGTCTTTC CTCCGG 8876

(2) INFORMATION FOR SEQ ID NO: 171: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14736 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 171: 
CGCAAACTTT CGCGGTCGGA AGGTAGTTTT ATGACACGAT TTGAGATACG AGATGATTTC 60
TATCTCGATG GAAAATCATT TAAGATTTTA TCTGGTGCCA TTCATTATTT TAGGGTTCCT 120
CCAGAGGATT GGTATCATTC GCTCTATAAC TTGAAGGCTC TTGGTTTTAA TACGGTAGAG 180
ACTTATGTTG CTTGGAATTT ACACGAGCCT TGTGAAGGTG AGTTTCATTT TGAAGGTGAT 240

CTGGATTTAG AGAAATTTCT CCAAATAGCG CAGGATTTGG GTCTCTACGC AATTGTGCGT 300

CCGTCTCCAT TTATCTGTGC GGAATGGGAA TTCGGTGGCT TACCAGCTTG GCTCTTGACC 360 

AAGAACATGC GAATTCGCTC ATCCGACCCA GCATATATCG AGGCAGTTGG TCGCTACTAT 420 

GATCAGTTAT TGCCAAGACT GGTGCCTCGT TTGTTGGACA ATGGTGGCAA TATTCTCATG 480 

ATGCAGGTTG AAAATGAGTA TGGTTCTTAC GGAGAAGATA AGGCTTACCT GAGAGCGATT 540 

CGACAGCTAA TGGAAGAGTG TGGCGTAACC TGTCCCCTCT TTACATCAGA TGGTCCATGG 600 

CGAGCTACTC TGAAAGCTGG AACCTTAATT GAAGAGGACC TCTTTGTAAC AGGAAACTTT 660 

GGTTCTAAGG CACCTTACAA CTTTTCGCAG ATGCAGGAAT TCTTTGATGA ACATGGTAAG 720 

AAATGGCCAC TCATGTGTAT GGAGTTCTGG GATGGTTGGT TCAATCGCTG GAAAGAACCG 780 

ATTATCACAC GGGATCCTAA GGAATTGGCA GATGCAGTTC GAGAGGTTTT GGAACAAGGC 840 

TCTATCAATC TTTACATGTT CCACGGTGGT ACAAACTTTG GTTTCATGAA TGGTTGCTCA 900 

GCTCGAGGAA CTTTGGACCT GCCACAAGTT ACGTCTTATG ATTACGATGC CCTTCTGGAT 960

GAAGAAGGAA ATCCAACTGC TAAATATCTT GCAGTCAAGA AGATGATGGC AACACATTTT 1020 

TCAGAGTATC CGCAGTTGGA ACCACTCTAC AAAGAGAGTA TGGAGTTGGA TGCTATTCCA 1080 

CTAGTTGAAA AAGTTTCTTT GTTTGAAACC TTAGATAGCT TGTCAAGTCC TGTAGAAAGT 1140 

CTCTATCCTC AAAAGATGGA GGAGCTGGGA CAAAGTTATG GCTACCTACT TTATCGAACA 1200 

GAAACAAACT GGGATGCAGA AGAAGAAAGA CTTCGTATCA TTGATGGTCG AGATAGGGCC 1260

CAGCTGTATG TCGATGGTCA GTGGGTTAAA ACTCAATATC AGACAGAGAT TGGGGAAGAT 1320 

ATTTTTTATC AAGGTAAAAA GAAAGGGCTA TCTAGGTTAG ATATCTTGAT AGAAAATATG 1380 

GGGCGTGTCA ACTATGGGCA TAAGTTCTTA GCGGATACGC AACGTAAGGG AATTCGGACA 1440 

GGGGTCTGTA AGGATCTGCA TTTCTTACTA AACTGGAAAC ACTATCCACT CCCACTAGAC 1500 

AATCCTGAGA AAATTGATTT TTCAAAAGGA TGGACTCAAG GACAACCAGC CTrTTACGCT 1560 

TATGACTTTA CAGTCGAAGA GCCAAAAGAT ACTTACCTAG ACTTGTCTGA GTTTGGTAAG 1620

GGGGTTGCCT TTGTCAATGG GCAGAATCTA GGACGTTTTT GGAACGTTGG CCCAACTCTC 1680 

TCACTTTATA TCCCTCATAG CTATCTCAAG GAAGGTGCCA ACCGCATCAT TATCTTTGAA 1740 

ACAGAAGGTC AATATAAAGA AGAGATTCAT TTAACTCGTA AACCTACACT AAAACATATA 1800 

AAGGGGGAAA ACTTATGACA ATTGTAGGAT GCCGTATTGA TGGACGTTTG ATCCACGGAC 1860 

AAGTAGCCAA TCTTTGGGCT GGAAAACTAA ATGTTTCACG CATTATGGTT GTAGACGACG 1920 

AAGTTGTCAA CAACGATATT GAAAAGAGTG GTTTGAAACT TGCGACACCA CCAGGTGTGA 1980 
AATTGAGTAT TTTGCCAGTT GAGAAAGCTG CAGCCAATAT TCTTGGTGGC AAATACGATA 2040
GCCAACGTCT CTTTATCGTG GCTCGTAAAC CAGACCGCTT CCTTGGTTTG GTAGAAGCAG 2100 
GTGTACCACT TGAAACCCTT AATGTTGGGA ATATGTCTCA AACACCAGAA ACTCGTTCTA 2160 

TTACACGTTC TATCAACGTA GTAGACAAGG ATGTGGAAGA CTTCCACAAA CTGGCAGAAA 2220 

AAGGTGTTAA ACTTACTGCT CAGATGGTTC CAAATGATCC AATTTCAGAC TTTTTGAGCT 2280 

TATTAAAATA GGAAAAAAAT TTTTAGGAGG TCATTGTTAT GATACAATGG TGGCAAATTT 2340 

TACTTCTCAC TTTGTACTCA GCTTATCAAA TCTGTGATGA GTTGACGATC GTTTCATCTG 2400 

CAGGTTCCCC TGTATTTGCT GGTTTCATTA CTGGTTTAAT CATGGGAGAT GTGACTACTG 2460 

GTTTACTTAT CGGTGGTAAC TTGCAACTGT TCGTTCTTGG GGTTGGTACC TTCGGTGGTG 2520 

CTTCTCGTAT CGACGCAACT TCTGGTGCGG TTCTTGCGAC ACCTTCTCTG TTTCACAAGG 2580 

AATTGATGCA CCGCTTGCCA TTACTACAAT CGCTGTACCA GTAGCAGCTC TCTTGACTTA 2640

CTTCGACGTT CTTGGTCGTA TGACTACTAC CTTCTTCGCT CACCGTGTGG ATGCTGCAAT 2700 

CGAACGCTTT GACTATAAAG GTATTGAACG CAACTACTTG CTTGGTGCGA TTCCGTGGGC 2760

TCTATCTCGT GCCCTTCCAG TCTTCTTTGC CCTTGCTTTT GGTGGTGCCT TTGTACAATC 2820

AGTAGTAGAC TTCGTTGAAG CCTACAAATG GGTTGCAGAT GGCTTGACAC TTGCAGGACG 2880 

TATGCTTCCA GGTCTTGGAT TTGCAATCTT GCTTCGTTAC CTTCCAGTTA AACGTAACCT 2940 

TCACTACCTT GCTATGGGAT TTGGTTTGAC AGCTATGTTG ACTGTTCTTT ACTCATATGT 3000

AACAGGTCTT GGTGGCGCTG TTGCTGGTAT CGTAGGTACT CTTCCTGCTG AAGTTGCTGA 3060 

AAAAATTGGT TTCGTGAACA ACTTCAAAGG TTTGTCTATG ATTGGTATTT CTATCGTAGG 3120 

TATTTTCCTT GCAGTGCTTC ACTTCAAAAA TAGCCAAAAA GTAGCTGTAG CAGCACCTTC 3180 

TACACCATCA GAAAGTGGGG AAATCGAAGA TGACGAATTC TAATTACAAA CTTACAAAAG 3240 

AAGATTTTAA TCAAATCAAC AAACGTAGCT TGTTTACTTT CCAATTAGGT TGGAACTACG 3300 

AACGTATGCA AGCTTCTGGT TACCTTTACA TGATCTTGCC TCAGTTGCGT AAAATGTATG 3360

GTGATGGAAC TCCTGAATTG AAAGAAATGA TGAAAGTTCA TACTCAATTC TTCAATACTT 3420 

CACCATTCTT CCATACCATT ATCGCTGGTT TTGACCTTGC CATGGAAGAA AAAGATGGTG 3480

TAGGTTCAAA AGACGCCGTT AACGGTATCA AGACAGGTTT GATGGGACCA TTCGCTCCTC 3540 

TTGGGGATAC AATCTTTGGT TCACTTGTAC CTGCTATCAT GGGGTCAGTC GCAGCAACTA 3600 

TGGCTATCGC TGGCCAACCT TGGGGGATCT TCCTTTGGAT TGCAGTTGCA GTAGCGTATG 3660 

ACATCTTCCG TTGGAAACAG TTGGAATTTG CTTACAAAGA AGGGGTTAAC CTTATCAACA 3720 
ACATGCAAAG TACCTTGACA GCTTTGATTG ACGCTGCATC TGTACTTGGT GTCTTCATGA 3780

TGGGTGCTCT TGTAGCAACA GTGATTAACT TTGAAATTTC TTACAAGTTG CCAATCGGTG 3840 

AAAAGATGAT TGATTTCCAA GACATCTTGA ACCAAATCTT CCCACGTTTG CTTCCAGCAA 3900 

TCTTTACTGC CTTTATCTTC TGGTTGCTTG GTAAGAAAGG TATGAACTCT ACTAAAGCTA 3960 

TCGGTATTAT TATCGTACTT GCTTTGGCTC TTTCTGCCCT TGGTCACTTT GCACTTGGAA 4020 

TGTAATTCCT TATGACTAAA TCATTAATTT TGGTGAGCCA TGGTCGCTTC TGTGAGGAGC 4080 

TTAGAGGTAG CACAGAAATG ATTATGGGCC CACAAGACAA CATTTACACA GTAGCTCTTC 4140 

TTCCAGAAGA TGGCCCAGAA GAATTTACTG CTAAATTTGA AGCTGTTATT GAAGGATTGG 4200 

ATGATTTCCT AGTCTTTGCG GATCTTCTCG GTGGGACACC TTGTAATGTG GTGAGTCGCT 4260 

TGATCATGGA AGGTCGTGAT ATTGACCTTT ACGCAGGGAT GAATCTTCCA ATGGTGATTG 4320 

AATTTATCAA TGCGAGCCTT ACAGGCGCAG ATGCGGACTA CAAGAGCCGT GCTGCAGAAA 4380 

GCATTGTGAA AGTTAATGAC CTGTTAGCGG GCTTCGATGA TGACGAAGAT GAATAATACT 4440 

CTTCGAAAAT CTCTTCAAAC TACGTCAACG TCGCCTTGCC GTAGgTATAT GTTACTGACT 4500 

TCGTCAGTCT TATCCGGCAA CCTCAAAACG GTGTTTTGAG CTGACTTCGT CAGTCTTATC 4560

CGGCAACCTC AAAGCAGTGC TTTGAGCAGC CTGCGGCTAG TTTCCTACAG ATTTTAGTTG 4620

GAACTCGATT CAATTCATGT GACAACGTGA AAATCGTTAG AGCATTTTAT ATAGAATATA 4680 

CATGGGAATG TAGCTTACTC CCATTCCCAT ATTTAATAGA AAAAGAtiGAA CTCAATGCTA 4740 

CATTATACAA AAGAAGACTT GCTCGAATTG GGTGCAGAAA TCACTACGCG TGAAATCTAC 4800 

CAACAGCCTG ATGTATGGAG AGAAGCTTTT GAATTTTATC AAGCAAAACG TGAAGAAATT 4860 

GCAGCCTTCC TACAAGAAAT CGCTGATAAA CATGACTATA TTAAGGTTAT CTTGACAGGT 4920 

GCTGGGACTT CTGCTTATGT GGGAGATACC TTGCTACCTT ATTTTAAGGA AGTCTATGAC 4980 

GAACGCAAAT GGAATTTCAA TGCTATTGCG ACAACAGATA TCGTTGCCAA TCCAGCAACC 5040

TATTTGAAAA AAGATGTGGC AACTGTCCTT GTGTCTTTTG CTCGTAGTGG GAATTCGCCT 5100 

GAAAGTTTGG CGACTGTTGA TTTGGCCAAA TCCTTGGTGG ATGAGCTTTA TCAAGTGACG 5160 

ATTACTTGTG CAGCAGATGG TAAATTGGCT CTTCAAGCTC ACGGTGATGA TCGTAATCTC 5220 

TTGCTCTTGC AACCAGCTGT CTCTAATGAT GCTGGATTTG CCATGACTTC TAGCTTTACG 5280 

TCTATGATGT TGACAACTCT CTTGGTCTTT GATCCTACAG AATTTGCTGT TAAGTCTGAA 5340 

CGTTTTGAAG TTGTATCTAG TCTTGCCCGT AAAGTTTTAG ACAAGGCAGA AGATGTCAAA 5400 

GAGCTCGTTG ATTTAGACTT TAACCGTGTC ATCTATCTAG GCGCTGGTCC TTTCTTTGGA 5460 

CTTGCTCATG AAGCTCAGCT CAAGATTTTG GAATTAACTG CTGGTCAAGT TGCGACCATG 5520 
TATGAAAGCC CAGTTGGCTT CCGTCACGGT CCAAAATCTC TTATCAACGA CAATACAGTT 5580
GTTTTGGTCT TTGGTACAAC GACAGACTAC ACTCGTAAGT ACGACTTGGA CTTGGTTCGT 5640 
GAAGTTGCTG GTGACCAGAT TGCTCGTCGT GTTGTGCTTT TGAGTGATCA AGCTTTTGGT 5700 
CTTGAAAATG TCAAAGAAGT GGCCCTTGGT TGTGGCGGTG TCTTGAATGA TATTTACCGT 5760

GTCTTCCCTT ACATCGTTTA TGCCCAACTC TTTGCTTTAT TGACTTCACT CAAGGTAGAA 5820 
AATAAACCAG ATACACCGTC TCCTACAGGT ACAGTAAACC GTGTAGTACA AGGTGTCATA 5880

ATTCACGAAT ATCAAAAGTA AGACAGTGTT TATGAATTCT TGACAAGAGG ATTTGTAAAT 5940 
TATCAGATAA ACCATAGATT GTCAGTACGC TTTCTATGGT TTGTTTGCTT GAGAGAAATA 6000 
GTAAAAGGAG AACAGAATGA AAGCATACAC AGAGCGTGTA TTTGGAAATG TTGAGGGTGA 6060 
GGATGTCTTG GCCTATCGAT TTGAGACAGA CGGTGGCTAC CAACTTGAGG TTATGACTTA 6120 
TGGTGCGACT ATCTTGCGCT ATGTCGCACC TGACAAGGCT GGAAATTTTG CCAATGTTAT 6180 
CTTGGGATTT GATGACTTTG ATAGTTATGT AGGCAATAGT CCCAAGCATG GAGCAAGTGT 6240 
AGGTCCTGTA GCGGGTCGTA TTGCAGGTGC GACCTTTGAG CTCAATGGTA AGACCTATGA 6300

CCTTGAGGTT AATAATGCTA GCAACTGTAA TCACAGTGGT TCAACTGGTT GGGATTCCAG 6360 
CTTGTTTGAA GTTGAAGAAG TAAGCGATCA TGGCTTGACT CTCTACACAG AGCGTACAGA 6420 

TGGGACAGGA GGGTTCCCTG GAAATCTCAA GATTTGGATC AGTTATCACT TGGAAGAAAC 6480 

TGGTGCCTAT GAAATCAGCT ACAAGGTAAC GACCGATCAG GATACGCTGG TCAATCCAAC 6540 

CAACCACAGC TATTTCAACT TGTCTGGTGA TTTCACGCAG ACGATTGACC GTCATGTCTT 6600 

CCAACTAAAC ACAGAGGGCA TTTACTCAAT CGCTCCTGAC GGTGTTCCTG CCAAAACTCC 6660 

AGAAGCCAAC CGTGATGTGG TCAAACACGT CTACAATGGT ACCTTGTTGA AGGATATCTT 6720

TGCAGAAGAA GATGAGCAAA TCCAGCTGGC ATCAGGTTTG GATCATCCAT TTGCCCTTCC 6780 

TGCAGGCCAT GACAATGCTG GATTCCTTTA TGACCAAAAT TCAGGTCGCT TCCTGCTTTT 6840 

CAAGACAGAA GCTCCTTGCT TTGTGGTCTA CACAGCAAAC TTTGTGGATG AAAGTGTCAT 6900 

CATAGGAGGT CAGCCAATGC TACAGCACAA TGGGATTGCT CTTGAAGCGC AAGCTTTACC 6960 

AGATGCCATT CACAGTGACC TTAAAGGCCA AGTCATTCTT AAAGCTGGTC AAACCTTCAC 7020 

CAGTAAGACA CGTTATGAAC TTGTTGTGAA GTAAAAGAGT CATTGCGCCT ACTTTTGGGA 7080 

GCTAGGAATA GGTACGCAGA GACAAATAGT AGGAAAATAT GATATAACTA AGCGTTGAAA 7140 

GCTATCTGTT AATATAATAT TCAAACTACA ATAAGGAGTA AGAAAGAAAC GAAGAAAATT 7200

GTATTTGCTA GTGCCTTGGC TTTGACCTTG GCTGGAGCAG TTTTGACAAA TGATGTTTTT 7260
GCGAACGACA GACTTGTGGC AACACAAACT ACTGATGGTA AAAATGAAAA TGTATTGACC 7320
TCAGAGGTGC TAAAACCTTC TAGTGGCAAT GTTTTGGTTG GAATCAAAGG AGAATTTGTG 7380 
GCTCCTCATC AACAATCTAT TTTGGATGCC ATCAATGCTA TCTGTAAAGA AGCGGCTGAC 7440 
GAAGGTTTGG TAGATAAGTA TGTCCCTATC AAATGATCAA CTGACCTAGA AAAGGCAGCT 7500 
TTTGCCAGAG CTACAGAAGC ATCTATAACC ATGGATCATA CCCGTCTTTC TAGCAAAGAT 7560 
CTTTGGAGTG CCTTTCCAAC TTCTAATAGT ATAATGGGAG AAAATTTGGC ATGGAATCAT 7620 
GACGGTTTTC TAAAAGCTAT TGAACAATGG CGTGCTGAAA AAGCAGATTA TGTGGAGAAA 7680 
AAAATAGTGG TTCAGACAAC GGGAAATCTG GTCACTATGA GTCGCTAATT AACCCTAAAT 7740 
TTACACACAT GGGGATGGCA GCTTTTAAAA ATCCTAACAA TCAATACAAA GCTATTACAA 7800 
rrGCTCAAAC TCTAGGTGAT GATGCTTCTT CAGAGGAATT GGCTGGTAGA TATGGTTCTG 7860 
CTGTTCAGTG TACAGAAGTG ACTGCCTCAA ACCTTTCAAC AGTTAAAACT AAAGCTACGG 7920

TTGTAGAAAA ACCACTGAAA GATTTTAGAG CGTCTACGTC TGATCAGTCT GGTTGGGTGG 7980

AATCTAATGG TAAATGGTAT TTCTATGAGT CTGGTGATGT GAAGACAGGT TGGGTGAAAA 8040 
CAGATGGTAA ATGGTACTAT TTGAATGACT TAGGTGTCAT GCAGACTGGA TTTGTAAAAT 8100 

TTTCTGGTAG CTGGTATTAC TTGAGCAATT CAGGTGCTAT GTTTACAGGC TGGGGAACAG 8160 

ATGGTAGCAG ATGGTTCTAC TTTGACGGCT CAGGAGCTAT GAAGACAGGC TGGTACAAGG 8220 

AAAATGGCAC TTGGTATTAC CTTGACGAAG CAGGTATCAT GAAGACAGGT TGCTTTAAAG 8280

TCGGACCACA CTGGTACTAT GCCTACGGTT CAGGAGCTTT GGCTGTGAGC ACAACAACAC 8340 

CAGATGGTTA CCGTGTAAAT GGTAATGGTG AATGGGTAAA CTAGGCTCAG GCCATAGGTA 8400 

AAGCATTCAT CTTACTTAGC AAAAAGAATG AACGATAAGA AAGAGGTTGA TGGCGAACAT 8460

TGGCCTCTTT TGATTTATAA AGATTGGATT CTTGTCGCCT CAATTTCAGA CTTTTCTATT 8520 

GTAAGCTAAT ATTTTATAGC CCATTAAAAG CATAAGCGGT AATCTAATTT AAAAAATGCT 8580

GTAATTAGTC TGAAGTCCAC ACTTACTTGT TGAGATGTTA TCTCTGTTTT TTATCGTTA? 8640

AATTTACTGT ATTTTTTATA GTATGCAGAA TATTTTTAAG TATATTTCAA TAGAAATTTC 8700 

TATCGATTTA TTGTATAATG ATAAGTAATT GTTGAAAAGT ACTCAGAAAA TTCCATACTA 8760 

TATTATTTTT ATGTTTATAC TTTTATGCTA TAAAATATAG ATTGATATAA AGAATATAGA 8820 

AAAAGCGAGG TTAATATGAG CCGAAAAAGC ATTGGTGAGA AACGCCATAG TTTCTCGATG 8880 

AGAAAGTTGT CAGTGGGATT GGTATCAGTT ACTGTATCTA GTTTCTTTTT GATGAGTCAA 8940 

GGGATTCAAT CGGTATCGGC CGATAATATG GAAAGTCCAA TTCATTATAA GTATATGACC 9000 

GAGGGTAAAT TGACAGACGA GGAAAAATCC TTGCTGGTAG AGGCCCTTCC ACAACTGGCT 9060 
GAAGAATCAG ATGATACTTA TTACTTGGTT TATAGATCTC AACAGTTTTT ACCGAATACA 9120

GGTTTTAACC CAACTGTTGG TACTTTCCTT TTTACTGCAG GATTGAGCTT GTTAGTTTTA 9180 

TTGGTTTCTA AAAGGGAAAA TGGAAAGAAA CGACTTGTTC ATTTTCTGCT GTTGACTAGC 9240 

ATGGGAGTTC AATTGTTGCC GGCCAGTGCT TTTGGGTTGA CCAGCCAGAT TTTATCTGCC 9300 

TATAATAGTC AGCTTTCTAT CGGAGTCGGG GAACATTTAC CAGAGCCTCT GAAAATCGAA 9360

GGTTATCAAT ATATTGGTTA TATCAAAACT AAGAAACAGG ATAATACAGA GCTTTCAAGG 9420

ACAGTTGATG GGAAATACTC TGCTCAAAGA GATAGTCAAC CAAACTCTAC AAAAACATCA 9480

GATGTAGTTC ATTCAGCTGA TTTAGAATGG AACCAAGGAC AGGGGAAGGT TAGTTTACAA 9540 

GGTGAAGCAT CAGGGGATGA TGGACTTTCA GAAAAATCTT CTATAGCAGC AGACAATCTA 9600 

TCTTCTAATG ATTCATTCGC AAGTCAAGTT GAGCAGAATC CGGATCACAA AGGAGAATCT 9660 

GTAGTTCGAC CAACAGTGCC AGAACAAGGA AATCCTGTGT CTGCTACAAC GGTGCAGAGT 9720 

GCGGAAGAGG AAGTATTGGC GACGACAAAT GATCGACCAG AGTATAAACT TCCATTGGAA 9780 

ACCAAAGGCA CGCAAGAACC CGGTCATGAG GGTGAAGCCG CAGTCCGTGA AGACTTACCA 9840 

GTCTACACTA AGCCACTAGA AACCAAAGGT ACACAAGGAC CCGGACATGA AGGTGAAGCT 9900 

GCAGTTCGCG AGGAAGAACC AGCTTACACA GAACCGTTAG CAACGAAAGG CACGCAAGAG 9960 

CCAGGTCATG AGGGCAAAGC TACAGTCCGC GAAGAGACTC TAGAGTACAC GGAACCGGTA 10020 

GeUACAAAAG GCACACAAGA ACCCGAACAT GAGGGCGAAg CGGCAGTAGA AGAAGAACTT 10080 

CCGGCTTTAG AGGTCACTAC ACGAAATAGA ACGGAAATCC AGAATATTCC TTATACAACA 10140

GAAGAAATTC AGGATCCAAC ACTTCTGAAA AATCGTCGTA AGATTGAACG ACAAGGGCAA 10200 

GCAGGGACAC GTACAATTCA ATATGAAGAC TACATCGTAA ATGGTAATGT CGTAGAAACT 10260

AAAGAAGTGT CACGAACTGA AGTAGCTCCG GTCAACGAAG TCGTTAAAGT AGGAACACTT 10320 

GTGAAAGTTA AACCTACAGT AGAAATTACA AACTTAACAA AAGTTGAGAA CAAAAAATCT 10380 

ATAACTGTAA GTTATAACTT AATAGACACT ACCTCAGCAT ATGTTTCTGC AAAAACGCAA 10440 

GTTTTCCATG GAGACAAGCT AGTTAAAGAG GTGGATATAG AAAATCCTGC CAAAGAGCAA 10500 

GTAATATCAG GTTTAGATTA CTACACACCG TATACAGTTA AAACACACCT AACTTATAAT 10560 

TTGGGTGAAA ATAATGAGGA AAATACTGAA ACATCAACTC AAGATTTCCA ATTAGAGTAT 10620 

AAGAAAATAG AGATTAAAGA TATTGATTCA GTAGAATTAT ACGGTAAAGA AAATGATCGT 10680 

TATCGTAGAT ATTTAAGTCT AAGTGAAGCG CCGACTGATA CGGCTAAATA CTTTGTAAAA 10740 

GTGAAATCAG ATCGCTTCAA AGAAATGTAC CTACCTGTAA AATCTATTAC AGAAAATACG 10800 
GATGGAACGT ATAAAGTGAC GGTAGCCGTT GATCAACTTG TCGAAGAAGG TACAGACGGT 10860
TACAAAGATG ATTACACATT TACTGTAGCT AAATCTAAAG CAGAGCAACC AGGAGTTTAC 10920
ACATCCTTTA AACAGCTGGT AACAGCCATG CAAAGCAATC TGTCTGGTGT CTATACATTG 10980 
GCTTCAGATA TGACCGCAGA TGAGGTGAGC TTAGGCGATA AGCAGACAAG TTATCTCACA 11040 
GGTGCATTTA CAGGGAGCTT GATCGGTTCT GATGGAACAA AATCGTATGC CATTTATGAT 11100 
TTGAAGAAAC CATTATTTGA TACATTAAAT GGTGCTACAG TTAGAGATTT GGATATTAAA 11160 

ACTGTTTCTG CTGATAGTAA AGAAAATGTC GCAGCGCTGG CGAAGGCAGC GAATAGCGCG 11220 

AATATTAATA ATGTTGCAGT AGAAGGAAAA ATCTCAGGTG CGAAATCTGT TGCGGGATTA 11280 

GTAGCGAGCG CAACAAATAC AGTGATAGAA AACAGCTCGT TTACAGGGAA ACTTATCGCA 11340 

AATCACCAGG ACAGTAATAA AAATGATACT GGAGGAATAG TAGGTAATAT AACAGGAAAT 11400 

AGTTCGAGAG TTAATAAAGT TAGGGTAGAT GCCTTAATCT CTACTAATGC ACGCAATAAT 11460

AACCAAACAG CTGGAGGGAT AGTAGGTAGA TTAGAAAATG GTGCATTGAT ATCTAATTCG 11520 

GTTGCTACTG GAGAAATACG AAATGGTCAA GGATATTCTA GAGTCGGAGG AATAGTAGGA 11580 

TCTACGTGGC AAAACGGTCG AGTAAATAAT GTTGTGAGTA ACGTAGATGT TGGAGATGGT 11640 

TATGTTATCA CCGGTGATCA ATACGCAGCA GCAGATGTGA AAAATGCAAG TACATCAGTT 11700 

GATAATAGAA AAGCAGACAG ATTCGCTACA AAATTATCAA AAGACCAAAT AGACGCGAAA 11760 

GTTGCTGATT ATGGAATCAC AGTAACTCTT GATGATACTG GGCAAGATTT AAa_^CGT;^J^T 11820

CTAAGAGAAG TTGATTATAC AAGACTAAAT AAAGCAGAAG CTGAAAGAAA AGTAGCTTAT 11880 

AGCAACATAG AAAAACTGAT GCCATTCTAC AATAAAGACC TAGTAGTTCA CTATGGTAAC 11940 

AAAGTAGCGA CAACAGATAA ACTTTACACT ACAGAATTGT TAGATGTTGT GCCGATGAAA 12000 

GATGATGAAG TAGTAACGGA TATTAATAAT AAGAAAAATT CAATAAATAA AGTTATGTTA 12060

CATTTCAAAG ATAATACAGT AGAATACCTA GATGTAACAT TCAAAGAAAA CTTCATAAAC 12120

AGTCAAGTAA TCGAATACAA TGTTACAGGA AAAGAATATA TATTCACACC AGAAGCATTT 12180 

GTTTCAGACT ATACAGCGAT AACGAATAAC GTACTAAGCG ACTTGCAAAA TGTAACACTT 12240

AACTCAGAAG CTACTAAAAA AGTACTAGGA GCAGCGAATG ATGCAGCCTT AGATAACCTA 12300 

TACTTAGATA GACAATTTGA AGAAGTTAAA GCTAATATAG CAGAACACCT AAGAAAAGTA 12360

TTAGCGATGG ATAAATCAAT CAATACTACA GGAGACGGTG TAGTTGAATA CGTAAGTGAG 12420 

AAAATCAAAA ATAACAAAGA AGCATTTATG CTAGGTCTTA CTTATATGAA CCGTTGGTAC 12480 

GATATTAATT ATGGTAAAAT GAATACAAAA GATTTATCTA CGTACAAGTT TGACTTTAAC 12540 

GGAAATAATG AGACTTCAAC GTTGGATACT ATTGTCGCAT TAGGAAATAG TGGACTAGAT 12600 
AACCTGAGAG
GAAGATTCAG 
AATAACGAGT 
GCAGAAGTAC 
TACGATAGAA 
TTACCTGAAG 
GAAAGATATC 
GTAAGAAATA 
AATCTTCTTG 
TTCAATGTAA 
GGCTCTATTA 
GGAGCGTATG 
GGAACGTCGG 
GGAAATGGTA 
GTAGATAGTG 
GATTTGAATA 
CAAAGtI'ATA 
GCGATATTAG 
TACGTTCGTG 
ACAGATGAAG 
AATAGACGTA 
ATGTTCTCTC 
TTTAGAAAAA 
TATGTTTCTA 
TGGCATGGAA 
GGTGAGTACT 
CAAGATAATC 
GAAGTAACTA 
GATATTACTA 



CTTCAAATAC 
TCTTTGACTT 
GGTTTAAAGA 
GAGAAAAACA 
TATCAGCACC 
AATCTGTGTA 
GTGATAGTGT 
GAGTTGATAT 
ACAGTGCTTC 
AAGATGAGAC 
AAGAATTCTT 
CGyAtGGAAG 
TTTATACTCA 
GACGTGAAGG 
TAAATTCTCA 
GATTGCATAC 
TGCATGGATC 
CTCAAAATAA 
ATACTAGACA 
AAGTAGCTAA 
GCTATGATGA 
CTGTATACGC 
TAGCTTATGA 
ATCAGTACGG 
GAGATGTTGC 
CATCATGGGC 
TGAAACCAAT 
TAACAACGGC 
ATATAGATCG 



TGTAGGTTTA 
CGTAGAAGCG 
AAATACAAAG 
AGAATCACCA 
AAGTTGGGGG 
TATTTCATCG 
GGATGGAGTT 
AGCAGCGAAA 
AAAAGAAAAA 
AGGAAGAACT 
CGGACCTGTT 
TTTAACGCAC 
TGAAATGGTT 
ATTGGGAGCG 
TATTTTAGCT 
ATATAATCCG 
ATATGATGTA 
TGATGTTAAG 
TAATAAAGAT 
CTTAACATCG 
TAGTAGAGAA 
AGCGCTAAGC 
ATTACTTGCG 
AGCAGAAGCA 
TTTAGTGACA 
TGATTTCAAA 
AACAATTCAA 
TGCACAAATG 
TGCAACGAGT 



1093 
TATGCGAATA 
TATAGAAAAC 
GCATATATAG 
ACAGCCGATA 
CATAAGAGTA 
AATATGTCTA 
ATTCTTTCAG 
AGGCATAGAG 
CTTTTCCGTT 
TATTGGGCAA 
GGGAAATGGT 
TTTGTGTTAG 
CATAATTCTG 
GAGTTATACG 
TTAAATACGT 
GTGGAACGTT 
ATGTATACAC 
AAAAAATGGT 
ACACATGCAG 
TTAAACTCAT 
TATAAACGAA 
AATTCGAAAG 
GAAAAAGGTT 
TTTGCCAGCG 
GATGATTTAG 
AAAGCAATGT 
TACGAATTAG 
CAACAATTAA 
CATACCCCAG 



AACTTGCATC 
TGTTCTTACC 
TCGAAATGAA 
GAAAATATTC 
TGTTATTACC 
CACTTGCATT 
GAGATGCTTT 
ACCATTATGA 
CTGTGATAGT 
GGTTAACGGA 
ATGAGTATAA 
ATAGATTATT 
ATTCTGCAAT 
CACTTGGTTT 
TATATAAAGC 
TCGATTCGGA 
TTGATGCGAT 
TTAGAAAAAT 
GAAATAAAGT 
TAATCGACAA 
ATGGCTACTA 
GTGCTCCTGG 
ATCACAAAGG 
GAAGCAAAAC 
TATTTAAGAA 
TTAAACAACG 
GTAATCCTAA 
TTAATGAAGC 
CAAGTTGGGT 



GGTAAAAGGA 
AAACAAAACA 
GTCTGATATT 
ATTAGGAGTT 
ACTACTAACT 
CGGTTCGTAT 
ACGAACTTAT 
TATTTGGTAC 
TTATGATGGA 
TAAAAACATC 
TAGTAGTGCA 
AGATGCTTAT 
CTACTTTGAA 
ACTGCAATCT 
AGAAAAAGAT 
TGAGGCGCTT 
GGAAGCAAAA 
AGAAAATTAT 
CCGTCCATTA 
CGACATCATA 
TACTATAAGT 
AGATATTATG 
ATTCCTACCT 
ATTCTCATCA 
AGTATTCAAT 
TATAGATAAA 
TAGTACAAAA 
GGCTGCGAAA 
GCATTTATTA 



12660 

12720 

12780 

12840 

12900 

12960 

13020 

13080 

13140 

13200 

13260 

13320 

13380 

13440 

13500 

13560 

13620 

13680 

13740 

13800 

13860 

13920 

13980 

14040 

14100 

14160 

14220 

14280 

14340 
ACTTTAGAAA TTCTATATAT AAACAAAAAA TCTATAATGC ATATCTTCGC ACTACAGATG 14400

GTAAGGATGA GGAGTCAGAT AAATAAGATT GTAGAGTTTC ATTGTTGAGT AGTGTTTCTT 14460

TTGATTGAAT GCAGATTGCA GACAAATCGA CTCCTTTTTC TTATGGATCG ATGTAGAGAT 14520

TGGATTTCAA TCCCACAGAA GGAATCATCT TCAACTCATC AACGACCAAT GGTGACAAGG 14580

AAATACAAAA CAATCCTAGA AATGTTGATT TGAGAAATAA CTTTGCTAGT CTAGTAAAAT 14640

CAAGTTGTCT GGCTTGACTT AGATTTTTTC TGGGATTGTT TTTTGCTGAG TGGGATGCTT 14700

TCTTGAGGGA AGTTATATAA TAGTTGTAAT AATTAG 14736

(2) INFORMATION FOR SEQ ID NO: 172:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 11770 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS : double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 172:

ACAGGAAAGC ACGATAGCAA TCTCTTTGGA AGATTTAAAA AATATTCCTC AAAGTTTCGC 60 

TGTTGCTTAC GGTGATACGA AAGTATCTTC GATTCTCTCT GTCTTGCGTG CTAATTTAGT 120 

AAATCATTTG ATTACAGACA AAAATACAAT TTTAAAAGTT TTGGAAGAAG ATGGGGATTT 180 

GACTTTTAGA GAGATTCTAG GTGAGTaVL\ 7iTGATAGACT GATTCAGttt ATCGTTTTTC 240 

TTTTTAGTTG ATTGCACATT TGTGCTTATA TAAACAAAAA TAGTTTATCT GTTGTTTTTG 300 

GATTGACAAC TTTATTATGT AGTTGTATTC TATAGTTACA AAAGAAAATT TTAAAATTTC 360

AAATGAAAAA AGCTTTTTAC ATAGTGAAAT GAGGAGGAAT TTATGGAAAT GATTGTTCCA 420 

GATCAAATTA TCATGGGTTT AATTTTATAT GCTGGTGATG CGAAACAACA TATTTATAAA 480 

GCGTTAGATT ACATAAAAAA TGGTACATGT GAACGGTGTG AAGAAGAAAT ACAGTTAGCT 540 

GATGCAGCCT TATTAGAAGC TCATAATCTA CAAACAAAAT TTTTGGCACA GGAAGCGTCT 600 

GGTACAAAGA CAGAAATTAC AGCTCTCTTT GTTCATTCAC AAGATCATCT CATGACCAGT 660 

ATGACGGAGA TTAATTTAAT CAAAGAAATT ATTAGTTTGA GAAAAGAACT TCATAAAAAA 720 

TAATACTAGA GTATTATCAT TGTTATTAAC ATAGAGGAGG AAAACATAAT GGTGAAGATT 780 

GGTTTGTTTT GTGCAGCAGG TTTTTCTACT GGTATGCTTG TAAATAATAT GAAAATTGCA 840 

GCGCAATCTA GTGGAGTTGA GGCAGAAATA GAGGCGTTTT CTCAGTCTAA ATTAGCGGAT 900 

TATGCGCCAA ATATAGATGT TGCACTATTG GGTCCACAAG TTGCTTATAC ATTAGATAAA 960

TCAAAAGAAA TTTGTGATAA GTGTGATGTT CCGATAGCTG TTATTCCGAT GATGGACTAT 1020 
GGTATGTTAG ATGGGAAAAA AGTATTAGAT TTGGCCCTAT CTTTGATTAG TGGGTAAGAA 1080

AAGGAGATTT ATTATGTCAA AGATGGATGT TCAGAAAATC ATTGCACCGA TGATGAAGTT 1140 

TGTGAATATG CGTGGCATTA TAGCTCTAAA AGATGGGATG TTAGCAATTT TGCCATTGAC 1200 

AGTAGTTGGT AGTTTGTTCT TGATTATGGG ACAATTGCCG TTCGAAGGAT TAAATAAGAG 1260

CATTGCTAGT GTTTTTGGAG CTAATTGGAC AGAGCCGTTT ATGCAAGTAT ATTCAGGAAC 1320 

TTTTGCTATT ATGGGTCTAA TTTCTTGTTT TTCAATTGCC TATTCTTATG CTAAGAATAG 1380

CGGAGTAGAG GCTTTACCAG CTGGAGTTCT ATCTGTATCT GCATTCTTTA TTTTGCTAAG 1440 

ATCATCTTAT ATCCCTAAAC AAGGTGAGGC GATTGGGGAC GCTATTACTA AAGTTTGGTT 1500

TGGAGGCCAA GGAATTATCG GTGCTATCAT TATAGGTTTG GTAGTAGGAA GTATTTATAC 1560 

CTTCTTTATA AAGAGAAAAA TTGTTATTAA GATGCCAGAA CAAGTTCCAC AAGCTATTGC 1620 

CAAACAGTTT GAAGCAATGA TTCCAGCATT TGTAATTTTC TTATCTTCTA TGATTGTATA 1680 

TATTTTAGCG AAGTCATTGA CTAArGGCGG AACATTCATA GAAATGATTT ATTCTGCTAT 1740 

TCAAGTTCCG TTGCAAGGTT TAACTGGATC TTTGTATGGT GCTATTGGAA TTGCATTCTT 1800 

TATATCATTT TTGTGGTGGT TTGGTGTTCA TGGGCAATCG GTAGTAAATG GAGTAGTGAC 1860 

AGCTCTGCTT TTATCTAATC TTGATGCTAA TAAAGCTATG TTAGCCTCTG CTAATCTATC 1920 

ATTAGAAAAT GGTGCACATA TTGTTACTCA ACAATTTTTA GATTCATTTT TAATTCTATC 1980 

AGGTTCAGGG ATTACGTTTG GTCTTGTAGT TGCCATGCTT TTTGCAGCAA AATCAAAACA 2040 

ATACCAAGCC TTAGGAAAAG TTGCAGCTTT TCCAGCAATA TTTAACGTAA ATGAGCCAGT 2100 

TGTATTTGGA TTTCCGATTG TCATGAATCC AGTTATGTTT GTACCTTTCA TTCTTGTTCC 2160 

TGTACTTGCA GCTGTGATAG TATATGGAGC TATTGCAACA GGTTTCATGC AGCCATTCTC 2220 

AGGGGTAACA TTGCCTTGGA GTACACCAGC TATTTTATCA GGATTTTTGG TGGGTGGATG 2280 

GCAAGGAGTT ATTACTCAGC TGGTGATATT AGCGATGTCT ACATTGGTTT ATTTTCCATT 2340

CTTTAAAGTA CAGGATCGTT TAGCTTACCA AAATGAAATC AAACAATCTT AGAGGTATTT 2400 

GTGTGTTACT GTTAAACTCA CACATTTGTG CTAAAAATTA GAGAGTTAAA ATTTTTCTAG 2460

TTAAAAGCTT GAAAATTTCT ATAAAAATCG GTATTATATT TTCGAAAGAA ATAAAAATAT 2520 

TTTCGAAAGA AAGGTGCTTA CGATGGTAAA TACAGAAGTA GCAAGAACAA CAATCAAGAC 2580 

AGAATATTTT GGCAGCCTTA CTGAAAGGAT GAACAAATAT CGAGAAGATG TTTTAAATAA 2640 

AAAACCTTAT ATTGATGCTG AGAGAGCAGT TCTAGCAACA CGCGCCTATG AACGATACAA 2700 

GGAACAACCT AATGTCCTAA AACGTGCATA TATGCTGAAA GAAATTTTGG AAAATATGAC 2760 
TATCTATATT GAAGAAGAAT CTATGATTGC GGGAAATCAA GCTTCTTCCA ATAAAGATGC 2820
TCCTATTTTT CCGGAATATA CGCTAGAATT TGTTCTCAAT GAGTTGGATC TTTTTGAAAA 2880 
GCGTGATGGA GATGTTTTCT ATATTACAGA AGAAACAAAA GAACAACTTA GAAGTATTGC 2940 
TCCGTTTTGG GAAAATAATA ATTTACGTGC TAGAGCTGGT GCCTTATTAC CTGAAGAAGT 3000

GTCTGTTTAT ATGGAAACAG GATTCTTCGG TATGGAAGGT AAGATGAATT CTGGAGATGC 3060 

TCACTTAGCA GTTAACTATC AGAAACTTTT GCAATTTGGT TTAAGAGGTT TTGAAGAGCG 3120

GGCTCGTAAA GCAAAAGTAG CTCTAGATTT AACAGATCCA GCAAGTATTG ATAAATATCA 3180 

TTTTTACGAC TCTATATTTA TCGTAATCGA TGCTATTAAA GTATATGCAA AGCGCTTTGT 3240 

TGCTCTTGCT AAAAGTTTAG CCGAAAATGC AAATCCTAAA CGTAAGAAAG AATTACTTGA 3300 

GATTGCAGAT ATTTGCTCTA GAGTCCCATA TGAACCGGCA ACTACTTTTG CAGAAGCTAT 3360

TCAATCAGTT TGGTTTATTC AATGTATTTT ACAAATTGAA TCTAATGGCC ACTCTCTTTC 3420 

ATATGGCCGT TTTGATCAAT ATATGTATCC ATATATGAAG GCTGATTTAG AAAGTGGTAA 3480

AGAAACAGAA GATAGCATTG TTGAACGTCT GACAAATCTT TGGATTAAGA CAATTACAAT 3540 

TAATAAGGTT CGCAGTCAAT CACATACATT TTCTTCAGCA GGAAGTCCTT TATATCAAAA 3600

TGTTACAATT GGTGGACAGA CTCGAGATAA GAAGGATGCT GTTAACCCAT TATCTTATTT 3660

GGTATTAAAA TCAGTTGCAC AAACCCATCT ACCGCAACCT AATCTAACTG TACGTTACCA 3720 

TGCAGGTTTA GATGCTCGTT TCATGAATGA GTRTATTGAA GTGATGAA?.C TTGGTTTTGG 3780

TATGCCTGCA TTTAATAATG ATGAGATTAT TATTCCTTCT TTTATTGCAA AAGGAGTATT 3840 

GGAAGATGAT GCTTATGATT ACAGTGCCAT TGGATGTGTT GAAACGGCAG TTCCAGGGAA 3900 

ATGGGGCTAT CGTTGCACAG GTATGAGTTA TATGAACTTC CCTAAGGTTC TACTTATCAC 3960 

GATGAATGAT GGAATTGATC CGGCTTCGGG TAAACGGTTT GCACCAAGCT TTGGTCGTTT 4020 

TAAGGATATG AAGAACTTTT CTGAATTAGA AAATGCTTGG GATAAAACAC TAAGATATTT 4080 

GACACGAATG AGTGTTATTG TTGAAAATTC TATTGATTTA TCATTGGAAC GAGAAGTTCC 4140 

TGATATTCTA TGTTCAGCAT TGACTGATGA TTGTATTGGT CGTGGAAAAC ACCTTAAAGA 4200 

AGGTGGAGCA GTATATGATT ATATATCAGG ATTGCAAGTT GGAATTGCAA ATTTGTCGGA 4260

TTCATTAGCT GCAATTAAAA AATTGGTGTT TGAGGAAGAA CGTATAAGCC CAAGTCAGCT 4320 

TTGGCATGCA CTGGAAACAG ATTATGCCGG AGAAGAAGGT AAGGTCATTC AAGAAATGTT 4380

GATTCATGAT GCACCTAAGT ATGGTAATGA TGATGATTAT GCTGACAAAT TGGTTACTGC 4440 

TGCTTATGAC ATTTATGTTG ATGAAATTGC TAAATATCCT AATACACGTT ATGGAAGAGG 4500 

GCCTATTGGA GGAATTCGTT ATTCAGGAAC ATCTTCTATC TCAGCCAACG TAGGGCAGGG 4560 
ACGTGGAACA TTAGCAACTC CAGATGGACG CAACGCGGGT ACACCGTTAG CAGAGGGTTG 4620

TTCACCATCA CATAATATGG ATCAACACGG CCCTACATCT GTTTTAAAAT CTGTTTCAAA 4680

ATTACCAACA GATGAAATCG TAGGTGGGGT TCTCTTAAAT CAGAAAGTAA ATCCTCAAAC 4740 

GTTAGCCAAA GAAGAAGATA AATTAAAACT AATTGCTTTG TTACGAACAT TCTTTAATCG 4800 

TTTACATGGG TACCATATTC AATACAATGT TGTTTCCAGA GAGACGCTGA TTGACGCTCA 4860 

GAAACATCCT GAAAAACACA GAGACTTAAT TGTTCGTGTT GCAGGATACT CTGCATTCTT 4920 

CAATGTTCTT TCTAAGGCAA CCCAAGATGA CATTATAGGA CGTACTGAGC ATACTTTGTA 4980 

AAATAAAGAG GTTCTTTTTA TGGAATTTAT GCTTGACACA TTAAATTTAG ATGAGATTAA 5040 

AAAGTGGTCT GAAATTTTGC CGCTAGCTGG GGTAACTTCA AATCCCACTA TTGCAAAAAG 5100 

AGAGGGTTCT ATTAATTTTT TTGAACGAAT CAAAGATGTA AGAGAATTGA TTGGCTCTAC 5160 

ACCCTCTATT CATGTTCAGG TGATTTCTCA AGATTTTGAA GGCATCTTAA AGGATGCTCA 5220 

TAAAATTCGA AGACAAGCAG GAGATGATAT ATTTATCAAA GTACCTGTTA CTCCAGCTGG 5280 

ATTACGTGCA ATAAAGGCGC TAAAAAAAGA GGGCTACCAT ATCACTGCAA CAGCTATTTA 5340 

TACAGTTATT CAGGGATTAT TAGCTATCGA AGCAGGAGCG GATTACCTAG CTCCATATTA 5400 

TAATAGAATG GAAAATCTGA ACATTGATTC AAATTCTGTC ATTCGTCAAT TAGCTCTTGC 5460

TATTGATAGA CAGAACTCTC CTAGTAAGAT TTTAGCTGCA TCCTTTAAAA ATGTAGCACA 5520

AGtAAataaT GCTTTAGCTG CAGGTGCGCA TGCTGTTACA GCAGGAGCGG ATGTTTTTGA 5580 

ATCAGCTTTC GCCATGCCAT CTATCCAAAA GGCGGTTGAT GATTTTTCTG ACGATTGGTT 5640

TGTTATTCAA AATAGTCGTT CCATTTAGAT AGAGAGGAAA TACATATGAG AATTTTTGCT 5700

AGTCCTTCTA GATATATTCA GGGGGAAAAT GCCTTGTTTG AAAATGCCAA ATCAATTTTG 5760 

GATTTGGGAA ATTGCCCTAT TCTATTATGC GATCAGTTGG TTTATGATAT TGTTGGAAAA 5820 

CGATTTGAAG ATTACCTACA TAGGTATGGT TTCCATATTG TTCTGGCGCT ATTTAATGGT 5880 

GAAGCTTCTG ACAATGAAAT CAATCGAGTT GTTGCCTTGG CTGAGAAAGA AAATTGTGAT 5940 

AGTATTATCG GTCTTGGTGG GGGAAAGACG ATTGATAGCG CAAAAGCTAT TGCAGATTTG 6000 

ATTGAAAAGC CTGTTATTAT TGCTCCAACA ATTGCATCGA CCGACGCACC TGTATCTGCT 6060 

TTATCTGTTA TTTATACAGA TGAAGGTGCA TTTGATCATT ATCTATTTTA TTCTAAAAAT 6120 

CCAGATTTAG TTTTGGTTGA TACAAAAGTT ATTTCACAAG CCCCTAAGCG TTTATTAGCG 6180 

TCTGGTATTG CAGATGGTTT AGCAACTTGG GTTGAGGCGC GTGCGGTTAT GCAGGCAAAT 6240 

GGAAAAACTA TGTTGGGACA ACAGCAAACA TTGGCTGGAG TTGCAATTGC GAAGAAATGT 6300 
GAAGAAACGC TGTTTGCAGA TGGTTTACAG GCTATGGCAG CTTGTGAAGC TAAAGTGGTG 6360

ACACCAGCAT TAGAAAATAT TGTTGAAGCT AATACTTTAT TGAGTGGTCT AGGTTTTGAA 6420 
AGTGGAGGAT TAGCTGCGGC GCATGCAATT CATAATGGTT TTACTGCATT GACAGGTGAC 6480 
ATTCATCATT TAACACATGG TGAAAAAGTA GCTTATGGAA CTTTAGTACA ACTATTATTG 6540 
GAAAATAGAC CTAAAGAAGA ACTTGATAAG TATATTGAGT TTTACAAAAA AATTGGTATG 6600 
CCAACAACTC TAAAAGAAAT GCATTTGGAT CAAGTTGGAT ATGATGATTT AATAAAAGTT 6660 
GGTAAACAAG CAACTATGGA GGGTGAGACA ATTCATCAGA TGCCGTTTAA GATTTCGCCT 6720 
TCAGATGTTG CTCAAGCTAT TATCGCTGTA GATGCCTATG TAAATTCAAA ATAAACAATA 6780 
AGGACTACTG TTTTCCAAAT GGTAGTCTTT TATTGATCCC TGTATTGAAT TCTATAGAAG 6840 
ATTGAAATAG GATGAGAACA AATCGATTGG GAAAGTAAAA TTAATTTCTA TAAATGTTTT 6900 
AGCAATTGTT TCGTACTATT TCAGATTCAG TCTACTATAT GTTCTTCATA AATCAAAAAG 6960 

CGACATAGGT TGTCGGCTAT TTATTGTGAA TACATTAATT AGCATTCCAG TTTTATCTTC 7020 

GGTCTAAAAT AAGTATTTTG TGCTATACGA GATAAGCTTC TTGACTTACT CCTTGATTTA 7080 

CTGCATAACA ATGGGATAAA AAGTGGGAGA TAGAGCAATT CATAGTCATC AAAATTAATG 7140 

AGATACAGTA TACAGTTTTT CCTTTAAACA CATTTCAAAT TCCCTCAAAA ATGGTATAAT 7200 

AGTAACATCA CAAAATTGGA GAGAGACCAT GAGTTTTTAC AATCATAAAG AAATTGAGCC 7260

TAAGTGGCAG GGCTACTGGG CAGAACATCA TACATTTAAG ACAGGAACAC ATACATCAAA 7320 

ACCTAAGTTT TATGCGCTTG ATATGTTCCC TTATCCGTCT GGAGCTGGTC TGCACGTAGG 7380

ACACCCAGAA GGTTATACTG CAACCGATAT CCTCAGTCGT TACAAACGTG CGCAAGGCTA 7440 

CAATGTCCTT CACCCAATGG GTTGGGATGC TTTTGGTTTG CCTGCAGAGC AATACGCTAT 7500 

GGATACTGGT AATGACCCAG CAGAATTTAC AGCGGAAAAC ATTGCCAACT TCAAACGTCA 7560 

AATTAATGCG CTTGGATTTT CTTATGACTG GGATCGTGAA GTCAACACAA CAGATCCAAA 7620 

CTACTACAAG TGGACTCAAT GGATTTTCAC CAAGCTTTAC GAAAAAGGCT TGGCCTATGA 7680 

AGCTGAAGTG CCAGTAAACT GGGTTGAGGA ATTGGGAACT GCCATTGCCA ATGAAGAAGT 7740 

GCTTCCTGAC GGAACTTCTG AGCGTGGAGG CTATCCAGTT GTCCGCAAAC CAATGCGCCA 7800 

ATGGATGCTC AAAATCACGG CTTACGCAGA GCGCTTGCTC AATGACTTAG ATGAACTAGA 7860 

TTGGTCAGAG TCTATCAAGG ATATGCAACG CAACTGGATT GGTAAATCAA CTGGTGCCAA 7920 

TGTAACTTTC AAAGTAAAAG GAACAGACAA GGAATTTACA GTCTTTACTA CTCGTCCGGA 7980 

CACACTTTTC GGTGCGACTT TCACTGTCTT GGCTCCTGAA CATGAATTAG TAGACGCTAT 8040

CACAAGTTCA GAGCAAGCAG AAGCTGTAGC AGACTATAAA CACCAAGCCA GCCTTAAGTC 8100 
TGACTTGGCT CGTACAGACC TTGCTAAAGA AAAAACAGGG GTTTGGACTG GTGCTTATGC 8160

CATCAACCCT GTCAATGGTA AGGAAATGCC AATCTGGATT GCAGACTATG TCCTTGCTAG 8220 

TTATGGAACA GGTGCGGTTA TGGCTGTGCC TGCCCACGAC CAACGTGACT GGGAATTTGC 8280 

CAAACAATTT GACCTTCCAA TCGTCGAAGT ACTTGAAGGT GGAAATGTCG AAGAAGCTGC 8340 

CTACACAGAG GATGGCCTGC ATGTCAATTC AGACTTCCTA GATGGATTGA ACAAAGAAGA 8400 

CGCTATTGCC AAGATTGTGG CTTGGTTGGA AGAAAAAGGC TGTGGTCAGG AGAAGGTTAC 8460 

CTACCGTCTC CGCGACTGGC TCTTTAGCCG TCAACGTTAC TGGGGTGAGC CAATTCCAAT 8520 

CATTCATTGG GAAGATGGAA CTTCAACAGC TGTTCCTGAA ACTGAATTGC CGCTTGTCTT 8580 

GCCTGTAACC AAGGATATCC GTCCTTCAGG TACTGGTGAA AGTCCACTAG CTAACTTGAC 8640 

AGATTGGCTT GAAGTGACTC GTGAAGATGG TGTCAAAGGT CGTCGTGAAA CCAACACTAT 8700 

GCCACAATGG GCTGGTTCAA GCTGGTACTA CCTCCGCTAT ATTGACCCGC ACAATACTGA 8760 

GAAATTGGCT GATGAGGACC TCCTCAAACA ATGGTTGCCA GTAGATATCT ACGTGGGTGG 8820 

TGCGGAACAT GCTGTACTTC ACTTGCTTTA TGCTCGTTTC TGGCATAAAT TCCTCTATGA 8880 

CCTCGGTGTT GTTCCGACTA AGGAACCATT CCAAAAACTC TTTAACCAAG GGATGATTTT 8940 

GGGAACAAGC TACCGTGACC ACCGTGGTGC TCTTGTGGCA ACCGACAAGG TTGAAAAACG 9000 

TGATGGTTCC TTCTTCCATG TAGAAACAGG GGAAGAGTTG GAGCAAGCGC CAGCCAAGAT 9060 

GTCTAAATCG CTCAAGAACG TTGTTAACCC AGACGATGTG GTGGAACAAT ACGGTGCCGA 9120 

TACCCTTCGT GTTTATGAAA TGTTTATGGG ACCACTCGAT GCTTCGATTG CTTGGTCAGA 9180 

AGAAGGTTTG GAAGGAAGCC GTAAGTTCCT TGACCGAGTT TACCGTTTGA TTACAAGTAA 9240 

AGAAATCCTT GCGGAAAACA ATGGTGCTCT TGACAAGGTT TACAACGAAA CAGTCAAAGC 9300

TGTTACTGAG CAAATTGAGT CTCTCAAATT CAACACAGCT ATTGCCCAAC TTATGGTCTT 9360 

TGTCAATGCT GCTAACAAGG AAGATAAGCT TTATGTTGAC TATGCCAAAG GCTTTATTCA 9420 

ATTGATTGCA CCATTTGCAC CTCACTTGGC AGAAGAACTC TGGCAAACAG TCGCAGAAAC 9480

AGGTGAGTCA ATCTCTTATG TAGCTTGGCC AACTTGGGAC GAAAGCAAAT TGGTTGAAGA 9540 

TGAAATTGAA ATTGTCGTCC AAATCAAAGG AAAAGTTCGT GCCAAACTCA TGGTTGCTAA 9600 

AGATCTATCA CGTGAAGAAT TACAAGAAAT CGCTTTAGCT GATGAAAAAG TCAAAGCAGA 9660 

AATTGACGGT AAGGAAATCG TGAAAGTAAT TGCGGTACCG AATAAACTCG TTAATATCGT 9720 

CGTTAAATAA CGAGTTTATT AGCTCTATCT GCCACCTTCA ATAGTCCACT GGACTATTGA 9780 

AsCCAACTAA ATTAGTTAAC ATTGTTGTGA AATAAGATAG GAGTCCTTCA GAGTAGAATC 9840 
TGGAGGATTT TTTGAATCTT CTTATGAAAG TATGATATAC TATGGGCAAC TATAAAGTTT 9900
GAAAAGTGAA ATAAGGAGAA TAAGATGCCA GTAAATGAAT ATGGTCAAAT GATTGGGGAG 9960 
TCAATGGAAG CTTATACTCC AGGTGAATTG CCTTCTTTTG ATTTCTTAGA AGGGCGTTAT 10020 
GCTAGGATAG AGGCTCTTTC AGTGGAAAAG CATGCGGAGG ATTTATTAGC TGTTTATGGC 10080 
CCTGATACGC CTCGGGAGAT GTGGACCTAC CTCTTTCAGG AGTCAGTAGC AGACATGGAG 10140
GAACTGGTCA GCCTTTTAAA TCAGATGTTG GCTCGTAAGG ACCGTTTTTA TTATGCAATC 10200
ATAGACAAGG CAACTGGTAA GGCTTTGGGA ACTTTTTCCC TCATGCGAAT TGATCAGAAT 10260 

AACCGAGTAA TAGAAGTGGG AGCTGTCACT TTTTCTCCAG AGCTCAGGGG GACACGGATA 10320 

GGAACAGAAG CCCAGTATCT CTTGGCTTGC TATGTCTTTG AGGAGCTTAA CTATCGTCGC 10380 

TATGAGTGGA AATGCGATGC TCTTAACCTG CCATCCAGAC GAGCAGCGGA ACGTTTGGGA 10440 

TTTATTTATG AAGGAACCTT CCGTCAGGCA GTGGTTTATA AGGGGCGTAC AAGAGATACG 10500 

GATTGGTTGT CTATGATTGA TAAGGACTGG CCTCAAGTCA AAGCTCGATT GGAAATATGG 10560 

TTGCGTCCTG AAAACTTTGA TAAAAATGGA CGACAGCACA AGAGCTTGAG AGAACTTTAA 10620

GAGGTGTTGA GATGATTACT ATTAAAAAGC AAGAAATTGT CAAGCTAGAG GATGTTTTGC 10680 

ATCTCTATCA GGCTGTCGGT TGGACAAACT ATACCCATCA AACAGAGATG CTGGAGCAGG 10740 

CCTTATCTCA TTCATTAGTA ATTTATCTGG CACTTGATGG TGATGCTGTG GTGGGCTTGA 10800 

TTCGTTTGGT TGGAGATGGT TTTTCATrAG TTTTTGTACA GGATTTGATT GttttGCCTA 10860 

GCTATCAGCG TCAAGGGATT GGTAGCTCCT TGATGAAAGA GGCTTTAGGA AATTTTAAAG 10920 

AGGCCTATCA AGTCCAGCTG GCGACAGAAG AGACAGAAAA AAACGTGGGA TTTTATCGTT 10980 

CTATGGGCTT TGAAATCTTA TCCACCTATG ACTGTACAGG AATGATTTGG ATAAACAGAG 11040 

AAAAATAAAA AAACTTGTTT GTTCTTAAGC AAAGTTTAAG GATGGTCTAG TATCATATAG 11100 

TCATTAAATA AAGACCTCCT AACTTTATTT AATAAAATCC TAAACTTTTT TCATCACAAT 11160 

CTCCTAATGA AGCCACCCAA TCAGGTGGCT TTTTTGCGGT ACGACGGGCA TGTCGTATAT 11220 

CTGAGGTGTA AGTCCTCAGC CTGACTATCG TGAGGTAGCA GGGAGAGGAA GGGATAGCGA 11280

AATCGTGGCT CTACGAACAG GAACGTGATA GTAAGGCGTA TATAGCGGAT AAGGAGGCTT 11340

CAAACTCTAA AGTCCAAAAA GGTAGTCGTA ACCTATATGT GTAAATCACG AGAGTAATTG 11400 

AATTCGGACT AAGGTTTGTG TGAAAAAGAT AAATCTTTCT AGAGTCTAAA GACTCTGCGT 11460 

CAGATTTCCT ATTTTCACTG TAACCTTTTA ACGTCCTCAT ATCTTGTATA AACGAGGAAA 11520 

GATGTACGAC TTATCCCGTG AGGTTTCATG AGCGCTGAAA GCGTAGTAAC AACGAATCAT 11580

GAGAAGTCAG CCGAGCCCAT AGTAGTGAGG AAACTTCCGT AATGGAAGTG GAGCGAAGGG 11640 
GTGAATACTC AAACAGTCTG GGGAGAGACT GTTTGAGGTC TGTCGCTAGA AAGAGAAAAC 11700 
GACAGATCGA AGTAATCCTA CTTCACTTGT GTCTGTAAAA TGAGTGGTCT GATAGAACTG 11760 
GACTTTGAGG 11770

(2) INFORMATION FOR SEQ ID NO: 173: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4185 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 173: 

CGCGAAACTA CTTTCTTAGT ATAACACTTT CAGAATCATT GTCAATAGAA ATGACTTGAT 60 

TTTTTCAATT TTTTCAAGCT ATTTCCAAGG GTTGTAAAAT CGTCCCTGAT TCTGCAAGAT 120 

AAGTAGTAAA CTAACTACTA AAAACAAGGT TGCCAAGAGC AAGGTAATAT AGTCTCCTTT 180 

TTTCAAGGCC TGATAACTAT ACCATGTGCG TTTTTTCTCT TTCCCAAAGC GGCGAACTCC 240 

ATGGCAGTCG CAATGGTATC AATGCGTTCT AGCGAGCTAA AAATCAAGGG CGTAATAATG 300 

AGCAGATTGC CTTTGATTCG TTGCATAAGA GAAGCTTTCT TGGATAATTC CATCCCACGC 360 

GCCTCCTGAG ACATCTTGAT AGTAAAGAAT TCTTCCTGCA AATCTGGAAT ATAGCGCAAG 420 

GTCAGGCTGA CAGAATAAGC AATCTTATAG GGCACACCAA TTTGATTTAA ACTGGAAGCA 480

AACTGACTAG GATGGGTTGT CATCAAAAAG ATAATAGCCA GAGGAATGGT GCAAAGATAC 540

TTAATGGCCA AATTTAGCAG ATAAAAGAGC TCCTGGCTGG TTAGAGTGTA GACACCGATT 600 

CCCTGCCAAA TCACACTTCT CTCTCCATAA AGTCCAACCC CATACTCGGG AGAAAAGAGA 660 

TAGACCATCA AAACGTTTAA AACGGCAAAT ATCGTCGCAA AAACGGCTAC AAAGGAAACA 720

TCTTTAAAGC GAATTTCTGA TAAATAGAGG AGAAAGACTG AAAAGATGGC AATCAGCAAG 780 

AGCATTCTGG TATCATAGCT AATCATGGCC GCCAATGATA CCAGAATGAA AAAGAGAAGT 840 

TTCCCAGCTC CTGACAAGCG ATGAATCACA GTATCTCTAT GCTGGTAACC GATTAATTTA 900

GCTTGCATCC CTCTCTCCTT TCTTTGTAAA ATGCCGTTAA ATCCAGTGGA TCCACATCTA 960 

GTTTCTTAGC CAAGTTAAAG ATGGAGGTTT CTTTTAGATT GGCTTTTACT AACAGCTCAG 1020 

GATCGCTCAA CAGACTGGCT GGAACAGTAT CGGCAATCAA TTCTCCATCC ACCATGACAA 1080 

GGACCCGGTC TGAATAATCC AGCATCAATT GCATATCATG GGTAATCATG ACAATGGTAT 1140 

GCCCTTTTTG ATGTAACTCT TCGAGAAATT CCATAATCTC AGTATAGTTC TTCTGATCTT 1200 
GACCTGCAGT CGGTTCATCT AGGAGAATAA TTTCAGCTCC TAAGACCAAA ATTGAAGCAA 1260

TGGTGACACG TTTTTTCTGA CCAAATGACA GGGCAGAAAT AGGCCAATTA CGGAATTCAT 1320 

AAAGTCCACA GATTTTCAAG GTTTCATATA CTCTCGTTTC AATTTCCTTC TCATCCACAC 1380 

CTCGCAAACG GAGCCCTAGA GCCACCTCAT CAAAAATCAT ATTGGTTGAA ATCATTTGAT 1440 

TAGGATTTTG TAGCACATAT CCTACTCGTT CCGCCCGCTC TGCAACAGAA TCGCCTTTTA 1500 

TATCCTGTTT TTCCCAAAGA TAGCGTCCTT CCGTCTGAAT AAAGCTACTT ATAGCCTTGG 1560 

CTAGAGTTGA TTTCCCTGCT CCATTTTTTC CGACAATAGC AATCTTTTCA CCCTTTTTAA 1620 

TATCTAAATG TAGGGATTTT AAAATCGGTC TATCATCATA AGAAAAAGAT ACTTCCTCTA 1680 

GTCTAAAGAG TGACTGCAAT GCTGGGGTTT CTTTTGCCAG TTCATTCTGC AACTGAACCT 1740 

GACCTTTTGA GATAGACAAG TTATCCAGAT TCGCTAATTG TTCTTCCTTG ACTAAGTCCA 1800 

CACCTAATTG ACGGAGAGTC GTTAGATAAA GGGGTTCTCG AATTCCATTT TGAGTCAATA 1860

AATCAGTCGC AAGCAACTGG TCAGGGCTCC CATTAAAAAG GATACGACCA TCGTTTATCA 1920 

AGACAATCCG ATCCACAGGG CGATGCAGAA CGTCCTCCAA ACGGTGCTCG ATAATAAGAG 1980 

TCGTCGTCCC CTCTTCCTTA TGAATCTGGT CAATCAATTC GATAATATCC TGACCTGACT 2040 

TGGGATCTAG ATTGGCGAGT GGCTCATCAA ACAAGAGAAT CGGACTTTCA TCAATCAAGA 2100 

CACCAGCCAG ACTGACTCGC TGCTTTTGTC CACCTGACAA ATCCTGAGGA CGCTGATCCA 2160 

GTAAAGGAAG AAGGTCCAGC TTTTCAGCCC ATTTATAAAC ACGACCTTTC ATCTCATCTA 2220 

GGGCTGTCAC ATCATTTTCC AGAGCAAACG CCAAATCTTC TGCCACAGAC AAGCCAATAA 2280 

ACTGCCCATC TGTATCCTGC AAAACTGTGC TAACCAGATG AGACTTATCA TAGATGCTCA 2340 

TATCAAAGGC TACTTGACCC TTTATCAAAA ATTCTCCATA TGTCTGACCC TTGTAAATAT 2400

TGGGAATAAT CCCATTCAAA CACTGACCCA AGGTAGATTT ACCTGACCCA GATGGTCCAA 2460 

CAATTAAGAC TTTCTCTCCC TTGTAAATGG TCAAGTCTAT CCCTTGCAAG GTCGGTTCTT 2520 

GTTGTGTTTC ATACCGGAAA GAGAAATCCT TCCACTCAAT TaTAGCTTCT TTCATCTIAC 2580 

TCTCTTCATT CGCTTCTTAG ACTTCTATTT TATCATAAAT CAAGCCCTTC TTGCAGTCTC 2640

TCCTCTTAAA ATCTTAGCGC CAAAAAGATT CCTATCCTAG CTTACTTGCC TAACTAATCT 2700 

ATAAACATCG AAAAAGACTA GTTGCCCAGC CTTCCCCATC ATTTTATACT CTTCGAAAAT 2760 

CTCTTCAAAC CACGTCAGcT TCGCCTTGCC GTAGGTATGG TTACTGACTt CGTCAGTTTC 2820 

ATCTACAACC TCAAAACCAT GTTTTGAGCc TGCTTCGTCA GTTCTATCCA CAATCTCAAA 2880 

ACACTGTTTT GAGCAACtGC GGCTAGCTTC CTAGTTTGCT CTTTGATTTT CATTGAGTAT 2940 

TAGTCCTTTT TCAAACTTCC TGCACGAGTT TGGGTTCCTG CATAGGCAAG TAAGAGAAGA 3000
GTTCCTGCAA TAGCTACAGA TACACCATTG GCAATTCCCG CAACAATCCC TTGTGCAAAT 3060

ACTTTTTCTG CCGCTTCTTG ATAAATCACA ACATCTCCAA GTGGTGCCAA GACACCCCAA 3120 

ACAAGGGCAT TTGCAAGTAG TTGAATGAGA TTAAAAATAA GAATATCTTT CCAGTCAAAA 3180 

ACACCATTGA TCACGCGAAC GTACTTTCTA AAAAGTCCCA CAACTAAACC AAAGAGTCCG 3240 

CTAGCGATAA TCCAAGTCCA CCATAGACCA TAACCAACAA GAGAGTCCTT GATTGCATGA 3300 

CCAATCAACC CGACAAGCAA ACCGATAATC GGTCCAAAAA TAATAGAAAG TAGCGCTTGT 3360 

ACCGCATACT GAAGCTGGAT GCTTGTATTT GGAACAGGGG TTGGAATGTT GATCATCCCG 3420 

ATGACGACAA AGAGGGCAGC GCCAATTCCG ACAGCAACAA CTTGTTTAAT TGTAAATTTG 3480 

ATTTCCATAC TATTCTCCTA TTTTATCCTT CTATTTTCTT TATTTCAATG GTCCAAGATG 3540 

AACCGACACC TACATTATAG GCCTTGGCAA AGGAACCTTG GTTGATAGCC AAACCTAAAC 3600

GATAGAGAGA GTTGATGTAA AGGATGGGTT GCCCAATTCT CACATCTGCA AATGATTTGC 3660

CATAGACAAC CTGATTTTGA TAGACCAGCA TATCAGCATG ATAGATGGTC ACTTCAAAAC 3720 

GATCACCAAA TTCTGGTTCC AGCTTGTAAA ATTCTTCCCG TGTGATAGAG GTCCAAAGCG 3780 

AACCGAAACG CACATCCAGA ATATCAATGG CTCCCTTCAC CAGATGATCT TCTATGATGG 3840 

TCGCTACGAC TGGAAGCTCT ACAATCTGTT CCACACTGAG CTCTGGCCCT ACTTCCTCAA 3900 

AAGTAATGTG ACCACTGGCC AGTTTAGCAC CAGTATAGGC ATAGACATCA CGACCGTGCA 3960

AGGTATAAGA ATGCTCTGTG TTTTGACGCC TATTGGCCAC CTCAGAAATC TCACGAATGG 4020 

CTACAATGCC AACGTGTTTC TTGATAAAGG AAAGCGTCCC ATTATCTGGC GTGACAATGT 4080 

ATTGATTTTT TGCAGTCTTG GCAACTACAC TCTTACGTTT CGAACCGACA CCTGGATCGA 4140 

CAACCGATAC AAACGTCGTT CCCTCAGGCC AGTAATCCAC CGTCT 4185 
(2) INFORMATION FOR SEQ ID NO: 174: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2069 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 174: 

TGATAGAGTT AAAGCCGCTG AGTCATTCAA TCCATCTCCA ACCATCAAAA TAGTGTGACC 60 

TGCTTTCTGC AGTTTCTCTA CTAACTCAAA TTTCCCATCA GGTTTCAAGT CTGTATAGAC 120 

CTGATCAAAG GGCAAATCTT TGACTAATTC CTCTGTCCTA ATCAAGGTGT CTCCTGTTGC 180 
CAGAATCAAT TTTTyCCCCT GTGCCTTAAG TTTATCCAAG GCTGTTTTTG CTTCTTTTCT 240

CAAAGGAGTA TGAATGCAGA ACATTCCAAT CAATTCATTT TGATAAGCCA AGAATAAGAG 300 

ATTGTAGTGA CTCrTGTACT CTTCAATTAA AGCATTTTGT TCTGAACTGA TATGAATCTG 360

CTCATCCTGC ATCAAGACAT AATTCCCAAT AAGAACTGGT TGGCCATCTA TATGAGATTT 420

GATCCCCTTG CTTGCGATAT ATTGGAGTTT CCCATGCATT TCCTCATGTT CAATTCCCTC 480 

TATCTCAGCT TGCTTGACGA TGGCATTAGC AATAGGATGA TAAATGTGTT CCTCAAGACA 540 

GGCACTGATT CTGAGAATAT CTTCCTCACT ATAGTCTCCA AAAGGTAACA CCTTTTCAAC 600

TATAGGATAA CTAGTTGTGA TTGTTCCTGT CTTATCAAAC AAGAAAGTAT CAACTTCCAG 660 

ATATTTCTCC AGAACATCTC CATCCTTAAT CACCATTTCA CGGTTCAACC CTTCCTTGAT 720 

AACTGTCAAA TAAGCTACAG GAGTAGAGAT TTTCAAAGCG CAGGAGAAAT CGACCAATAG 780 

GAAAGAAATA GCCTTAGAAA AAGAACCTGT CAATAGGTAA GTCAGCCCAG CCCCCAAGAA 840 

ATTATATTTG ACGACTTTAT CCGCCATCTT GATGAAATAG CGTTGTTTCG TTTTCTTGTT 900 

TTCTTCAGAT TTCTTCATCA ACTCAATCAG CTGTAAAATA CGGCTGTTCA TCTGATTATC 960 

TGTTACACGA ATGCGTAACT CTCCAGTTTC TAATACTGTA TTTGCACAAA CCAAATCAGA 1020 

CTCTCTTTTT TCAACTGGAA AACTCTCTCC TGTCAAGGAA CTTTCGTTGA CCATACCTAA 1080

ACCTGAAACT ACTTGTCCAT CAAACAGAAT TTCATTTCCT TGAGATAAGA TCAAGACATC 1140 

TCCTATT7GA ACATCGGAAC TCtT'GAtaCT AACAACCGTA TCGCCCTGTA CTAGGAATAC 1200 

ATCGCTCTCT TTTGCAAGAA GACTCTGTTC TAAATGTGTT GCAGTTTTTT TCAAGGACCA 1260 

CTGATCTAAA TGATTCCCCA AATCAAGCAT AAACATGATA TTGCTAGCTG TCTTGGATTG 1320 

GTTCATAAAC AAAGACAATA AAATAGCCGA ACAGTCCAAG ACTTCCATCG TTAGTyCCTT 1380 

ACGCGCTAGT GTTTGATAGG CTTCTCTAAT ATAACCCAAA GCCTGATAAC AAGTCCATAT 1440 

ATAGCGAATA GGATACGGCA CAAAACTACG AAAAAGTACA CGCTTAACCG CTGCACCTGA 1500 

AACAATAGAA TAAGCACTCT CTTCTCTACG AATGGGAAGA GTCATCAACT CAGAAACTTT 1560 

CCCTTTATCA ATTCTTTTTA AAAAGGCTTC TGCATTATCT AATACAGAAA AGCCTTCTTT 1620 

TATGCGTAGA GTAAAGTGCT GTTGATCCAT GTAAAACTGG ATAGACTCAA TCCCCTTTTC 1680 

ATCTCTCGCC AAGGAACGAA GATAGTCTTG AATATCCAAG GTAAGTGAAA AAGAAGATGA 1740 

TAGTCGGATA TGTTGGTATC CTCTATGTAG CACTTTAAAA GACATATTAT TCACCTATAA 1800 

GGCTATCTAA TTGCTCTTCT TTTTTCTCTT GCTCGTACAA ATATTTGGCA TCTTGCAAGA 1860 

CATCGTCTCC ATGTTGCTTC ACAACAGAAA CAGATGCATC TAGCTCGTCT TTCAACTTGT 1920 

AAGCCTTAGC CAAAGCTTTA GAATAACCTT TTTTAGCTTC CTTACTTGCT AAGATTTTCA 1980 
AACCAAGGGT ACCAAATGCG ACACCACCCA AAAATAATGA AGATTTTTTC GCAACTTTTG 2040
CAACGGTTAA TACTTCTTTT AACATAGGG 2069 
(2) INFORMATION FOR SEQ ID NO: 175: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4597 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 175: 

AAATCTTGCG CAATAAAGCT CATCTCCATC TCCCGATTGA AACAGTCACT CCCCGGACTG 60 

TTTCAACGTC CCAAGACATA ATCTTAGGCA GATTTCTAAA ATTACACTCA AAGTGGAAGT 120 

CATTGAGCTT TCGAATGACA GTTGAAGTTG AAATGGCCAG CTGATGGGCA ATATCGGTCA 180 

TAGAAATCTT TTCAATTAAC TTTTGCGCAA TCTTTTGGTT GATAATACGA GGAATTTGGT 240 

GATTTTTCTT GACGATAGAA GTTTCAGCGA CCATCATTTT CAAGCAATGA TAGCACTTAA 300 

AACGACGTTT TCTAAGGAGA ATTCTAGTAG GCATACCAGT CGTTTCAAGG TAAGGAATTT 360

TATAGGGTCT TTAATGTCTA GTAATTTTGT GATAAAATGT AATTGTTCCA TATGATTCTT 420

TCTAATGAGT TGTTTTGTCG CTTTTCATTA TAGATCTTAT GGGACTTTTT TTCTACCCAA 480 

AATAGGCTCC ATAATATCCA TAGGGAATTT ACCCACTACA AATATTATAG AGCCCAAAGT 540 

TTTAGGTCGC TTGATAATAT GCGTTTTTTG AATTTTATAG ACTGCTCGTT TAAACTCTAT 600 

TTACTTCGTA CCTTCTGGAG CGAGACGGAA TATTAGTCAC ATACAAAATG AGTACTATTA 660 

GGATTTTATT TTCATGTACA ATTTCAGCCA GTCTTCTTAT AATCAGCCTA TAGGAATCAA 720 

GGAGGTGACT CTTATGGCTG TTTTTGTGTC TTTGGATGGA ATTGTGGTAG AAGTCCTTGA 780 

TGTCTTTTCT TCTTTTAATG GGGATAGTGA GTTTTTCTTG TGTATAGCAT TTTGAATCTG 840

GAATAGGACG CCATGACTGC TAAAAGATTT CTATAAATTA ATTTGATTTT CCTAATCAAT 900 

TTGTTCATAT CTTATTTCAT TCCACTATAA ACGTCTTAAA GACAAGAGTC AGTTTGTTAT 960

GGAACGCTCT CAGTTCGAGG AGATGTTCCA ACTTCAAAGT AGTCGCTTGA CGACGCAAGA 1020

AAAATTACAA TTGTTTACCT CTGTGTTTGC TGGCCGTTAT GATGTTTATG CTAAGAATTT 1080 

TATCAATGAA CAAGGGAAAA TTCAGTATTT TCCTTCCTAT GATTATGGTT GGAAGCAGTT 1140 

GCCACCTGAA AAACGGAGTT TCCAGACATT GACGAACTCC GTTTTGAAAT CTCATTTTCG 1200 

TGGGGAGGCA GCTATCGGTA TCTTTCCTAT GCACTTAGAT GATAGCTGTT ATTTTTTGGT 1260
ACTGGATTTG GATGAAGGAG ATTGGAAAGA AGCTGGTTTA ACCATTCGAA GAATAGCCAG 1320

GGAACGCCAG ATGGAAGCCC ATTTAGAGAT TTCTCGTTCG GGTCACGGAC TCCATATTTG 1380

GTTCTTCTTT GAGGAAGCGA TTCCGAGTCG AGAGGCTCGC TTGTTTGGAA AGAAACTGAT 1440 

AGAACTGGCA ATGCAGGAAA GTATGCAACT GTCCTTTGAT TCTTTTGATC GCATGTTTCC 1500 

AAATCAGGAT GTCCTTCCTA AGGGGGGATT TGGAAATTTG ATTGCCTTGC CTTTTCAAGG 1560 

AGAAGCTTAC CATCAAGGGC GAACGGTCTT TGTGGATGAA CAGTTTCAGC CTTATGAAGA 1620 

CCAATGGAGG TATCTACAAG AAATTCAGAG GATTTCAACT GCTAAAGTGG CACTGTTAAT 1680 

CCAAGAGGAG TTAGGCAAGC AAGAATTGGA TAAGGAGTTG AAGGTCGTTT TATCCAATAT 1740 

GATCCAACTT GAAAAATCGT CTGTGACATC CAAGGCACTT TTTTCTTGAA AAATATGGCT 1800 

TCCTTTTCTA ATCCCGAATT TTATAGTAGA TTGAAACTAG AATAGTACAC CTCTGCTTCT 1860 

AAAACATTGT TAGAAATCGA TTTGACTTTC CTGATCGATT TGTCCTGTTA TTATTTCATT 1920 

TTACTATATT TAAAGCAGGC TATGCGACAG CCAACCTATC AAATTCCTGA GAGAATGTAT 1980 

TTATTTGGAG AATCCGATCA TTATTTATGG TTGCCAAGAG GTTTGCTGTA TCCATTGCAA 2040 

GATAAATTTA AGCAGGTATC TGTGGAAGAT AGGAGAAAGG TACAAAGGTC TATTAGCGTG 2100 

GAATTTAAGG GAGAACTCAC TTTTGAGCAA GAGTTAGCCC TGTCAGATAT GACTTCTAAA 2160 

GAAAATGGTT TACTTCATGC GGAGACTGGT TTTGGGAAGA CCGTTTTAGG TGCTGCTCTT 2220 

ATCTCTGAAC f^GAAAACAa_^ AAC^-\T?ATT CTAGTCCATA AtaaGCAACT CTTAGACCAA 2280 
TGGCTAGATC GCTTAAACTG CTTTTTGACT TTCGAAGAGG AGGAGGCTAT CCGTTATACG 2340

GCATCAGGTC GTGAAAAGGT AATCGGCTAT GTTGGGCAGT ACGGTGGGAC TAAGAAATGG 2400 

CTGAGTAAAC TGGTTGATGT CGTTATGATT CAATCTCTAT TTAAGTTGGA AAATAGTCAA 2460 

AGTCTTTTGG ATGAGTATGA GATGATGATT GTGGATGAGT GTCATCATGT CTCTGCCTTG 2520 

ATGTTTGAAA AAGTTGTTGC TCAGTTTAGA GGGAAGTATC TTTACGGTTT GACGGCTACG 2580 

CCTGAGCGTA AGAATGGTCA TGAGCCTATT GTTTTTCAGA GAATTGGTGA GATACTCCAT 2640 

ACTGCTGATA AGAGGGAAAC GGATTTTAAA CGGCAATTGC AATTAAGATT CACTTCTTTT 2700 

GGTCATTTGG AAATTGAAAA GACCAAAGCA AGTAATTTTA TACAGCTTAG TGATTGGATT 2760 

GCTACTGACT CAGTGAGGAA TCAGATGATT CTCAAGGATA TTCTAGCCCA AGTGGCAGAA 2820 

GGACGGAATA TCTTGGTTTT AGTTAATCGA ATTCAACAGA TAGATGTCTT TGAAAAATTA 2880 

TTGAAAGAGA AAGAGGTTGA TGACTGTTAC ATTATTAGCG GAAAAACCAA AGTCCGAGAG 2940 

AGAACGAGTT TACTGGAGAC GTTAGAACAG TTAGATAAAG GGTTTGTTTT GTTGTCTACT 3000 

GGAAAATACA TTGGCGAAGG TTTTGACTTA CCTCAGTTGG ACACGCTTAT CTTGGCAGCA 3060 
CCCTTTTCTT GGAAAAATAA TTTGATTCAG TATGCAGGTC GGATTCATAG AAACTACAAG 3120

GATAAGTCTT TGGTGCGTAT TTTCGATTAT GTGGATATTC ATGTTCCTTA TTTAGAAAAG 3180 

ATGTTTCAGA AACGACAAGT AGCTTATCGA AAGATGGATT ATCGTGTCAT CGAGGGTGAG 3240 

GAGAAACAAT TCGTTTATGT TGATAGTAGA TATGAGAAGG TGTTGAGAGA GGACTTAGCA 3300 

GGGGAAAGAC AGGAATGTCT GCTTATTTTA CCTTATGTGC ACCAGACAAA ACTGATGAAT 3360 

TTTCTAAAAG AATTTAGGAT TAGTCAAATT GAGATATGTA TACCAGAGAC GGTTGCAAAT 3420 

AAAGCATGGC TAGACCAGTT GAAGAGCCAG AAAATTAAAG TGTCTTTTAC TCAATCAAAA 3480 

ATAGTAACGC CTATTCTTTT GGTGAATAAG ACTATTGTTT GGTATGGTGC AATGCCATTA 3540 

TTAGGGAAGG TAGATGAGAT GACCATATTA CGTTTGGAAT CAGCTAGTAT AGTTTCTGAA 3600

CTAGTGGCAG GTTTACGATA GAGAAAATTT TTAAAAATTT CTATGTATGA TTTTCATTTC 3660 

TTTAGTGAGA CTGTTGCCAT TATCACATTC GAATCACACA AAATAAAAAA ATTTTTATAA 3720 

GTACTTGACA AATAGATTGA AATATCATAA AATAAAAACG GTTACAGAGT TATTAATTAT 3780 

TTAAGCTTCA TGTCACCATT AAAAATTGAA ATAAAAGGAT GTTATCACTA ATACAAGTGA 3840

GCAGGAACCT ATTTAATCAC ATCAGAAGAA GTTTCTTGAT GTTTTTAAGT AGCTTCCTTl' 3900 

TATTTTAAAA GGGAAATTTT ATGATCATAA AACGAATACT AAACCACAAT GCCGTAATTG 3960

CGCAAAGTAA AAAAGATATC GATATTCTTC TTTTTGGAAG GGGAATAGCT TTTGGAAGAA 4020

AAACTGGAGA TAAAGTAAAT CCAATTGATA TTGAGAAAAG TTTTTTTCTC AAAAATAGAG 4080 

ATAATATGAC CCGTTTTACA GAGATGTTTA TTAACGTTCC TTTGGAGTTG GTGTACATCA 4140 

CCGAAAAAAT AATTAACCTA GGTAAAATAA CATTGGGTAA TAATTTTGAT GAAATTATCT 4200 

ATATTAATTT AACGGATCAT ATTTCTTCGA GCATAGAACG TTATAAAGAA GGGATTATTA 4260 

TTTCGAATCC CCTACGCTGG GAAATATCGA AATATTATAA AGAAGAATTT GAACTTGGGA 4320 

AAAGGGCTTT ACAAATAATA AAAAAAGAGT TAGGTATTGA ACTTCCAATT GACGAAGCTG 4380 

CATTCATAGC GCTACATTTT GTTAATGCTA ATTTAGAAAA TAATTTTCAA GAGTCGTATA 4440 

AAATCACTGA AATAATTATG GGAATTGAGA AAATCATTCA AGATTTCTAT TGTACTGAGT 4500 

TTAACCAAGA TTCTATTGAT TATTATAGAT TCATAACTCA TATGAAATTA TTTGCCCATC 4560 

GCTTGGTTGA GAATACAACT TATTGTGACG ATGATGA 4597 
(2) INFORMATION FOR SEQ ID NO: 176: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3984 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 176: 

CGGCTTATTT ACTACTTGTT CCATCATATA TGGAATATGC ATGAAACCTG CTCTCATATT 60 

AGGGAATTTT TTATCCACTA AATAAAGAGC TTGGTACATC AAATGATTGC AAACAAAGGT 120 

TCCTGCACTA TTGGATACAA CTGCCGGAAG TCCCTGTTTT TTGATAGCTT GTACCATCGC 180 

TTTGATAGGT AAACTACTAA AATAGGCCGA TGCTCCATCA ATACGAATCG GTGTATCAAT 240 

TGGTTGATTG CCTTCGTTAT CAGGTATGCG AGCATCATCT TGATTAATAG CCACTCGTTC 300 

AGGTGTTAAG CCGGTCCTGC CGCCTGCTTG TCCAATACAA AGTACAGCAT CTGGTTGATA 360 

TCGTAATATT TCTGCCTCTA AAACTTCTGA CGACTTATAA AAAACCGTTG GAATTTCTAC 420 

CCAGCGAACT TCAGCCCCAT TAATCTCAGA TGGTAATAAT TTTACAGCCT CCAAAGCTGG 480 

ATTAATCTTT TCACCTCCAA AAGGATTAAA ACCTGTAACC AATATTTTCA TTTTATTTTC 540 

CTTTACTAAA ATGCGAGAAA GTACATTAAG AATATGTGAA TAACAATCAT TACTAGAGCA 600 

ACACCTGCTT GAGCCTTTAT AACGCCATTC TGATCTTTCA TATCCATCAA TGCTGCTGGT 660 

AGAGCGTTAA AATTAGCAGC CATTGGGGTC AATAAGGTCC CACAATAACC TGCTGTCATG 720 

GCAAGAGCAC CAGCCACAAT TGGATTAGCT CCCAGAGCAA ATACAAAGGG AACTCCAACA 780 

CCTGCTGTAA TAACGGTGAA TGCTGCAAAA GCa'1"ITCCCA TAATCATTGT GAATAGAACC 840 

ATTCCAAGAA CATAGGCCAA AACTCCTATA AAGCGACTAT CTGAAGGAAC AATACCGCTA 900 

ATCAGATGAG AGATAACATC ACCAACACCT GCTACAGTAA AAATAGCCCC CAAAGCCCCT 960 

AATAATTGAG GAACAATCCC ACTTGTTGAA ACTTGCTGAG TCATTCGATT ATTTTCTGAT 1020 

AACAGACTCT TAGGGTGACT ATTGGTAATC ACAAGAACAG AAATTGTAGC AAACAAGGCG 1080 

GCAAGGCTAA TCGAAATCTT GCTAAATTCT GGAATCATTT GCGCTAAGAC CAACGCAAGT 1140 

ATTGCCATCA GCATAACTGG AATAAAAATT TTATTTTTCA ACCTGTTAGA TTCAATATTG 1200 

GCTTTCATTT CATCTAAGGA TGGCAAGGTT CCGATACGGA CTTGCTTAAA CAATGTTAAC 1260 

AGCGATAATA GGATTACAAT AATACCAATA CTCATATTTG GCATATAGGA ACCACCTATA 1320 

AACGTAATAG ACAATAGAGT CCAAAATGCA GATGTCCCAA GTCGAACTGG GTTTGTTTTA 1380 

TCTTTATAAC TACAATAGGC TGTATGGAGA AATTGACAAC CAATCACAAT ATAGGTCAAC 1440 

TCTAATAGTT GCTTTGCCAA CTCTGTCATT TTTGTTCTCC TCCCCTAGTC TTTTTTGATA 1500 

TCAATTTTTT ATCAAATAAA TAATTATAAA TCCCCACTAC AATAAGTGTT ATAACAGCAA 1560 

CAATAATAGA TGTAGAAGCA ATCCCTGCAT AATTGCTTTC ATAGCCTAAC TGATCTAATG 1620 
TTCCCCCTAT CAAGAGGACT CCCCCAGCAC CTACAAACGT ATTTTGAGCA AAGAAATTTC 1680 

CAAAATTTTC ATTCGCAGCC GCACGCGCTT TTATTGTCTC ATCTTCAACC TCTGTTAACT 1740 

TTCTACCTAA TTGAGACTCT GCAGCTGCTT CTCCCATAGG TTGAACCAAA GGTCTGACAA 1800 

ACTGAGGGTG TCCTCCTAGA CGAATTGAAA AGAAACCAGC TAACTCTCGA ATAAAGAAAT 1860 

AAACTGTATA GAAGTTTCCA ACTGTCAGAC CTTTAATCTT TCGAATCAAA TCGATTGATC 1920 

GTTGCTTGAG TCCAAAGGTT TCTGACAGCC CCACAAGAGG CAAGGTAACC ATAAAAATCG 1980 

TGAGCACTCG CTGATTGCTA AATTCTTTTC CCAAAATCTC CAAAAATTCA ACGAGAGAAA 2040 

CACCTGAAAC TAAAGCTGTA ACCAAACCAG CTAAGACTAC TGTTGCAATT GTATCAAATT 2100 

TTAAAATAAA ACCCACAACA ATGATTGCTA TTCCTATTAA TCTAATCCAC TCCATATCAA 2160 

ACTCCTTTAT ATTCAAAATG ACAGTATTTT TAAAATTTTA TCAAGATCAA TACCATTCCT 2220 

TATTTAATGT GTTTTTCTAG TTCTTTTTGG TATTTGCTAT TGGATTCCAA TTTTTCTTTT 2280 

TGCCATTTTT TAAAAACCTC GTTATATTCT TTTGTTGTAA CAATATCTTT TTGCAATTTC 2340 

ATTCCTTTAA AGATATATGG ATCCCCCTTA ATACCAACTT GTGAGTATGG TTTTGAGAAT 2400 

GGTACTACGT TACTTACAAC TGGAGAACCA CCAGATGAAG CTGTTGGCAT CAATAATGAA 2460 

CTATCTGTCG ACCAAGCTTG AGCTTTGGCA TATTTTTCAT ATCTTTTCTC TAGGTCAGTG 2520 

GTCTCAGAAA CAGCATCTTC TAACAATTTC TTATATTTAT CCAAACCAGG TTTAGCTACA 2 580 

ACATCCTTAT CTTTTCCTTT CGTAATACCA AGGTGTTTCA TGGCAGAACC AGATTTTGGA 2640 

TCTATAATAT TCAAGTGAGA CGCTGGATCT TGATAGCTTG GAGCCCATCC TGTACTGTTC 2700 

AAATCATAGT CTTTTTGAGA AGGAGCAACA TTGCCGTATT TATCATTTTC CATCAAACCA 2760 

TCAATAACAT TTCCAATAAC GTCTGTCCTC GATGTTCGAG TCGCTATACT GTAGCCCAAT 2820 

GATGCTGGAT CTACTGCATA GACATAAGAA AATGTTGTCG GTGCATCTGC TTCTTTATCA 2880 

GTTTTTCCAC AAGCCACTAA AATAGCTGAC GTGCTCAGGA CCACTCCTGC TGTTAAGAGC 2940 

CACTTTTTCT ATTTCATAAA GAATCTCCTT TGGTTTATTT TAATCTACTT TTACAATCCA 3000 

ACCTTCTGGC GCTTCAATAT CGCCAAACTG AATACCCGTC AATTCATTAT ATAATTTACG 3060 

CGTCACAGGA CCTACTTCTG TTTCACTATA GAATACATGG AAATCATCAC CATGTTGAAT 3120 

ACCTCCAATT GGAGAAATAA CCGCTGCTGT ACCACAGGCA CCTGCCTCTA CAAAACGGTC 3180 

AAGATTATCA ATTGGAACAT CACCCTCAAT AGGAGTTAAT CCCAAGCGAT GTTCTGCCAA 3240 

ATAAAGCAAG GAATACTTGG TAATAGATGG CAAGATAGAT GGACTCAATG GTGTTACAAA 3300 

TTCATTATCA GCTGTAATTC CAAAGAAGTT AGCTGATCCG ACTTCTTCAA TCTTTGTATG 3360 

AGTTGATGGG TCCAGATAGA TAACATCTGA GAAATGACGT GACTTGGCCA TTTTTCCTGG 3420 

TAAGAGACTT GCAGCATAGT TTCCACCAAC CTTAGCCGCA CCTGTACCAT TTGGTGCTGC 3480 

ACGGTCGTAC TCATCCTGAA TCAAGAAGTT GGTTGGGACC AAACCACCTT TAAAGTAATT 3540 

TCCAACTGGC ATAGCAAAGA TGGTGAAAAT GTACTCTTCT GCCGGTTTTA CCCCGATAAT 3600 

ATCTCCGACA CCAATCAAAA GAGGGCGAAG ATATAAGGTT CCACCTGTTC CGTATGGTGG 3660 

TACGTATTCT TCATTCGCAC GGACAACTGC TTTACAAGCT TCTACAAACA TGTCTGTCGG 3720 

AACTTGTGGC ATCAAGAGAC GGTCACATGT ACGTTGCAGA CGTTTAGCAT TTTCATCAGG 3780 

ACGGAACAGT TGAACACTGC CATCCTTAGT ACGATAAGCT TTCAAACCTT CAAATGCTTG 3840 

TTGTCCATAG TGAAGACTTG GAGAAGACTC TGAAATATGC AAAGTTGCAT CCTCTGTAAG 3900 

CTCTCCTTGA TCCCATTGTC CATTTTTGAA ATGAGCAAGA TAGCGATAAG GTAATTTCAT 3960 

ATAGGAAAAA CCGAGGTTTT CCGG 3984 
(2) INFORMATION FOR SEQ ID NO: 177: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8703 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 177: 



TATCTAATTA TTGGTTTTTA TCGCTGACCT TGGCTATTGT TGGGGTTGTT TTACCCTTGT 60 

TGCCTACAAC ACCTTTCCTT TTGTTGTCTA TTGCTTGTTT CTCCAGAAGT TCCAAGCGAT 120 

TCGAAGATTG GCTTTATCAT ACCAAGCTCT ATCAAGCATA TGTAGCTGAT TTTCGTGAGA 180 

CCAAGTCTAT TGCGCGTGAA CGAAAGAAAA AAATCATCGT CTCTATCTAC GTCTTGATGG 240 

GAATTTCTAT TTATTTTGCA CCTCTTTTAC CAGTCAAAAT CGGTCTGGGT GCTTTGACCA 300 

TCTTTATTAC TTATTATCTC TTCAAGGTCA TTCCAGACAA AGAATAGTTA AAACAGTAGT 360 

TATTTGCCTT GATAAAATTG AAAGCATATT CATT^CAATA TGATATAATA AAATTGAAGT 420 

AATATTCAAG GAGAATCAAA TGATTTACGA ATTTTGTGCT GAAAATGTGA CTTTACTTGA 480 

AAAAGCGATG CAGGCTGGAG CTCGTCGGAT TGAACTCTGT GATAATCTAG CAGTTGGTGG 540 

GACAACACCC AGCTATGGAG TGACTAAGGC AGCGGTTGAA CTGGCAGCTA ACTACGATAC 600 

AACCATCATG ACCATGATTC GGCCACGTGG TGGTGACTTT GTCTATAATG ACCTAGAAAT 660 

TGCTATCATG CTAGAAGACA TTCGTTTGAC TGCTCAGGCT GGAAGTCAAG GGGTTGTATT 720 

TGGAGCTTTA ACTGCTGATA AAAAGTTGGA TAAGCCTAAT CTGGAAAAGT TAATTGCTGC 780 



WO 98/18931 



PCT/US97/19588 






ATCAAAAGGA 


ATGGAAATTG 


TCTTTCACAT 


GGCCTTTGAT 


GAACTAAGTG 


atgaagatpa 


840 


AGCGGAAGCT 


ATTGACTGGC 


TCAGTCAAGC 


CGGTGTCACT 


CGTATCCTAA 


PTPGTGPTGG 


900 


TGTGTCTGGC 


GACTCCTTAG 


AAAAACGTTT 


TGTTCACTAT 


CACAGAATTT 


TGGAGTAPGP 




TAAAGGTAAA 


ATTGAAATTC 


TACCAGGTGG 


GGGGATTGAC 


CTTGAAAAPP 


GTP A A A PPTT 
ul^AAAL.^1 1 




TATCGACCAG 


GTGGGGGTAA 


CACAATTGCA 


TGGTAPTAAG 


PTTPTTTTTT 
O 1 1 U 1 1 1 1 1 1 


AAAAAAi AOA 


1080 


AAGGAACTGC 


TAGCTTTGGG 


TAGPAfiTTTT 

X t\\3\^t\\3 1111 


P A PTT A TPTT 
^Al_ I lAlLil 1 


TG AAAT TTTT 


R R R fT^/^/^m R m/^ 

AAA X (_ L 1 ATC 


1140 


AATTTAATCA 


AGAAAAGCn* 


r'aTf3 ATT A Tn 


G TTT TAT AG A 


R R RniR/^r>/^7l R 

AAA i AIjI^vjAA 


GTCTGGACAT 


1200 


ATAGTTGCCA 


X X X X \^\^^\n 


T^ A 1" |*Tf^ 


TPATP APTPT 
1 V, A 1 Vj AU 1 Ij i 


(j i (_(_ A 1 CAL 1 


GCTGATAATG 


1260 


TGAACTTTCA 


n\3 x^x X X Kjjv^ 


PA AnAPBPTr" 


Ls ivjAUt_ 1 1- i A 


TCCTCACGTT 


TATATGGAAA 


1320 


GCATGAGGGG 


A AGTTTTGTr» 
ru\v> 1 1 1 I u 1 


Rf; A A A TPTrT" 


(J i\3AL»VjL xlK> 


TCTGGAGATT 


CTTTACCAGA 


1380 


TTCGGAAGGC 


X x\j X X X X 1 


r'TPP A A P A T>T 


TTATCTGTCA 


TCAGACTAAG 


CGTATCATGA 


1440 




A A A Ar^T AT 
ouAA/Uivj xnx 




TGGAGTATCT 


GTGGGTUiiAAA 


TCGCCTGATA 


1500 




GPGOOATP A A 


ij^CAA i ^ AAA 




CGTCTTGATG 


AAAATCTCTT 


1560 


GGAATAAGCT 


GGAAAAGGnP 


AP Ar*A APP ap 

Au Ao AAu u A L. 


AA(j i\jV;iAAL>(- 


AOTCAACCTC 


AAGCATGACC 


1620 


AAGTAGPT A A 


111 \JK^ 1 1 1\, A 


L. AAAAu ta vji A 


ill ATuCAGC 


CTTCCATATG 


AGCAAGCGCT 


1680 


ACTGGATTAG 


TnTGTPrT'TT 
1 v> 1 u 1 1 1 


ATP ATAPTT 
o A i\a A 1 Al_ 1 1 


1 AlLAtjArCaA 


TV ^ R R R /*>m^ 

AGaAGTACTG 


GAATTGATAG 


1740 


AAAAAAGTTG 


GAACTTAArC 

V3rtrtV» A l/\nV-^ 


TPTAAAA AAT 


A A ATBTTTT 
OAAAlAl III 


AAi AAl 1 i iC. 


ATGAACTTTC 


1800 


AATTAGCTAA 


n xnx X \^ X 1 X t\ 


PTfIA An Aft! AT 


TTTTAPAAaa 
1111 AuAAAA 


TaTar^r^ aTTT 
1 A i AVjO A 111 


R R ^ R /^n R ^ 

ACCACACTAG 


1860 


AGGAATATGG 


TGCCATCTTC 


AAATACCTGA 


TTfl AG A ATnT 


PAAPAPPPAT 
^AAVjA^L»«A 1 


ppTPapaTr*a 
1 ^ AO A 1 LA 


1920 


TCTATTCG CC 


TCACTGTCAT 


GATGACCTPR 


GAATGGPAfZT 


PPPa A ATAPP 
uoU-AAA 1 Auk. 


CTTG CTG C TG 


1980 


TCAAGAATGG TGCAGGACGT GTTGAAGGGA CTATCAATGG TATTAGGGAG CGAGCTGAAA 2040 

ATGCTGCTTT GGAAGAAATT GCAGTGGCTC TCAATATTCG CCAAGATTAC TACCAAGTAG 2100 

AAACCAGTAT TGTCCTAAAT GAGACCATCA ATACGTCAGA AATGGTTTCT CGCTTCTCTG 2160 

GTATTCCAGT TCCTAAAAAC AAAGCCGTCG TTGGTGGCAA TACCTTCTCC CACGAATCTG 2220 

GTATTCACCA AGATGGAGTC CTTAAAAATC CTCTCACTTA TGAGATCATC ACACCTGAAT 2280 

TGGTTGGTGT TAAGATTCTG CTTGGAAAAT TATCTGGTCG CCATGCTTTT GTTGAGAAAC 2340 

TGAGAGAATT GGCCCTAGAT TTTACAGAAG AGGATATCAA ACCACTCTTT GCTAAGTTCA 2400 

AGGCACTGGT CGATAAGAAG CAAGAAATCA CAGATGCAGA TATTCGAGCT TTGGTAGCTG 2460 

GAACCATGGT TGAAAATCCA GAAGGCTTCC ACTTTGATGA TTTACAACTT CAAACTCATG 2520 

CAGATAATGA CATTGAAGCG CTCGTTAGCC TAGCCAATAT GGATGGTGAG AAAGTCGAAT 2580 

TTAATGCGAC AGGGCAAGGT TCCGTTGAAG CAATCTTTAA TGCTATCGAT AAGTTCTTTA 2640 

ACCAATCTGT TCGTTTGGTG TCCTACACTA TCGATGCGGT AACAGATGGA ATCGATACCC 2700 

AGGATCGGGT TTTGGTCACT GTTGAAAACA GAGATACAGA AACCATCTTT AATGCAGCAG 2760 

GGCTTGATTT TGATGTGTTG AAGGCTTCTG CTATTGTCTA TATAAACGCT AATACCTTTG 2820 

TTCAAAAAGA GAATGCAGGT GAGATGGGAC GCAGTGTTTC TTACCACGAT ATGCCTAGTG 2880 

TGTAAAGGAG AAGGCTATGG CAAAGAAAAT AGTAGCTCTA GCAGGAGACG GAATTGGCCC 2940 

AGAAATCATG GAGGTTGGTT TAGAAGTTCT GGAGGCTCTA GCTGAAAAAA CAGGTTTTGA 3000 

CTATGAGATT GACAGACGAC CGTTCGGAGG TGCAGATATT GATGCAGCAT GACCTCCCTT 3060 

ACCTGATGAA ACCCTTAAGG CAAGTAGGGA AGCAGATGCT ATCCTACTAG TAGCTATCGG 3120 

TAGTCCTCAG TATGATGGAG CAGTGGTTCG CCCTGAACAA GGCCTGATGG CTCTCCGTAA 3180 

GGAACTCAAT CTTTACGCTA ATATTCGTCC TGTAAAAATC TTTGACAGTC TCAAGCATTT 3240 

GTCACCACTC AAACTGGAAC GAATTGCTGG TGTAGACTTT GTCGTGGTGC GTGAATTGAC 3300 

AGGCGGGATT TACTTTGGAT ATCATATTCT TGAAGAGCGC AATGCGCGTG ATATCAACGA 3360 

CTATAGCTAT GAGGAAGTGG AGCGGATTAT TCGCAAAGCC TTTGAAATTG CAAGAAATCG 3420 

CAGAAAAATC GTTACTAGTA TCGATAAGCA AAATGTTCTA GCGACCTCAA AACTCTGGCG 3480 

GAAAGTAGCT GAGGAAGTCG CAC^.CCATTT CCCAGATGTA ACCl-rUGAAC ATCAGCTGGT 3540 

AGACTCAGCT GCTATGCTTA TGATTACCAA TCCTGCTAAG TTTGATGTTA TTGTAACGGA 3600 

GAATCTTTTT GGAGATATTT TATCTGATGA ATCAAGCGTC TTATCTGGTA CACTTGGGGT 3660 

TATGCCATCA GCCAGTCATT CTGAAAATGG ACCAAGTCTC TATGAACCTA TTCACGGTTC 3720 

AGCACCTGAT ATTGCAGGTC AAGGAATTGC CAATCCTATT TCCATGATTT TATCAGTTTC 3780 

CATGATGTTG AGAGATAGTT TCGGACGTTA TGAGGATGCA GAGCGTATCA AACGTGCTGT 3840 

TGAGACAAGT CTGGCGGCAG GAATTTTAAC GAGAGATATA GGAGGTCAGG CTTCAACAAA 3900 

GGAAATGACG GAAGCTATTA TTGCAAGGTT ATGAAGTTAG ACGAAAAAAT TACTCTAGTC 3960 

CTTTTGATTT GGAATGTCAT CATTTTCTTG ATTTATGGTA TTGACAAATC TAAGGCAAGG 4020 

AGAAGAGTTT GGCGCATCCC TGAGAAAATC TTACTTATTT TAGCCTTTAC TTTTGGTGGT 4080 

TTTGGTGCCT GGCTAGCAGG AATCATCTTT CACCACAAGA CTCGAAAATG GTACTTTAAA 4140 

ATAGTTTGGT TTCTTGGGAT GGTGACCACA CTAGTAGCCT TATATTTTAT TTGGAGGTAA 4200 

TGGATGGCAG GGTCTTCGAG GGAATACGCT GCTTGGGCTC TAGCGGACTA TGGTTTTAAG 4260 

GTCGTGATTG CAGGATCTTT CGGTGACATT CATTACAATA ATGAACTCAA TAATGGCATG 4320 

TTGCCAATCG TTCAGCCTAG AGAGGTTAGA GAGAAACTAG CCCAGCTAAA ACCAACCGAC 4380 

CAGGTAACTG TGGACTTGGA ACAACAAAAA ATCATCTCAC CAGTTGAAGA ATTCACCTTC 4440 

GAGATAGATA GCGAGTGGAA ACATAAACTC CTAAATAGTT TGGATGATAT CGGTATTACC 4500 

TTGCAGTATG AAGAGTTGAT TGCTGCTTAT GAAAAACAAC GACCAGCCTA CTGGCAGGAT 4560 

TAGAAAAAAT AGAAAAGGAG ATATAGTAAA CTGAAATAAG ATGTAAACAA ATGAATTGGA 4620 

GCTTAACATC CATTTCCAGC AATTTTTTAG AAACTACAGT GGACTATTCT GGATTCAACA 4680 

CATTATAAAA TTATGACAAA ACACATTCAC AAGAAGGCTA CGACATTTTA AAAGGTGAGG 4740 

GCGGATGTAT CGTTTGCCCT ACTAAAGTTG GTTACATTAT CATGACCAGT GACAAGGCAG 4800 

GACTTGAGCG TAAGTTCGCA GCCAAAGAAC GTAAGCGTAA CAAACCAGGT GTTGTTCTCT 4860 

GCGGTAGCAT GGATGAACTT TGCGCTTTAG CGCAACTCAA CCCAGAAATT GAAGCATTCT 4920 

ACTAAAAACA TTGGGATGAA GATATTCTTC TTGGTTGTAT CCTTCCTTGG AAACCAGAAG 4980 

CCTTTGAAAA ACTCAAAGCA TACGGGGATG GCCGTGAAGA ACTTATTACT GATGTACGTG 5040 

GTACTAGCTG TTTTGTTATC AAGTTTGGAA AAGCAGGTGA ACAATTGGCT GCCAAGCTTT 5100 

GGGAAGAAGG TAAAATGGTC TACGCCTCAT CTGCTTCAAT GACAAAACGA TTGAAACTCG 5160 

CTATGAGCAA GGTGTAATGG TGTCTATGGT CGATAAGGAC GGCAAACTCA TCCCAGAACA 5220 

AGGAGGAGCA CGTTCAACTT CACCAGCTCC AGTTGTGATC CGTAAAGGGC TTGACATTGA 5280 

TAAAATCATG ATGCACCTGT CAGATACTTT TAACTCATGG GACTACCGTC AGGTTGAGTA 5340 

TTATTAGGAT AGAGAAGAAG TCTAGTGTTA TGAGATATTA AAGCTCCTAA CACTGGGCTT 5400 

TTGTTTAGAA TTTCTTTTCT TTTTCTATAG GATATGGTAT TCTATGTAGA AAATATATGT 5460 

TAATAAGTAA TGCCAATATT TAAACATCAT TAGTAAAAGG AGTTAGATTG ATGAATAAAA 5520 

GAAAAGTTAG TTTAGAAGAT TTTTATAAAT GGTATAGTCT AAATAAAGAA GAGTTATTAA 5580 

ATAAGGCAAC TGTTGGTGAA AAGTTTAATG ATAAATTAAA AGAAGAGTTT CTCCAGGAAT 5640 

GGCCTTTGGA TAGGATTTTA ACAATGTCAA TCGATGAATA TGTAATAGGA AAGGGACAGC 5700 

AAAATAAGTC TTTATGCTAC GCTCTTGAGA AGGGAAAATA CAAAAATCTA TTTCTTGGAA 5760 

TTTCTGGTGG CTCAGCTTCA AAATTTGGTA TTTATTGGAA TAAAAAAACA AACAAATATA 5820 

AAGATCAAGC TAATAATGAG ATTTCAGAGT TGGATCAGCG ATTTTCAAAA TTAAAATCAG 5880 

ATTTGTATGA AATTATCAAA GAAGGTATTC GTTTTAACTT TGAAAATCCT ATTTTTGATA 5940 

TGAAAAGATC AACAAATGAA TTTATTGGTC GTTCTGCTAT GGTGACAAAA TTACTTTGTA 6000 

TCTATACTGA GGGAGATCCT TTCTTTGGTG TAAATATTAA TAGTCAGAAA GAATTTTGGA 6060 
ACCACTTTGT TTCTCAGACA AATCAAGGTG GACCTTATCT GCAAAATCAT AAAATAATTG 6120 

AACTGGTGTC CAAAACTTAT CCTGAGTTGG AGCCATCGAA ATTAGGAACT ATGCTTTTTG 6180 

AGTATTCTAA GCTTTTTATG GAAAATAAGG AAGACAATAG TACAATGGAT TCATCAAACA 6240 

ATTTTCGTCA TCAATTAACT CAATCTCTAT TAAAGTCTCC AAACCTCATC CTCCGCGGTG 6300 

CTCCTGGCAC GGGAAAAACT TATCTTGCTA AAGAAATTGC TAAAGT^TTA ACGGATGGCA 6360 

ACGAAGATCA AATCGGATTT GTACAATTTC ACCCATCATA TGATTATACG GATTTTGTAG 6420 

AAGGTTTAAG ACCAGTATCA AATGGGGATG GAGCTATTGA GTTTAGGCTA CAGGACGGTA 6480 

TTTTTAAAGA TTTTTGTCAG AAAGCAAAAG AAACCCAATT GATTGGAGGA CAAGATAATT 6540 

TTGATGAGGC TTGGGATTCT TACTTAGAAT ATATAAATGT TGCTGAAGAA AAAGAATATA 6600 

TAACAAAAAC ATCTTACTTA TCTGTTAATA GTAGACAAAA TTTGTCAGTA AATTATGATA 6660 

GTGGTGTTCC AGGATGGTCA CTACCTAGCA AATATGTTTA CGAGTTGTAT AAAGATAAAA 6720 

ATTATAATAA GCAAGAATAC TACAAAAGTG GTGGAAAAAC TGTCCTAGAA ACATTGAGAA 6780 

AGAGATTTGG TTTGAAAGAC TATGTTTCCC CAACAGAAAT TGATACTGAT AAGT^TTTTG 6840 

TCTTCATCAT CGATGAGATC AATCGTGGGG AGATTTCTAA GATTTTTGGC GAACTCTTTT 6900 

TCTCTATCGA CCCCGGCTAT CGTGGTGAAA AAGGAAGTGT TTCTACCCAA TATGCAAATC 6960 

TACACGAAAC TGATGAAAAG TTCTATATCC CCGAAAATGT TTACATCATC GGAACTATGA 7020 

ATGATATTHA TCGTTCAGTG GATACCTTTG ATTTTGCTAT GCGtCutuGT TTTCGTTTTG 7080 

TTGAAGTTAC TGTCGAGGGT CAAGCTGGCA TGTTGGATAA AGAGTTGAAT ATCCATGCAG 7140 

AAGAAGCAAA AATTCGTCTA AGAAACTTGA ACGCTGCTAT CGAAAATATT CAGGAATTAA 7200 

ACAGTCATTA TCATATTGGA CCAAGTTATT TTCTTAAGTT GAAGGATGTA GATTTTGACT 7260 

ATGAATTACT CTGGTCTGAT TATATTAAGC CTCTCCTAGA AGACTACTTG CGAGGTTCTT 7320 

ATGATGAGGT TGAAACTTTG GAaACTTTGA AAAAAGCATT TGATCTGACA AATAATGAGC 7380 

AAAAAGATCA GGCAGTAGCT GATGACAATG AAGGCGATGA AAACGATGAT GCGGATTAi.'T 7440 

GATAATCAAC ACAAGATTAT TAAAGAAAAA TTTGTTGAAG AATATCCTAA ACTAAGCAAT 7500 

CCTCTTTTAG ACAGAACCTT GGAAAGTCTA TCCCAAGATG AACGTATTTT CATTTTTCCA 7560 

AATGATTwGA CTCATACTCC TGATTTGGAT AAGGACCAAA AGATTTTTGA AACAGTCAAT 7620 

CAGAAAATCA AGACAGGGAA CGTGATTGGT TTTCTTGGAT ATGGTCAGGA AAGATTAACG 7680 

ATTTCCTCAC GATTTTCTGA TGAGAGTAAT GACCACTTTT TGCATTATCT CTTAAACAAG 7740 

GTTCTTCATA TCAATCTCAC TAGTTTAGAT GTTGCTTTGT CTCGTGAAGA GAGGCTTTAT 7800 

CAACTTTTGG TGTATCTCTT TCCCAAGTAT CTACAAGCTG CTATTCGAAA AGGTCTTTAT 7860 
AAGGAATATC ATCGATTTTC TCATAACGAC AGTCATGTTA AGGGAGTGAT TGATGTAAGA 7920 

AACCATCTCA AGAAAAATCT TCCTTTCACG GGAAATATTG CCTACGCAAC GAGAGAGTTC 7980 

ACCTATGATA ATCCCCTCAT GCAGTTGGTC CGTCACACTA TTGAATACAT TAAGAATCAG 8040 

AAAAGCATTG GTCAAGGGGT ACTAGATAAT CTCTCAACTA GTCGTGAAAA CGTATCTGAA 8100 

ATCGTGCGTG TAACGCCCTC TTATAAACTA GCTGATCGTG CTAAGATTAT TCGGGGAAAT 8160 

CAATCTAAAC CTATACGTCA TGCATACTTT CACGAGTACA GAAACTTACA AGAACTTTGT 8220 

CTGATGATCC TAAACCAAGA AAAGCACGGT TTAGGGTATC AAGATCAAAA AATCTATGGT 8280 

ATTCTCTTTG ATGTTGCCTG GCTTTGGGAA GAGTATGTTT ACACCTTGTT GCCAAAAGGT 8340 

TTTGTACATC CCAGAAATAA GGATAAGACG GATGGAATTT CAGTATTTTC TGTTGGGAAA 8400 

CGAAAAGTAT ATCCAGATTT TTATGACAGA GAACGAAAGA TTGTTCTAGA TGCAAAATAT 8460 

AAAAAACTGG AATTGACTGA AAAAGGAATC AACCGTGAGG ACTTATTCCA GCTGATTTCC 8520 

TATTCTTATA TTTTAAAAGC TGAGAAGGCT GGACTGATTT TTCCTAGTAT GGAGCAGTCA 8580 

GTAAATAGTG AAATAGGAAA AGTAGCTGGC TATGGAGCTC AATTGAAGAA GTGGTCTATT 8640 

CGAATCCCTC AGAATGCCTC ATTCTATAGT ACATTTTGTA AAATGATGGA AAATTCAGAA 8700 

GAG 8703 



(2) INFORMATION FOR SEQ ID NO: 178: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4854 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 178: 

CATCACCAGT TTTAGATGGC TTTAACAGTG AAATTATTGC TTTTAATCTT TCTTGTTCGC 60 

CTAATTTAGA ACAAGTACAA ACAATGTTGG AACAGGCATT CAAAGAGAAG CACTACGAGA 120 

ATACGATTCT CCATAGTGAC CAAGGCTGGC AATATCAACA CGATTCTTAT CATCGGTTCC 180 

TAGAGAGTAA GGGAATTCAA GCATCCATGT CACGTAAGGG CAACAGCCAA GACAACGGTA 240 

GGATGGAATC TTTCTTTGGC ATTTTAAAAT CCGAAATGTT TTATGGCTAT GAGAAAACAT 300 

TTAAATCACT TAACCAATTG GAACAAGCCA TTATAGACTA TATTGATTAT TACAACAATA 360 

AGAAAATTAA GATAAAACTA AAAGGACTTA GTCCTGTGCA GTACAGAACT AAATCCTTTG 420 

GATAAATTAT TTGTCTAACT GTTTGGGGGC AGTACACAAG AAAGCGCTTT AAAACCAGTA 480 






GACCTTTTCA TAAGGTTCGC TTGATGTACC AAGATGAGGC TGGTTTCGGT AGAATCAGTA 540 

AACTGGGATC TTGTTGGTCT CCAATAGGAG TAGGTCCACA TGTCCATAGT CACTATATAC 600 

GAGAATTTCG CTATTGTTAT GGAGCTGTTG ATGCCCATAC AGGCGAATCA TTTTTCTTAA 660 

TAGCTGGTGG ATGTAATACT GAGTGGATGA ACGCCTTTTT AGAAGAGCTT TCACAAGCTT 720 

ATCCAGATGA TTATCTTTTA CTCGTTATGG ACAATGCTAT ATGGCATAAA TCAAGTACCT 780 

TAAAGATTCC GACTAATATT GGTTTTACCT TTATTCCTCC ATACACACCA GAGATGAACC 840 

CATTGAACAA GTGTGGAAAG AGATTCGTAA ACGTGGATTT AAGAATAAAG CCTTTCGAAC 900 

TTTGGAAGAT GTCATGAATC AACTCCAAGA TGTCATACAA GGATTGGAGA AGGAGGTGAT 960 

AAAGTCCATC GTTAATCGGA GATGGACTAG AATGCTTTTT GAAAACAGAT GAGTATAAAA 1020 

TTGAATTGCT TATAAAAAAG CTCCATACAC TGGATGTGTA TAGAGCAATG GGGCTTTATT 1080 

TGATATAGAG TTCTTGGTTT TTTAGGACAA TTTCTCGGAT ACTTGCAAAC TTTTTAAGTT 1140 

TTTTGATTTC TTCTGGATGA GTGACGAGAG TGATAACATA ACCTTCCTTG CCCATACGAC 1200 

CAGTACGGCC AGCACGGTGT GTGTAGGTTT CGCTATCTCT AGGAATATCA AAGTTTACGA 1260 

CACATTCTAG GCTATCGATA TCAATTCCAC GAGCCAAAAG GTCAGTTGCA AGAAGCAGGG 1320 

TTAGTTGGTT ATCTTTAAAC TTTTCTAAGA TGATTTTTCT AAATTTAACA TTAACATCAC 1380 

TAGCGAGGGA AACAGCCAAT ATATCACGAT ACTGTAGTTT TTCCTCGGCA TTCCCAAGGT 1440 


CTGACAGGCT 


ATTGAAGAAn 


ACTAQA.CCAC 




TACA7GAGCC 


AOT'ITTCGTA 


1500 


GCATATCCAC TCGATGACGT TGGTCTACCT GCATGTAGAA ATGCTGGATA TTGTCCAATT 1560 

TTTGATCAGA GAGATCAATA GTGCGTGTAT TCGGCACAAT CTTTTCTTGG tcaaacttgg 1620 

TCGTGGCACT CATGTAGACC AGTTGGTGGT CACGAGGTGC GTAGTGAGTG ATTTTTTCTA 1680 

CAAAGTGAAT CTGAGAATCA TCTAGTAATT GGTCAAATTC ATCCAGGATG atggtttcca 1740 

CATTCATCAT CTTGATTTTT TTAAGTTTAA TGAGTTCAAA GATACGGCCA GGAGTTCCAA 1800 

TCAGAATTTC TGGCCCCTTT TTAAGACGTT CAATTTGTCG TTTCTGACTT gaacctgaaa 1860 

GGAAGAGTTG AGCAGTCAAT CCGATAGCTT CTGCCCACGT TTTACATACA TCAAAAATCT 1920 

GTCCAGCAAG TTCCGTATTT GGTGCTAGAA TCAAGAGTTG TTGGGCTTTT ttcttttgta 1980 

GTCTGAGAAG ACTTGGTAGG AGATACGCTA GGGTCTTACC AGTTCCGGTT TGGCTCACTC 2040 

CTAGGAGGTT TTCTCCAGCA AGAAGGGGCT CAAATAGTTG AGTTTGAATG GGGGTGAATT 2100 

CTTGGAAACC GAGTTGGTCA CTCAGTTCTT GCCATTCAGT CGGTAGTTTG GTTTTCATTT 2160 

TTCTGCCTCA AATOTAATGC CAGCAGTCTG GCGCATGGTA TATAGTAGCT CATGAACAGA 2220 

GCCTGCATCA TACAGCCAAG TTTGGTAGAG ATTCAGATCT GGTTGCTGGA TCATGTGTGC 2280 
AV, I 1 i L.Akj 


1 \_A L t.GTA l^a 


AGGAGCCTGT 


TGGATAGGAA 


GCTGGACTTG 


2340 


* mm rp rprp/^/; 


1 Lilj 1 l,Vjkjl AA 


AAA i AGCTGA 


GCGAATATGC 


TCAATCGTGT 


TGAGAGTCAA 


2400 




Gl lA^XAlAAA 


Tt„TCGCAAGG 


AAGATTGGAA 


GTGATGTTTT 


TTCCAGCCTT 


2460 




1 G A 1 AG 1 U i G 


GG I AG AaGAG 


GATACCATCT 


CCATTTAGGT 


CAATGCTATT 


2520 




TGAGCATGGT 


AAGTCGCGTC 


ATTGGCTTTT 


CCAAAAAGAC 


GAACAGCAGC 


2580 


A X AV:r AvjOlib A 


TAAATCCCCA 


AATCCATGAG 


GGCTCCACCA 


GCAAAACGGT 


CTGAAAAGAC 


2640 


ATTTG G TGTT 


TGTCCAGCCA 


ACAAGTCAGG 


CATCTTGGAA 


GAGTATTTGG 


CATAGTTGAA 


2700 


ATCTG CTCCT 


AACACTTGCT 


TATCTGCTAA 


AAAGTTTTTG 


ATAGTAGTAA 


AGGCTTTCTC 


2760 


LilLiui AAi i A 


CGAGCTGCTT 


CAAAGATAAA 


ACAGTTATTT 


TTTTCAGCTG 


TTTGAATCAA 


2820 


ATCAAACCAT TCTTGTGGTT GAGAGACAGC TGGCTTTTCG AGAATAACAT GTTTACCAGC 2880 

AGACAAGGCa GCTTTTGCCT GAGCAAAATG TAAGGAGTTT GGACTGGCGA TATAGACTAA 2940 

ATCAAAAGAA GATTTGAAGA AGACTTCTAA TTGATCGAAT AGTTGGATAT TCTGATAGCG 3000 

AGAAGCAAAG GTTGCTGCAG TTTCTAGTTT TCTAGAATAG ATTGCGACCA GTTGGTATTC 3060 

TCCACTGGTA TGGGCTGCTT CTATGAAATG ATGGCTGATA GCGCCAGTTC CGATGACACC 3120 

TAATTTTAGC ATAAATACTC CTTTTCCGAT TTTAAATCCT TCTTTCATTA TAACATAGAT 3180 

AGACGGGACT ATCCAACAGA GAGGAGAAAA TTTCAAATAA GCTATTAGCT TTCTTTTCCG 3240 

AATAAATAGA TAGAAGCATA GAATCTAGCA AACCTAGATT TAAAAATGTG CTATAATAGA 3300 

aggaggaaaa GGAGGATTCT CAGACATCTA GGTATCAGCC CAACTAATGA TTTGTCAATT 3360 

TATCCGCGAT ATGCTGGACT TGCCAGCAAA AAATGTGACG ATTTTGGAGG GAAGTAACAT 3420 


1 ^Ai,v» 1 L 1 i 


CCTTCCATGC 


CCTACTCAGC 


GTAAGATTTC 


TATACTAGTA 


TAGACGTCTT 


3480 


GGCGGAGTTA GATAATGGAA TCCAAGTTAT CATCGAAATT CAGGTTCATC ATCAGAATTT 3540 

TTTCATCAAT CGCCTATGGC CTTATCTGTG CAGTCAGGTT AATCAAAACC TAGAAAAAAT 3600 

TCGCCAACGT GAAGGTGATA CCCACCAGAG CTACAAACAA ATCGCACTAG TATACGCTAT 3660 

CGCAATTGTC GATAGTAATT ACTTCTCAGA TGACCTAGCT TTTCATAGTT TTATAGTAAA 3720 

ATGAAATGAG AACAGGACAA ATCGATCAGG ACAGTCAAAT CGATTTCTAA CAATGTTTTA 3780 

GAAGTATAGG TCTACTATTC TAGCTTCAAT CTACTAGAAA TTCCATAGAT AGAAAACTAC 3840 

ATAATCTCTA CAGATACGGA TGTTGGAGTT GATGTAAGAT GCTTTGGCTT GCTAGAGGAA 3900 

TTGTGGATTG CCAAATTGTA TCATTGAAAT TATTGCTCAA ATTTGTTATG ATATAAATAT 3960 

GAATAAAAGT AGACTAGGAC GTGGCAGACA CGGGAAAACG AGACATGTAT TATTGGCTTT 4020 



GATTGGTATT TTAGCAATTT CTATTTGCCT ATTAGGCGGA TTTATTGCTT TTAAGATCTA 4080 

CCAGCAAAAA AGTTTTGAGC AAAAGATTGA ATCGCTCAAA AAAGAGAAAG ATGATCAATT 4140 

GAGTGAGGGA AATCAGAAGG AGCATTTTCG TCAGGGGCAA GCCGAAGTGA TTGCCTATTA 4200 

TCCTCTCCAA GGGGAGAAAG TGATTTCCTC TGTTAGGGAG CTGATAAATC AAGATGTTAA 4260 

GGACAAGCTA GAAAGTAAGG ACAATCTTGT TTTCTACTAT ACAGAGCAAG AAGAGTCAGG 4320 

TTTAAAGGGA GTCGTTAATC GTAATGTGAC CAAACAAATC TATGATTTAG TTGCTTTTAA 4380 

GATTGAAGAG ACTGAAAAGA CCAGTCTAGG AAAGGTTCAC TTAACAGAAG ATGGGCAACC 4440 

TTTTACACTT GACCAACTGT TTTCAGATGC TAGTAAGGCT AAGGAACAGC TGATAAAAGA 4500 

GTTGACCTCC TTCATAGAGG ATAAAAAAAT AGAGCAAGAC CAGAGTGAGC AGATTGTAAA 4560 

AAACTTCTCT GACCAAGACT TGTCTGCATG GAATTTTGAT TACAAGGATA GTCAGATTAT 4620 

CCTTTATCCA AGTCCTGTGG TTGAAAATTT AGAAGAGATA GCCTTGCCAG TATCTGCTTT 4680 

CTTTGATGTT ATCCAATCTT CGTACTTACT CGAAAAAGAT GCGGCCTTGT ACCAATCTTA 4740 

CTTTGATAAG AAACATCAAA AAGTTGTCGC TCTAACCTTT GATGATGGTC CAAATCCAGC 4800 

AACGACCCCG CAGGTATTAG AGACCCTAGC TAAATATGAT ATTACAAGCG GGGT 4854 



(2) INFORMATION FOR SEQ ID NO: 179: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2186 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 179: 

TAAACAGGTG TTAGGTGCTC TAAACTATTA AAATTCTAAG GAAATAAGGC TACTTTTTCT 60 

GGGTCTTGTT CATAGTAGGT GTGGTTCTTT TTTTCGAGTG TAGCCCATAG CTTTGAGCGC 120 

ATAGTGGATG GTAGTTGGAT GACAGCCAAA TTCAGAAGCT ATTTCAGTCA AATAAGCAIC 180 

TGGATTGTCA GTAAGATAGT TTTTAAGTCT ATCTCTATCA ACTTTTCTTG GTTTTGTTCC 240 

TTTTACTTGG TGGTTTAGCT CTCCTGTTTT CTCTTTTAGC TTTAACCAGC CATAAATGGT 300 

ATTACGTGAG ATTTGGAAAA CGTGTGATGC TTCTGTTATA CTACCTGTTC GCTCACAATA 360 

AGAGAGAACT TTTTTACGAA AATCTATTGA ATATGCCATA AGAAGATTAT ACCACATTGT 420 

GTACTATTTT TGGTTCATTT TACTATATTT CTAAACACTT AGAAATAATA AAACAAATTA 480 

AATATTATTT CTAAATATTT GAAAATAACA TCTATTTGTA TTATACTATC TTTGAGGTAA 540 

CTATTATGAA CTATATCATIA AGACCACATT ATTTAGATTT TTTAAGAAAA CATCGTGACC 600 
GACCAATCAT CAAAGTTGTG AGTGGAGTTA GACGAGCTGG TAAATCTGTG CTTTTTCAAC 660 

TCTATAAAGA GGAGTTACTA GCAACTGGGG TAGACGAGGA TCAGATTATA TTCATCAATT 720 

TCGAAGATTT GAGTTACTAT GATCTGCGAC ATTTTCAAAC ATTATTCGCT TATATAAAAG 780 

ATCAATTAGT TAGCAAGAAA ACATACTATA TCTTTTTAGA TGAAATTCAA TATGTTGAAA 840 

AATTTGAACT GGTAGCAGAT AGTCTATTCA TCTTAGCAAA TGTAGACCTC TATTTGACTG 900 

GATCTAACGC CTACTTTATG AGTAGCCAAT TAGCAAGAAA CTTGACTGGT CGGTATGTTG 960 

AGATAGAGGT TCTTCCTTTG TCATTTGAAG AATATCTATC AGGTCAATCT CTCACAGAGA 1020 

ATCTGAATAC AACAGAAATT TTTAACAATT ATCTCTTTAG TGCTTTCCCT TACTTATTGC 1080 

AAACATCATC TTACGATGAA AAAATTGACT ATCTCAGAGG AATATATAAC TCCATACTGT 1140 

TAAATGATAT TGTCACTAGA TTGGGAAAAC CAAATCCTAC TATTATTGAG CGCATTGTCC 1200 

GAACCCTTCT CAGTAGTACA GGTAGCTTAA TATCAACAAA TAAGATTCGC AATACCCTAG 1260 

TCAGCCAAAA TGTTTCAATA TCCCATAATA CTTTGGAAAA TTATTTGACA ACTTTGACAG 1320 

ATAGTTTACT TTTTTATTCC GTTCCACGTT TTGATGTAAA AGGTAGAGCA TTATTGCAAC 1380 

GTTTAGAAAA ATATTATCCC GTTGATTTAG GTTTACGACA TCTCTTATTA CCAGACCAGA 1440 

AAGAAGACAT TAGGCATATC TTGGAAAATA TGGTATATTT GGAATTGAGA CGTAGATATT 1500 

CACAAGTATA TGTTGGTAAT TTAGATAAGT ATGAGGTTGA TTTTGTTGTT GTAACTGATC 1560 

TTGGCCACTA CGCTTATTAT CAGGTCAGTG AAACAACACT TGCTCCAGAA ACACTAGAAA 1620 

GAGAACTTAG ACCACTAGAA GCCATTAAAG ATCAATTCCC TAAATATCTA TTAACAATGG 1680 

ATACGATTCA GCCAACAGCC AATTACAATG GAATCGAGAA GAAAAGCATT ATAGATTGGT 1740 

TACTAGAAAA ATAGATAAAT ATAAATCATA CAGCTAATTA GATTTGCAAC AGTCTGTTAT 1800 

CAATGATTCT ACCCAAATCC TAACAAGATA TAGTGAATTT CGAATACGCT ATATAATACG 1860 

GACACTTGAA AATAGAAATT GGGGATGAAA GGGGATCTAT AATTTCTGGA AGTACTATCA 1920 

AAAATTAATA TCATAGTCTT ATTAGAGAAT AGCATCACCC ACTTTCTCAA ATAAGATTAA 1980 

ATTGTAACTG AATTATAATG AAAAAGAGAC TGAGCAATCA GTCTTTAAAA TCAGAAAAGC 2040 

GCATAGTATC AGGTATTGAA CAACCTTGAT AATATGCGTT TTATTATGGA AATATTTGCT 2100 

TCATTTTCTC CTGAAATAGA GCTTTTGCTA TCCTATTTTT CTCTATTTCT AATGATTTAC 2160 

TTCAACTTCT TACCTCTTGG GAAAAA 2186 
(2) INFORMATION FOR SEQ ID NO: 180: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3236 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 180: 

GTCACACGTT TGACTTCACG TATTTCATAA GTATAAACTT TATTTTTATC GGTTAGATAA 60 

ATCTTCATGC CATTTTTAGC ATTATCTAAA GGAGAAAATA ACATTTTATT AGCATTATCA 120 

ACACCAAAGA TATGGTGACT AGCTAGACTA TAATTTCCTT CTCCCATTAC TTGCTCGCGT 180 

TTCATTGTAC CAGCTCCGTA GAAGAGATTA ACATTATCAA GTCCTTTAAA AATCGGCAAA 240 

TTCATTTCCA ATTCAGGAAT TGCAATTCCC CCAATAACTG GTAATTTTTG AGCATCCCAT 300 

TGAGAAGTTA GAACAGCTTC CGAAGAGATA GCTTTGACAG AATCAAAGTC AAAATTGCCT 360 

TCTGTATCCT GATTTTCTTC TAATTTTTCT TTTGATACCT GGCTAACTTG ATACTTATTG 420 

GTATTCCAGA CTATGAAAAT ATTTCGAATT TGAGTATTAA AAATCAAAGC CAGTGACAGT 480 

AATATCAGAA ATCCTGCTAG GATATTTGTC AGCAGATTTT TTCGCTTGTT TTTCTTTTTA 540 

TTATTTTTTT GAGACATTAT GCTTCACCTT CTGTTTCGTT TTCTGTCCCA ACTTCTTCTT 600 

TTTCTGCCAC CGCAACCGTT GTGAAAGTCA CTATCTGAGC ATCTTGATCC AGGCGCATTA 660 

CTTTAACTCC CATAGTTGCA CGTCCTGTTT GTGAAATATT GGCAAGATTG GTTCGAATCA 720 

TGACACCTCT ATCAGTGATA ATCATCAAAT CeT'CATCCCC TTGAACAGTC ATAAGACCGG 780 

CCAGCAAGCC ATTTTTTTCG GTAATTTTAG CTGTCTGCAT TCCCTTACCA CCACGACCTT 840 

TTGTTGGGTA TTCAGTAGCG ACTGTACGCT TACCATATCC TTTTTCTGTG ATAATAAGAA 900 

CCTCATCTTG ATCAGTAATC AAGCTGGCAC CAACAACTGT GTCTCCTTCA CGAAGGTTAA 960 

CACCTTTCAC ACCAGTGGCG ATACGGCTCA TACCACGAAC GGCTGATTGA TTAAAGCGAA 1020 

CTGCATAACC AAACTTGGTA CCAATGATAA TATCCATATC TCCTTCTGCC AACAAGACAT 1080 

TGATTAACTC ATCTTCATCC TTTAAATTCA GCGCTTTGAG ACCATTTTGA CGAATATTGG 1140 

CAAACTCCTT AACACTGGTT CTCTTCACAA TACCGTGACG GGTTGTAAAG AAGAGATAAG 1200 

CATCATCACT GCGATCAGAC TCAACATTGA TAACCGTCTG AATACTTTCG TCTTCATCCA 1260 

ATTTCAAGAG ATTGACTACT GGTAGCCCTT TGGCAGTCCG ACCATACTCA GGAATTTCAT 1320 

AACCTTTAAG ACGATAGACA CGTCCCTTGT TTGTGAAGAA GAGCAGATGA TCATGGGTGC 1380 

TAGTTGACAC TAACTCACGA ACAAAGTCAT CATCTTTCAC TCCCGTTCCT TGGACACCAC 1440 

GACCCCCACG TTTTTGAGCA GTGAACTCGT CCTGATCCAA ACGCTTAATG TAGCCTCTGT 1500 

TAGAAAGGGT AATCAAGACA TCCGATTCTT CAATCAAGTC CTCATCCTCG AGACTCAAGA 1560 
CCTGTCCAAT CATCAACTCT GTACGGCGCT TATCAGAAAA TTTACGTTTA ACTTCATCCA 1620 

ATTCGTCTTT GATAATTTGA GAAACACGTT CAGGCTTAGC AAGAATATCT GCTAAATCCG 1680 

CAATCAGAGC CAAGAGGTCA TCATACTCAG ATTGAATCTT ATCGCGTTCC AAACCTGTCA 1740 

AACGACGAAG ACGCATATCA AGGATAGCTT GACTTTGACG TTCAGAAAGC TTAAACTTGC 1800 

TCATCAACTC AGCTTGAGcT TCCGCATcCG tTTCACTAGC ACGGATGATA CGAATCAyTC 1860 

GTCGATATGG TCTAGCGCAA TCAAGAGACC TTCTAAGATA TGAGCGCGCG CTTCCGCTTT 1920 

TTCCTTATCA AAACGTGTAC GACGAACAAC CACTTCTTTT TGGTGCTCGA TATAAGCATC 1980 

CAAAATCTGA CGAAGAGACA AAATTTTCGG TATACCATTT TGGATAGCGA GCATATTGAA 2040 

ACCAAAATTG GTTTGCATTT GGGTCATTTT GAAGAGGTTA TTGAGAATAA CATTGGCTGA 2100 

GGCGTCGCGC TTGACTTCAA TAACAAATCG AACACCTTCA CGGTTTGACT CATCACGTAC 2160 

TGCTGTGATA CCCTCAATGC GTTTTTCCTG AACCAAGCGA ACAATATGCT CATGCACCTT 2220 

GGTTTTATTG ACCATGTAAG GAAATTCTGT TACAACGATA CGCTCACGAC CAGTCTTAGT 2280 

CGTTTCAATC TCTGTACGAG AACGTAGGAC AATCGAACCT TTACCTGTTT CATAAGCCTT 2340 

ATGGATACCT GATTTCCCCA TGACAAGAGC ACCAGTTGGA AAATCTGGTC CAGGCAAGAC 2400 

TTCCATCAAG TCCTTGGTAG TCACTTCAGG ATTATCCATG ACCAACTTCA CTGCATCAAT 2460 

gGTTTCACCC AGATTATGAG GTGGAATATT GGTTGCCATC CCAACCGCGA TACCAGTTGC 2520 

TCCATTAACC AAAAGGTTTG GAAAACGCGC TGGCAAGACC AAGGGTTCCC GTTCATTGGC 2580 

ATCATAGTTA TCAACGAAAT CAACTGTATT TTTGTTGATA TCACGAAGCA TTTCCAGAGC 2640 

AATCTTGCTC ATACGTGCCT CGGTATAACG TTGAGCGGCA GCACTATCTC CATCCATGGA 2700 

ACCAAAATTG CCATGACCAT CTACAAGCAT GTAACGGTAG CTCCACCATT GAGCCATACG 2760 

GACCATGGCT TCATAAATAG AGGAATCCCC GTGTGGGTGA TATTTACCCA TGACATCCCC 2820 

TGTAATACGA GCAGATTTTT TATGGGGTTT GTCTGGGGTC ACACCCAATT CATTCATTCC 2880

GTAGAGAATG CGACGGTGAA CAGGTTTTAA GCCATCTCGA ACATCAGGAA GAGCTCGCGC 2940 

TACGATAACA CTCATGGCGT AGTCGATAAA ACTTGCCTTC ATCTCCTTTG TCAGATTGAC 3000 

ATTCACTAAA TTTTTATCCT GCATTAATAA ATGCCTCATT TCACAATTAG TAAGTAACAA 3060 

CATTATACCA TAAATTCCCA TCTATTTCAG CCTCTAAACC ACTAAAACGT TTACATCGAG 3120 

AACTATAAGG CATATTCGTG ACAAAGTTTT TTAAAAGTGA TAGAATGAAG TTGTCTAGGG 3180 

AAAACCCCTA ATAGAATAAG GAGATGGTTA nACAATGACT CTGACTAACA CACAAA 3236
(2) INFORMATION FOR SEQ ID NO: 181:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 8651 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 181:

AGGTCCTGAA GTATTGGAAC AGGAAGGTCA AGAGTTTTTG GAACATTTCA AAAAACTCTT 60 

GGAGTCAGTT GAAGTAGTAG CCATCTCAGG TAGTCTGCCA GCTGGCCTTC CAGTTGATTA 120 

CTATGCGAGC TTGGTAGAAC TTGCTAATCA AGCTGGCAAG CATGTAGTCT TGGACTGCTC 180 

AGGTGCAGCA CTTCAGGCTG TTCTTGAATC ACCCCATAAA CCAACAGTCA TCAAACCAAA 240 

TAATGAAGAA TTGTCTCAGC TTCTTGGAAG AGAAGTTTCT GAGGATTTGG ATGAATTAAA 300 

AGAAGTACTT CAAGAACCTT TGTTTGCAGG GATTGAATGG ATTATCGTTT CACTTGGTGC 360

CAACGGTACT TTTGCCAAAC ATGGTGACAC TTTCTACAAG GTAGATATTC CTAGAATTCA 420 

GGTGGTAAAT CCTGTTGGAT CTGGAGACTC TACTGTGGCA GGAATTTCTT CAGGACTTCT 480

TCACAAAGAA TCGGATGCAG AATTACTCAT CAAGGCAAAT GTCCTTGGTA TGCTCAATGC 540 

TCAAGAAAAA ATGACTGGTC ATGTCAACAT GGCCAACTAT CAAGCTCTAT ATGATCAATT 600 

AATAGTAAAA GAGGTATAAA ATGGCTTTAA CAGAACAAAA ACGTGTACGC TTAGAAAAAC 660 

TTTCTGATGA AAATGGTATC ATCTCAGCTC T-TGCAT'ri'LJA CCAACGTGGT GCTTTGAAAC 720 

GCCTCATGGT TAAACACCAA ACAGAAGAAC CAACTGTGGC CCAAATGGAA GAACTTAAAG 780 

TCTTGGTAGC AGATGAATTG ACTAAATATG CTTCATCTAT GCTTCTTGAC CCTGAGTATG 840 

GACTTCCAGC AACTAAAGCT CTTGATGAAA AAGCTGGTCT TCTCCTTGCT TATGAAAAAA 900 

CAGGTTATGA CACAACAAGC ACAAAACGCT TGCCAGACTG CTTGGATGTT TGGTCTGCAA 960 

AACGTATTAA AGAAGAAGGT GCAGATGCAG TTAAATTCTT GCTTTACTAT GATGTAGATA 1020 

GCTCAGACGA ACTCAATCAA GAAAAACAAG CCTACATCGA ACGCATCGGT TCTGAGTG'J'G 1080 

TGGCTGAAGA TATCCCATTC TTCCTTGAAA TCCTTGCTTA CGATGAAAAA ATTGCGGATG 1140 

CAGGTTCTGT AGAATACGCT AAAGTAAAAC CACACAAAGT TATCGGCGCT ATGAAAGTCT 1200 

TTTCAGACCC ACGCTTTAAC ATTGATGTTT TGAAAGTTGA AGTTCCTGTT AACATTAAAT 1260 

ATGTTGAAGC kTCGCTGAAG GTGAAGTAGT TTATACACGT GAAGAAGCAG CAGCCTTCTT 1320 

CAAAGCGCAA GATGAAGCAA CGAACTTGCC ATACATCTAC TTGAGTGCTG GTGTATCAGC 1380 

TAAACTCTTC CAAGATACTC TTGTATTTGC TCATGAATCA GGTGCGAACT TTAACGGAGT 1440 

TCTTTGTGGC CGTGCTACAT GGGCAGGATC AGTTGAAGCT TACATCAAAG ATGGTGAAGC 1500 
AGCAGCTCGC GAATGGtCGC ACAACTGGAT TTGAAAACAT TGACGAACTC AACAAAGTTC 1560

TTCAAAGAAC AGCAACTTCA TGGAAAGAAC GCGTGTAAGA AAGTCCTCCT AGTTTAGGAA 1620 

CATGAATCTA AAAAAATTTA AAAAAAGTTG TATGTAAAGG CTTACAAAAT AACTTACTTG 1680 

TGCTATACTT AAATCACAAG TTAATATGAA TTAGAAAGTA ACTATATGAA GTATAATAAA 1740 

AATAGGATAT AGTTTATTTT ACGAGCTAGG AAGGAAAAAT ACGGAAACAA TATTGCCAGA 1800 

ATAAACTATA TTTAGATGCA CATTTCATTC ATTGTTTTAT AAAAGGAGAA GATAAACGGC 1860 

TACTAAAAAG AGTTTTAAAG CGTTAGTTGT AGGACTAGGT ATTGTTTCAA TATTCTTATC 1920 

AGCCTTACCT ATGGTTAGTG GTTCTGTATT TGCAGATAGT GCCCTAACTA CAGTAGATAA 1980 

AGCAAATGAT ATTGTTTTGA ATGTTGATGG GAATAAATTT TATAATGTTT CGGTTTCAGA 2040 

AGATATTGTA AATGCTGGTC AAATTTTGGA AGATTATTTT TATGTAGATA AATTTGGAAA 2100 

TATAAATTTA AAAGGCACTC CTGAAGAGTT AGCAAAAAAT ATTGGTATTT CTGTACAAGA 2160 

AGCAAGTTTG ATGTATGGAG CTGTAAAAGA GTTACCCAAC GTTTACGAAA GAGGTCCTGT 2220 

AGGTTTTCGT TTCAATCTTG GTCCTCAAGT GAGGGGGATG GGTGGCTGGG CTGCTGGAGC 2280 

TTTCGCTACT GGATATGCTG GATGGCATTT GAAACAATTT GCGGTTAATC CTGTTACATC 2340

TGGATTTGTT GCTGTAATAA GTGGTGCGAT TGGCTGGGCT GTAAAAACTG CTGTAGAAAA 2400 

TTATTGGACA GTTGCTGTAG CTACAGTAGA AGTGCCGTTT GTGAACCTTG TTTACACCAT 2460

AGATTTACCT TAGAGGTTAT TTCTTTATGA ATCATTCTTT TAAAAAAATA ACTGTATTTT 2520

GTTTTATAGT TTCTTGTGTT CTTTGTTTAT TAGACTTAAT GAATTTTAAA AATGTAGCTA 2580

CTTTTTTATT TTTCTGTCTT CCTGTTTTTG TTTTGATTTA CAAAAATAAA TAAAAACAGA 2640

GCCTCTGTTT GATGAATTTT AGAACATAGT TAAGTTTTAA AAAAAGTTGT ATGTAAAGGT 2700 

TTACAAAATA ACTTACTTGT GCTATACTTA AATCACAAGT TAATACAAGG TGAGTGTTAC 2760 

TAAGTAATAT TAGGCATGAT CACAGGTGAA TTAGAAATCA GCTGATTTTC TAGTTCATTT 2820 

GTGGTCATTT TTTGTACTTA TATACCTTTA AGATATAAAA GGAGGTTGAC ATGTATCGAA 2880 

TTCTAAATCC AATGAATCAC AATGTCTCGC TTGTCAGAAA TGATAAGGGA GAAGAGGTGA 2940 

TTGTAATTGG TAAGGGAATT GCATTCGGAA AGAAGAAGGG GGATTTGATT GCTGAAAATC 3000 

AGGTTGAGAA AATCTTTCGG ATGAAGACCG AAGAGTCCAG AGAAAACTTT ATGGCTCTTC 3060 

TCAAAGATGT TCCGCTTGAT TTTATCACAG TGACCTATGA AATCATTGAT AAGCTATCAA 3120 

AGAAATATCA TTATCCGATT CAAGAGTATC TCTATGTAAC CTTGACAGAT CATATTTACT 3180 

GTTCTTATCA AGCTCTAACT CAAGGAAGGT ACAAGGATAG TAATCTGCCA GATATTTCCG 3240
CTAAGTATCC TGTCGCTTTT CAAATCGCAA ATGAAGCTTT TGAAATTTAC CGTCAGAAGC 3300

TAGCAGATCA TTTTCCTGAG GACGAAATTA TTCGGATTGC TTATCATTTC ATTAATGCTG 3360

AAGGTGAAAA TGAAGTGGAA CTTGTGGAGT CGATTGATAA GAGGAAAGAA ATTCTCAGGA 3420 

ATGTTGAAGA AGTTTTAACG GACTATGCAA TTCAACGAAC TAAAAAGAAT AACCATTTCT 3480 

ATGATCGCTT TATGATCCAT TTGAATTATT TCTTGGATTA TTTAGACAGA TCTAGAGATG 3540 

ATAACCAATC ACTTCTGGAT ATGGAAGATC ATATTAAACA ATCCTATCCA AAAGCCTTCG 3600 

AGATTGGTTC CAAGATCTAT GATGTGATTA CGCAACATAC GGGTCTTGAT TTGTATAAAA 3660 

GTGAACGAGT TTATCTAGTT CTACATATCC AACGTTTATT GTCATAAAAA TTTATTTAAA 3720 

ACTATATAAG GAGAATTCTA TCATGAATAG AGAAGAAGTA ACATTGTTAG GTTTTGAAAT 3780 

CGTAGCCTAT GCTGGCGATG CTCGTTCAAA ACTATTGGAA GCCTTGAAGG CTGCTGAAGC 3840 

TGGTGATTTT GAAAAAGCGG ACGCTCTGGT AGAGGAAGCT GGTAGCTGTA TTGCAGAGGC 3900 

TCACCACGCG CAAACAAGTC TATTGACTAA GGAAGCTTCA GGTGAGGACT TGGCTTATAG 3960

TGTAACCATG ATGCATGGCC AAGACCACTT AATGACAACT ATCTTGTTAA AAGATTTGAT 4020 

GCATCATTTA ATTGAACTCT ACAAGAGAGG AGTTCAATAA TGAATAAACT AATTGCATTT 4080 

ATCGAGAAAG GAAAGCCTTT CTTTGAAAAA CTATCTCGTA ATATCTATCT TCGTGCTATT 4140 

CGTGATGGTT TCATTGCAGG TATGCCTGTT ATTCTCTTCT CAAGTATCTT TATCTTGATT 4200 

GCCTTTGTAC CAAACTCATG GGGCTTTAAA TGGTCTGATG AAGTTGTAGC CTTTCTGATG 4260 

AAACCTTATA GCTATTCTAT GGGTATTCTG GCTCTCTTGG TAGCTGGTAC AACAGCTAAG 4320 

TCATTGACTG ACTCAGTAAA CCGGAGCATG GAAAAAACCA ATCAAATCAA GTATATGTCA 4380 

ACATTGTTGG CAGCAATTGT TGGTTTGTTG ATGTTGGCAG CTGATCCTAT CGAAAGTGGT 4440 

CTAGCTACTG GATTCTTGGG GACAAAAGGT TTGCTTTCAG CCTTCCTTGC TGCCTTTGTT 4500 

ACTGTAGCCA TCTATAAGGT TTGTGTTAAG AACAACGTCA CTATTCGTAT GCCTGACGAA 4560 

GTTCCACCAA ATATCTCACA AGTCTTTAAA GATGTGATTC CATTCACTCT ATCTGTTGTT 4620 

TCTCTTTATG CTCTTGACTT ATTAGCACGT TATTTTGTTG GTTCTAGTGT GGCAGAATCA 4680 

ATCGGTAAAT TCTTCGCACC ACTCTTCTCA GCAGCAGACG GATACCTTGG TATTACCATT 4740 

ATCTTTGGTG CCTTTGCCTT CTTCTGGTTT GTTGGGATTC ATGGTCCATC TATCGTTGAA 4800 

CCAGCTATCG CAGCTATTAC CTATGCCAAT GCCGAAGTTA ACTTGAACCT TCTCCAACAA 4860 

GGGATGCATG CAGACAAGAT TCTTACTTCT GGTACACAAA TGTTTATCGT TACCATGGGT 4920 

GGTACAGGTG CGACATTGGT CGTTCCATTT ATGTTCATGT GGTTGACAAA ATCGAAACGT 4980 

AACCGTGCAA TCGGACGTGC TTCAGTAGTT CCTACCTTCT TCGGTGTAAA TGAACCAATC 5040 
TTGTTTGGTG CACCTCTTGT TTTGAATCCA ATCTTCTTCA TTCCATTTAT CTTTGCTCCA 5100

ATTGCAAACG TATGGATTTT CAAATTCTTT ATTGAAACTC TTGGAATGAA CTCATTCACT 5160 

GCTAATCTAC CATGGACAAC TCCAGCTCCA CTAGGTCTAG TTCTTGGAAC TAACTTCCAA 5220 

GTGCTATCAT TCATTCTTGC TGCCCTTCTA ATCGTGGTTG ACGTTGTCAT TTACTATCCA 5280 

TTCCTTAAGG TCTATGATGA ACAAATTCTT GAAGAAGAAC GTTCAGGTAA GTCTAATGAT 5340 

GAATTGAAAG AAAAAGTTGC TGCAAACTTC AACACTGCAA AAGCGGATGC TATTCTTGAA 5400 

AAAGCGGGTG TCGATGCAGC ACAAAATACC ATCACTGAAG AAACAAATGT CCTCGTTCTC 5460

TGTGCAGGTG GAGGAACAAG TGGTCTCCTT GCAAATGCTT TGAATAAGGC AGCAGCAGAA 5520 

TACAATGTCC CTGTGAAAGC AGCAGCAGGC GGCTATGGTG CTCACCGTGA AATGTTACCA 5580 

GAGTTTGATC TTGTTATCCT TGCCCCTCAA GTTGCTTCAA ACTTTGAAGA TATGAAAGCA 5640

GAAACAGATA AGCTCGGTAT TAAACTAGCG AAAACAGAAG GCGCTCAATA CATCAAATTA 5700 

ACTCGTGATG GAAAAGGTGC TCTTGCATTC GTACAAGCGC AATTCGATTA AGGCTAGAGA 5760 

CTCTGAAATA GTCTCCCATC GTTACGGAAA TCGCTATGGC GAATTTCCTA TTATTAATTC 5820 

GTCGGTAAAA AGATATCGTT TTTACCTCCT CATGTCACAA TTCGGTGACT TGGTACAAGA 5880 

AGTGAGATGG AGAAGGATGG CTCACTGACT CCTCTCCTCT CACTTTTACT TTATTTAAAT 5940 

CAAGAAATAG GTGAAAAAAA TGACAAAAAC ACTTCCAAAA GACTTTATTT TTGGTGGCGC 6000

AACAGCTGCT TATCAAGCAG AAGGTGCTAC ACATACTGAT GGAAAAGGAC CAGTTGCTTG 6060 

GGATAAATAT CTTGAGGATA ACTACTGGTA CACTGCCGAA CCAGCTAGTG ATTTTTACAA 6120 

TCGATATCCA GTTGACCTCA AGCTAGCAGA AGAGTATGGT GTCAATGGTA TTCGAATTTC 6180 

TATTGCTTGG TCACGTATTT TCCCGACTGG TTACGGCCAA GTAAATGCTA AAGGTGTTGA 6240 

GTTTTATCAT AATTTATTTG CAGAGTGTCA CAAACGTCAT GTTGAGCCTT TTGTAACTCT 6300 

TCATCACTTT GACACGCCAG AAGCTCTCCA CTCAAATGGA GACTTCTTAA ACCGTGAAAA 6360 

TATCGAACAT TTTGTAGACT ACGCTGCCTT CTGTTTTGAA GAATTTCCAG AAGTAAACTA 6420 

TTGGACAACC TTTAATGAAA TTGGACCAAT CGGTGATGGT CAATATTTGG TTGGGAAATT 6480 

CCCTCCAGGT ATCCAGTACG ACCTTGCCAA AGTCTTTCAA TCACACCACA ATATGATGGT 6540 

GTCTCATGCA CGCGCGGTAA AATTGTACAA AGAGAAAGGC TATAAAGGGG AAATTGGTGT 6600 

TGTTCACGCC CTGCCAACTA AATATCCTCT AGATCCTGAA AATCCAGCAG ATGTTCGTGC 6660 

AGCTGAGTTG GAAGATATCA TCCACAATAA ATTCATCTTA GACGCAACTT ATCTAGGTCG 6720 

CTATTCAGCT GAAACCATGG AAGGTGTCAA CCATATCTTA TTAGTCAATG GTGGTAGTTT 6780 
GGATCTTCGT GAAGAAGATT TTACAGCATT AGAAGCTGCA AAAGACTTGA ATGATTTCCT 6840

AGGAATCAAC TACTATATGA GTGACTGGAT GGAAGCCTTT GATGGAGAAA CTGAAATTAT 6900 

CCATAATGGT AAAGGTGAAA AAGGAAGCTC TAAGTATCAA ATCAAAGGTG TTGGTCGTCG 6960 

TGTAGCTCCT GACTATGTAC CACGCACGGA TTGGGATTGG ATTATCTACC CTCAAGGTTT 7020 

GTATGACCAA ATCATGCGTG TGAAGAAAGA TTATCCTAAC TACAAGAAGA TTTACATCAC 7080 

TGAAAATGGT CTCGGCTATA AAGATGAGTT CGTTGATAAC ACTGTTTACG ATGATGGTCG 7140 

TATTGATTAC GTGAAGCAAC ACTTGGAGGT TTTATCTGAT GCGATTGCAG ATGGAGCTT^ 7200 

TGTAAAAGGT TACTTCATTT GGTCATTAAT GGATGTCTTC TCATGGTCAA ACGGTTATGA 7260

GAAACGTTAT GGTCTCTTCT ACGTAGATTT TGAAACTCAA GAACGTTATC CTAAGAAATC 7320 

AGCTCACTGG TACAAGAAAG TAGCGGAAAC TCAGATTATA GACTAGTAGA ATTAGTCATT 7380 

AGATATAGAA TTTTAGTGAG TCAAAAAGAT GTTCAAAGAT TTTATCCAAT CTATTTATGA 7440

AAAAAAGTTT ATATTATAAA TTTCGAAAAA TGCTCTCAAA TACCGTGTTT GACGAGTGAA 7500

GAATTGAAAA GTCTTGGAAA ATGGTATGTC TCGACTGGTA AAGAATGGAT TTGTCATTCA 7560 

GATGATGAGC TGGAAGAATT TAAAAATCTA TTTTTAAATT TTATCAATCC TGAAGAATGG 7620 

GATACTATCT CCTTTGATTC AGATTTTATG CCGTTTCAAC AATCGTAACC AATTTCTCAA 7680 

AAAAGTTAAA TCTTATATTT AGTACTCTGT AAAACTCTTA TCTAATCACG TTGCTTATAC 7740 

Ti-AA-ltiAAAA TCAAAGAGCA ACTTTAAACT AGGAAGCGAG TCGCAGATTT CTCAATGCAT 7800 

AGCTTTGAGG AATTGGGCAA AAAGTCTTTG ATATAGAAAA ACGCATAGTA TCAGGTGTTT 7860 

CAACACCTGA TACTATGCGT TTTATTGTGG GAAGATTTAC TTTTTTTCTT CTGAAATTGA 7920 

GTTGTTACCC AGGCTCTTTC AGTTTATTAA GGCTTGATGA CTTTAATGTG TTTAGATAGC 7980 

TTAAAAAGGA TTGAATCACT TAGTTTAGAA TCTGAAACAA TAGTATCAAG ATTTGATACA 8040 

TTATAAAAAG TATAAAAATC AAACTTATTG AACTTGCTAT GATCTGCGAG TAAATATTTT 8100 

TTATTAGAAT TATTTAAAGC GATGCGTTGA GCCTCTCCCT CTTCCTCGCT AAAAGTAGCT 8160

AGAGCTCCGT TTTGAATACC ATTACAGCTA ACGAAAGCTT TAGAAAATTG GAGATTAGAG 8220

AGATTTTGTA GGGTCAATGT ACCAACAAAA GCACCTGTAA TATCGCGATA ATTTCCACCT 8280 

ATTAAAATCA AATCTGTTAA TTTTCGTTCG CTTAAAATCA GAAAAACAGG TAGACTGTTG 8340 

GTTACGACGC GGATATTGTC AATAGGCAAC TCACGCGCAA AAAACTCTAA TGTTGTTCCT 8400 

GGTCCAATGA AAATAGTTTC TCTTTCTTCT ACTAGACTGC CTGCAAAATG GGCTATTTCT 8460

TGTTTTTCTG CCGTTTGGAG GGCTTGTTTT TCAATATTTG ATCGCTCATT AGTCAAAAGG 8520 

GAGTTGGTTC GAAGTTTTTC AGCTCCACCA TGCACACGAA TCAGCAAATC TTTATCAGCT 8580 
AATTCCTGTA AATAGCGCCT TGCAGTCATA TCTGAAACGG CTATTTCGTC CATAATCTGT 8640
TTAACTGTTA T 8651 
(2) INFORMATION FOR SEQ ID NO: 182: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3786 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 182: 



AATCTCCAAT CAGTGCCACT TCAGCTACAA AGAAGAGGAG GATAATAACT CCGTTCACAA 60

GGACAGACAA GAATAATTGA TAGAAGGAGT CGGTTTCACT TGCTTGACTT GGTCTTGTAA 120

TGATwTGGAG ACTGGCAAGC AGAATGATTC CAATGCTAAT CACACACAAG AGGGCTGTAA 180

ATCGTAGGCT ATCAAAGAAA GCAAAGAAAC TAGCAATAGC AGTGAGGAmG ATTGGAATTG 240

CCAAGAGTTG ACTATATTGT TGGAGAACCT TGTCTAGCGT CCAGTCCTTT TCCTGGTGGA 300

TAAATCGTCT CACAACGAAA CTACCCAAGA GGAATGAAAA GAAGAAGAGT GTTGTCGCTA 360

CTAGGATAGA GATGATAGAA AAAAGAGTTA AAGGAGCTAG CTGCTCAGGG AAGCGACTGT 420

TAATGCTTGC TATATGTCCA TAGTAAGCAT GTTTGATGTG ATAGATACTA AAGAAAAAGG 480

AAGATGCAGA AAACAGAATG AGCAAGAGAA AGGCTGTGTA ACTGTGTGTG ATACTTGTTT 540

CCAACTTACT TGTAGGAGAT TTGATCGCTT CCACTAGCCA AGACCAAAAA TCAAGCACTT 600

GCTCTTTCCA TTTATCCCTA GATTTTGGAG CTTGGTCGGG GATATAAGGA CTTTCTAAAG 660

ATTTACTGAT AAGAAGTGGC TCTTTCGTGG TTGCTTTTTG CTGAGGAAGA GCTTCTTGGC 720

TCTCTTCAGC TATAGTGACT TTTTCTGTTT CTTTAGAAAG GTCTGGCTCT TCTTCAGTAG 780

AATTAGATGC CTTCTTTTCT TCTATTTCTG TTCTCGCTTC ACTGTCTTCA GGAGCTTCAA 840

TTTTCTCTTC TTGCTGGCTT TCCAATTCGA CTTCAGCTTG AGGGACTTCC TCCTCTAACT 900

GAGTATTTTT TTCAATTGGT GTATCGAGAT CGGCTATCGT TTCTTCAGCC TTGTCTGCAA 960

CCTCTTGAGC TTGCTCTTCA GGCTTGTTCT TGCTTGTTGT TTTTACAAAA TCATTACTTT 1020

CAAACCATTC TTGTTTCATG GTAGAACCTC CTTTTTAGTT AGATAAATAT GTTTCCATAG 1080

TAGCAAATGT AAGCGTTTTT GTCAACGTCT GCTTGGTGTG GATATTAGAT CAATATTATC 1140

ATCAGATCTC GCAATGAGTT GATCCTTGAC ATCGGTTTTT TCAGTTTTGT AAGGGTTGCT 1200

TAATTCCGTA CCTCTTGATT CAGGCTTTTC TCTTGTGAAT TGGAAGATAG AACCATAGTT 1260
GCTTGAGATG TCCCAGTTAA TTCGTTGGCT TTCTTTCTGG TCTAGGATGA TTCTGAGATA 1320

ATCTTTGGCA GTCAGTTCAA CCTTGCCATG GACTTGGATA TTTTCAGCGT GGAAGTGATT 1380 

CTCTGTTGAC TCTAGCTGAC TATCTGTAAG AACTGTATCA AAGATATTAA CGATATTGGG 1440 

CGTTGTGAGT TTACTGTTTT TGATACGACT TCCTTCAATT CGGAGGATAT AGCTGTTTGT 1500 

ATTGAGGGTC GCATTTTCAA GGCTAGCATT TATGATGGTG GTTTGTCCGC GATTGGCTGA 1560 

GATGTTGATC CCTTTTAGAG TTCTCCCTTT TGGTAGTCGG AGAATAACTT CTTCAAAACG 1620 

ACTAGAGTAG CTACTTGCGA TATGAAGAAT CCCACCAATT CCAGAAGAGA GAAACGGAGT 1680 

TTCAGACAGT TTCTTATCAG TGAGACTCAG AGTTCTATCG TTCTGATTGG TGATAAGATC 1740 

ATGGTGAGCA GAAAGAGATG GATGGTAAGA AATGTGGATT TGATCATCGA AAGAGTCTGT 1800 

GATGGTGAGC GTGTGTTGGT GGAGAGTAAT TTCTAGGTTT TCGACTTCCT TGCCAAAGGT 1860

TAGCTTTTCC GTACGGCTAT CATAGACAGG TTCTTTGGAC ATGGAAAGTA GGCTCTTAAt 1920 

CCCGTCAGAT TGGATACCTA CAAAAAGCAG GATAAAGCCG ATAACGGTAG TCACCACACC 1980 

AAAGATGAGA AATCCTTTTG TCCATTTACG CATGCTGATT ACCTCTCTTT CCTTTTTTAA 2040 

GAACAAATTG TACCAGACGA ACAATGAGTA GACCGAAGAA GCGAGTTGCA TAGGAAATGC 2100 

CAAGTAAAAC TAGCGAAGAA GCACCGATAG CCAGTAAACC AGAACCAAAA ATCAAGATAA 2160 

AGGCTGATTT GGCTTGGGCG AGGACAGTGA AACTTTCAAC TAAAAATAGG AATCCGCCGA 2220 

TGATACCCAG TATGGAAACT GCAAAGAAAG CCAGAATGAC AGTCAAAGCG GCTACAAGAA 2280 

TTGCGAACAG GGTCACGAGG ATGGCGATTC CCAGAGGAAT GCCGATAGGT GCTGCAAGGA 2340 

GGGCTAACAA GGCGATATGT AAAATTTGTC GGTTATTTTT TTGAGCGGGT GCTTCATTGA 2400 

TTTTTTTATC GAGAAGATTG GATAGAACTT CGTGGGCCGC TTCTTTGGGA GTTCCCAAAC 2460

TAGCGATGAG TTCTTCTTCT CCTTCGACTC CAGCATCGTC AAAGAGCTCT CTGAAATAGT 2520 

CCATGGCTTC GATACGGTCA GCTTCAGGTA GTTTCTTGAG ATAGAGTTCT AGCTGAGTCA 2580 

GGTATTCAGT TCTTGTCATG GCGGATACTC CCTTCTATGA TGCCATTGAT GGTGTCTGTA 2640 

TAGAGTGCCC ATTCATCTTT TAGGGTCAAG AGCTGCTCTA TACCACCGTT TGTCAAGGAG 2700 

TAGTATTTGC GCATGCGACC TTGGAACTCT CTAGAATAGG TTGTCAGAAA GCTATTGCCT 2760 

TCCAATTTTT TGAGAATGGG ATAGAGTGTG GATTCTTTGA TATTAGCGAT CAGCTTAATG 2820 

GTTTGGCTAA TCTCATAACC ATAAGAATCA CCCTGCTCCA GTACAGCCAA GATGAGAAAT 2880 

TCAATCAAGG CAGAGGATGT TGGAAAGTAC ATGGGAAACC TCCTTTTCTA ATGTGTAAGA 2940 

TTTTTATATA TAATTTTTCT ACACATACAT TGTACATCTA AAAGAAAGCC CTGTCAAGAG 3000 

AAATGTGTAA AATTTTTATA TATAAAAAAC TTCTAGCTAA AACTAGAAGT TTAAAGGATC 3060 
TTATCCGCTC TGTCCACTGT AAAGAGGGCC ACAGTCATCA GGATATCGAT GAGCAAGAGG 3120

GCAGCTACAG ATGGTACCCA AGAGTGGAAC AGGTCAAAAC TGTAACCAAA GAGGGTTGGC 3180 

CCAAAGGCTG CTAGGATATA GCCTCCTGTT TGAGATAGGC CGGAC7VATTG GGCTGTCTTT 3240 

TCAGGGGCGC TTGTCTTGAG TGAAAAGTTG ACCATGAGAT AAGGGAAGAG GGCACTGGTT 3300 

GCGGTTCCGA TGAGGAGATG GATGGCAAGC CAGTAAATGA AATTATTGAT TGGGAAAAAG 3360

AGCATGGAAA TGCCGACCAC ACCAGCTAGT GAAACCAGAG TGAGCATGAG CTGACGGTTG 3420 

CGAGTAGATA AACTGGTTGT CAGGCTTGGG ATGGTCATTG AAAAAGGAAT GCTAATCAGA 3480 

GATAAGATAG AAGTCAGCAA GCCAGCTTCG TGACTGGATA GACCTGCATG GATAGACATG 3540 

GTAGGTAACC AGGTCATGAC GGTGTAAAAG ATCAAGGATT GAAAACCTGA AAAGATAATA 3600 

ATTGCCCAAA CCTGTTTATT ACGCATGACC TTTATTTGAC TTTTTTGTTT GGTTTGTGGA 3660 

GCTAGTCTAT GATTATAGCG GTGATTTGGG AGCCAGACCA AAAAAGTTGC TAGACAGAGT 3720 

AACGTGAGGA GAAGGATAAG TCCTTTCCAA GAACTGGCTT GTGTAATGGG CACAGCTAGA 3780 

TAGGAA 3786
(2) INFORMATION FOR SEQ ID NO: 183: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3054 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 183: 

tcagctaaaa aacattgcta aattgattga AGCTGGTGCT ACACATTCCG ATTCAACTTC 60 

TCACACGGCG ACCACCAAGA ACAAGGTGAG CGTATGGCAA CTGTTAAACT TGCGGAAAAA 120 

ATTGCAGGTA AAAAAGTTGG TTTCCTTCTT GATACAAAAG GACCTGAAAT CCGTACAGAA 180 

TTGTTCGAAG GTGAAGCTAA AGAATATTCA TACAAAACTG GTGAAAAAAT TCGTGTTGCA 240 

ACTAAACAAG GAATCAAATC AACTCGTGAA GTGATTGCGT TGAACGTTGC TGGTGCTCTT 300 

GATATCTATG ATGATGTTGA AGTTGGTCGT CAAGTTTTGG TTGACGATGG TAAACTTGGT 360 

CTTCGTGTGG TTGCTAAAGA TGATGCAACT CGTGAATTTG AAGTTGAAGT TGAAAACGAT 420 

GGTATCATCG CTAAACAAAA AGGTGTGAAC ATCCCTAACA CTAAAATTCC TTTCCCAGCT 480 

CTTGCTGAAC GCGATAACGA CGATATCCGT TTCGGTCTTG AACAAGGTAT CAACTTCATC 540 

GCAATTTCAT TCGTACGTAC TGCAAAAGAT GTGAACGAAG TTCGTGCAAT CTGTGAAGAA 600 
ACTGGAAACG GACATGTTCA ATTGTTCGCT AAAATCGAAA ACCAACAAGG TATCGATAAC 660

TTAGATGAAA TCATCGAAGC AGCTGATGGT ATTATGATTG CTCGTGGTGA TATGGGTATC 720

GAAGTACCGT TCGAAATGGT TCCAGTTTAT CAAAAAATGA TTATCAAGAA AGTCAATGCT 780 

GCAGGTAAAG TTGTTATCAC TGCAACAAAC ATGCTTGAAA CAATGACTGA AAAACCACGT 840 

GCAACTCGTT CAGAAGTATC AGATGTATTC AACGCTGTTA TCGACGGAAC TGACGCTACA 900 

ATGTTGTCAG GCGAGTCTGC AAACGGTAAA TACCCACTCG AGTCAGTAAC TACAATGGCT 960 
ACAATCGACA AGAACGCTCA AGCTCTTCTT AATGAATACG GACGTCTTGA TTCAGATTCA 1020

TTTGAGCGTA ACTCTAAGAC AGAAGTAATG GCTTCTGCTG TTAAAGATGC TACTAGCTCA 1080 

ATGGATATCA AATTGGTTGT AACTCTTACT AAGACAGGTC ATACTGCACG TTTGATTTCT 1140 

AAATACCGTC CAAATGCTGA CATCTTAGCA TTGACATTTG ACGAATTGAC AGAACGTGGC 1200 

TTGATGTTGA ACTGGGGTGT TATCCCAATG TTGACAGATG CTCCATCTTC AACTGACGAT 1260 

ATGTTCGAAA TCGCTGAACG TAAAGCGGTA GAAGCAGGTC TCGTTGAGTC AGGCGATGAT 1320 

ATCGTTATCG TTGCTGGTGT GCCAGTAGGA GAAGCTGTTC GCACAAACAC AATGCGTATC 1380

CGCACAGTAC GTTAAGAAAA ATATAAAAAC CTATCATATC CAGCTTTAGA GCTTGTGTGA 1440 

TAGGCTTTTT GTATAGAGGG TAAGAAATAG GCAAAACTTT CATAATGGAT TGATACTCTT 1500 

CGAAAATCTC TTCAAACCAC GTCAGCGTCG CCTTACCGTA TATATGTTAC TgACTTCGTC 1560

AGTTCTATCT ACAACCTCAA AGCAGTGCTT TGAGCAACtG CGGCTAGCTT CCTAGTTTGC 1620 

TCTTTGATTT TCATTGAGTA TGAAATAAGA TATGCACAAA TTGATTAGAA AGTCAAATGA 1680

ATTTCTACAA ATGTTTTAGC AATCGTAATG TACTTGTCTA GATTCGATCT GATATATTTT 1740 

CGATTTAATG ATATGGTATT TAAAACCTCC AAAGTAGCTT ACTCCATTCT TTTACTTACG 1800 

TGAGTGTAGA TGTTATTTAC TGTTTTAGCG TTTTTGTGTT CCACTCTAAC CATTATAGCA 1860 

TTCTTCTCAG CTAGTGTACT AAGGAGTGTG TGCCTGAAAA TATGGGAACT AAGGGGCTGG 1920 

TTTATCGGTT TCTCTAGTTT AGTATTTGCC TTTTGCAAAG TGATCTTAAA TGCCTTTC'l'C 1980 

TAAATTTACA TATCACTATT GTTTAACAAA ATCTAATCTA TTTTAGGTCA CTTATTCTTT 2040 

TTTTGAAATG TAGAATGAAC TTTTTCAAAG TTTTTCGAAT CTTTTAAAAT CTGTTTGCTT 2100 

TATATCGCCA TTCTCCCCCC TTTTTTAATT CTCCCTATAT AGCCTGACAG CTTTCCCGAT 2160 

GGTACGAATA TGGTTGCTTT CGTCTAGGTG GATGTCGGGG TATTCGGGAT TGAGTTTTTT 2220 

TGAGGCAGCC TTGGCGGAGT TTCTTGACAT AGTTAGTGCC GTCTACTTGG AAGATGCCGA 2280 

TGGTATTATA GTCAATCTGT GGGGTATTCT TGATAAATAG GTAGTCGCTG TTTCTTATCT 2340 

TTGGCTCCAT GGACTTGCTG ACGACATAAG CGATTGGGTC GTAGTCGTCT GGGATAATGG 2400 
AAACTCCATA TCTAAATCGT TGTCCTGCAT CGAGCGGCTA CCTGCAGAGA TAAACTACCT 2460

AACACGAGAG TAAGTAGTCT GTCTGTAGTC GTCCAGTCTG ATGATTTTTA CGATACTTCG 2520 

TTTTTCTGAT CATACAGTTG CCTCTCGGCA TAGGTCAGAA CTTTACCTTG TCTGGGTGGT 2580 

TCCCGTTGGT CGTAGATAGA TTGGATATCG CTAGGAGAAT CCTTTTGAAC TGGAGGAAAG 2640

AGGGCATCGA TCAAGCTACT GAATACTTTA ACTAAGTCAA ATATAGTATT TTTCTTAGTA 2700 

GACCTAACCC TTTTTTCATA ATTTCTAATG GTGTTTTTAC TTATACCTAT CTTAGTACCC 2760 

AATTCTTATT GAGTCCAACC ATTACTAGTC TATATTGTTT TATAGTTGAT TGAGTTTGGA 2820

ATAGTACGCT GTAGCTGCTA AAACATTTCT AGAAATTAAT TTGACTTTCC TAATAGAGTT 2880 

GTTCATATCT TATTTCAATC TATTATGTTT TTCACCTCTA ACAATCGCAA TCTCTTCTTT 2940 

ATCCATGAAT GAAATCGCTT TCTATTTTTG TAAGTAAAGC ATAACACGAA ATCCACGAAA 3000

ATGAAAACCT TTGTTGTGTT TTCGTAAAAA ATTTGTTGAC AGAGCACGAA ACGC 3054 
(2) INFORMATION FOR SEQ ID NO: 184:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1590 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 184: 

TGTGATTTTC yGAAAATTTG GTAAAATATA TCTTAATCAT TTTCAGGAGG ACAAAAATTT 60 

GACAAGATAT CAGAATTTAG TAAATGGAAA ATGGAAATCA TCTGAACAAG AAATTACGAT 120 

TTATTCACCA ATCAATCAAG AAGAATTGGG TACAGTTCCA GCCATGACTC AGACTGAAGC 180 

TGATGAGGCT ATGCAAGCTG CGCGTGCAGC CCTGCCAGCA TGGCGAGCTT TATCAGCAGT 240 

TGAACGTGCG GCTTATTTGC ATAAAACAGC AGCTATTTTA GAACGCGATA AGGAAGAAAT 300 

TGGTACTATC CTTGCCAAAG AAGTAGCAAA AGGGATTAAA GCAGCAATTG GAGAAGTAGT 360 

GCGTACAGCA GACTTGATTC GTTATGCTGC TGAGGAAGGT CTCCGTATCA CTGGACAAGC 420 

AATGGAAGGT GGTGGTTTTG AGGCAACAAG TAAAAACAAA CTGGCTGTTG TCCGTCGTGA 480 

ACCAGTTGGT ATCGTGCTAG CGATTGCTCC CTTTAATTAT CCAGTTAATT TATCTGCTTC 540 

TAAAATTGCA CCTGCCTTGA TTGCAGGGAA TGTGGTCATG TTTAAGCCAC CAACACAAGG 600 

TTCCATTTCT GGACTCTTGT TGGCTAAAGC ATTTGAAGAA GCAGGGATTC CGGCAGGTGT 660 

TTTCAACACC ATTACAGGTC GTGGTTCAGA AATTGGGGAT TATATCATTG AGCACAAAGA 720 
AGTCAACTTC ATCAACTTTA CAGGTTCAAC TCCTATTGGA GAACGTATTG GTCGTTTAGC 780

TGGTATGCGT CCTATCATGT TGGAACTTGG TGGGAAAGAT GCAGCTCTTG TACTAGAAGA 840 

TGCAGATTTG GAACATGCTG CCAAGCAAAT TGTTGCGGGA GCCTTTAGCT ACTCAGGACA 900 

ACGTTGCACG GCCATTAAAC GTGTCATTGT TCTCGAAAGT GTAGCAGATA AATTAGCTAC 960 

TTTGCTTCAG GAAGAAGTTT CTAAATTAAC AGTTGGTGAT CCATTTGACA ATGCTGATAT 1020 

TACACCTGTT ATTGACAATG CTTCAGCCGA CTTCATTTGG GGCTTGATTG AGGATGCACA 1080 

AGAAAAAGAA GCTCAGGCTC TTACACCAAT CAAACGTGAG GGCAATCTTC TCTGGCCAGT 1140 

GCTTTTTGAC CAAGTTACAA AAGATATGAA AGTGGCATGG GAAGAGCCAT TTGGTCCTGT 1200 

TTTACCAATC ATTCGTGTGG CTAGTGTAGA GGAAGCTATT GCCTTTGCCA ACGAATCTGA 1260 

ATTCGGCCTT CAATCATCAG TCTTTACAAA TGATTTCAAA AAAGCCTTTG AAATTGCTGA 1320 

AAAACTTGAA GTAGGTACAG TCCACATTAA TAATAAAACC CAGCGTGGTC CAGATAATTT 1380 

CCCATTCCTT GGTGTCAAAG GTTCTGGAGC TGGAGTGCAA GGAATTAAAT ATAGCATTGA 1440

AGCGATGACA AATGTCAAAT CCATTGTTTT TGATGTGAAA TAACGTGTAA AACCAGGAAA 1500 

TTGTTTTCCT GGTTTTATTT TTTTGCTATA AAATAATAAT AATTATAGAA AAAATACGAA 1560 

CTTTTTGGTA TTATAATAGA TTGAAACCGG 1590 
(2) INFORMATION FOR SEQ ID NO: 185: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4848 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 185: 

CCTGCAGTTG TCAGACCTGT AATTTTCTTT TTATCTGTAA TAAGAATCGT TCCAGCGCCT 60 

AGAAAACCCA CACCTGATAT AACTTGAGCT CCTAATCGTG TAGGATCTCC TGTCCCAAAT 120 

TTATAAGATA CGTATTCATT CGTCATCATA ATCAAACATG CAGCTAGACA AACAATACTA 180 

TAAGTTCGGA TGCCTGCAGG CTGGGATTTG CTCCCTCTCT CTAAACCAAT TATACTACCA 240 

ATGACTACTG ATAAAACAAT CCTGACAACT ATTTCAATAT TTGATAACCC AAGACTAGTG 300 

GCTGTCATGA TTATTTCCTT ACTTTACGCC CCGGTCTTTG TGTGAAGTAT AATACCGTTC 360 

CAGAAATAAT CATCAGAACA ATTGTATAAA CAAATACCAG AGCTTGTGCA TTAGATGTTG 420 

CTGTTTCATC ACCTGCAGAT CGAATCGTAA TACCTAATGG TTGAGCTAGG GGATGGTAAA 480 

GGAATACAGA TAAGTCGAAG TCAGTTAATA AAGAGTTAAA GTTTAAAGCA ATAACAGAGA 540 
GAACAACCGG TAAAATAAAT GGAATGATAA CCTTCATCAT AGTATiVAAAA GGTGAAGCAC 600

CCATACTTCT TGCTGCATCT TCCATCTCAT CATCAACACT AAATAAAATA GCACGTACCA 660 

TTCTATAAGA AAATGGGATT TTTACAACTA TATATGCAAT AAGTAGAATT ACCAAACTAC 720 

CTACCAAAAT CTGATTCAAG ACAAGAAATT GTGGCTGATT AAAAGTAAAT AATAAACTTA 780 

CTGCTAAAAG TGTACTTGGT AGTAACCAAG GAAGTAGAGC ACCATATTCA AATAAGAAAT 840 

CAAAACGAGA TTTATGTTTT CTGACAACAC GAGCAAATAC AACTGCGAGA ATTGTTGCTG 900 

TTGTCGCAGC AATAATAGAA TAAATAAAGC TGACCAAGAA TGGAGAGAAT GCCGCACTAT 960 

TACTAAAGAA TAAGCGATAA TTTTCTAAAG TAAAGTTTGA TAATGTTAAG TTACCTGTTT 1020 

GAATTGCAAC TGGATCTGTA AATGAGTATA ATACTATAAA AATTAGTGGA AGCATGAAAA 1080 

CTGTGAACAA TCCATATGCT ACAATGTGAG CAATGATATT CCAAGGCTTA GACGCAATTT 1140

TTTGTTTTTT AAGAGGCGCT TTAGTCTTAG AGATAGAAAT ATAATTTCCA CCTTTTTCTA 1200 

TCTTATTCAT GATAGTAAGC AAAATTGTAG TTGCAATACC TAAAATAATT GCAAGTAGGG 1260

CAGCTAAATC ACGAGAATTC CCCATCCCTG CAAATGTAAT AATCATTGGA TTTATAGTTT 1320 

GAAATTCTTT ACCACCAACA ATCATGGGTG CTGCTACTGC AGATAAACCA CTAAGAAAAA 1380

CCATAATAGT AAGTGCAAAT AGAGTTGGAA TTAAGGTTGG TAACACTACT TTTCGGAAAA 1440 

CAGTAAATGG TTTTGCTCCC ATATTTCGAG CAGCCTCAAT AGTGTGATAG TCAACGCTTC 1500 

GAATTGTATT TGTTAAAAAC AATGTATGAT TAGCAGTTCC TGAAAATGTC ATAATGAATA 1560 

AGACTGCACC ATACCCAATA AACCAGTTAG GGTCTAAAGA AGGGATAACA TTTTGTAAAA 1620 

ATTTTGTAAT CAATCCATAA GGACCATAGA CAAATTTATA TCCAGTCGCT AAAACCACTC 1680 

CTCCATAAAT TAAAGAGGTC ATATAACCTA ATTTTAAAAT TTTAGCACCT TTAATATCAA 1740 

AGTACTCTGT AAATAGAACA CAAAGAATAC CTACGACATT AACTGTAATA ATGAGTGAAA 1800 

ATGCTAACTT AAAACTGTTC ATAATACTCT GAAGTGCCCT CTGAGATTTT AGAACACGAT 1860 

GTACAGCATC AAGGGAAAAT TCTCCTCCTT TTACAAATAC ATTCACTACT AGATCAAAGT 1920 

TTGGATAAAT AATAAATGTT ACTAAGAACC AGATTAACCC TAAACGAATA AGCCAATCTT 1980 

TTAAATTTAA TTTATGACGC ATACTGCACC TCCTTAAAAT TGCAGAACGT CTGATGGTGT 2040 

GATAAATAAT TCCACACTTT CTCCGACAGA TCTAATAGCA GCCTGACTAT CAATACTTGT 2100 

TACATTAAGA ATCTGACTTT CAGAAACTTT TATTGTATAG TGAATTGTAA CTCCAGAAAA 2160 

CTCAACATCA ATAATTGTCC CTTTTAGAAT AAAATCTTGT TCAGTTTCAC GATTGAATCG 2220 

AACrrrCTCT AATCGAATGT ATCCTTTTTT ATCCTCTAAG AAAACGCTTG TATTTTTCAA 2280 
TAATACTTCG TGGACTGTTT CATCGGTCAA AACATTAATA TCTCCAATAA AATCACATAC 2340

AAATTCAGTT TGAGAATTAT GATAAATCTC TACTGGTGTA CCGACCTGTT CGATGTATCC 2400 

ATTGTTAAAG ACTGCAATTC TATCAGATAA AGTCAAGGCT TCCTCTTGAT CATGAGTAAC 2460

ATATAAAGTA GTAATACCTA ACTCTTTTTG AAGTCTTTTC AACTCTTTTC TCAAATCTAC 2520 

ACGTAATTTT GCGTCAAGGT TTGACAATGG TTCATCTAGA CAAAGAATTT TAGGTTCAAG 2580

AACCAGAGCA CGAGCCAATG CTACCCTTTG TTGTTGACCC CCAGATAATT CTGATACATT 2640

ACGCTGTAAC TGTTGATCAG AGATCTTAAT TTTTGCTGCC ACTGCTGATA CTTTAGCTTT 2700 

AATAACATCT GGAGCTACCT TCTTAACTTT TAAACCAAAT GCAATATTAT CAAAAACAGT 2760

CATAGTTGGA AATAGCGCAT AAGATTGAAA TACAATACCA ATTCCACGCT TTTCAGGTTC 2820

CAAATGAGTG ACATCTGTTC CATTAACTTC AATACTTCCT GATGATGGAT CTAGAAAACC 2880 

TACCAATGCT CTCAAAGTAG TTGATTTACC ACATCCTGAA GGCCCAAGAA ATGTAAAAAA 2940 

TTCCCCTTCA TGTATATCTA AATTCAGATT ATCAATTGCA ACAAAATCAC CATATTTAAT 3000 

TTGAATATTA TCAAATTTAA TCATCTCACT AACTCCCTCT ATTACTAAAC CAAAAGCCTC 3060 

TCTTTATTTC TTCCATAAAT TTAGAAATAA TAGAGAGACT TGGACATAAA AATTAACTCT 3120 

TATTTCTTAT TGTACGTATT CTAATTCAGC TTTTTCTACC CATTCATCCA AATGCTTTCC 3180 

AACAGCTTCC CAGTCAATAT TTTGTGGTTT CACTTGATCA ACAAATTTCT TCGTATCTTC 3240 

AGGTAGATCT TTGAGGGCAT CT-TtAtttGC AGGAATAGAT CCAAAGTTCT TACTATATTC 3300 

TACTTGAATT TCTGATTGAC CAAACCAATC AATAAATTCT TTAGCTAACG CTTGTTTTTT 3360

ACTAGTGCTT AAAACCATAG TTTGTTCAGT TACAAATGGT ACACCAATCT CAGGAGTCAT 3420 

AACTTTGAAA ACAACATTTT GTTCTTTTTG TCCAACTAAT GCACCAGAAC CCCACATCAT 3480 

TCCATATTGT ATTGGATCTT CTTTGTCTAA CATCTTAACA ATTGAACTTT CTCCCTTTTG 3540

AAGAGTGTAT GCATTTTTCA AATATTCTTT TGCTACTTCC CAACCTTTTT CGGAAACACC 3600

TAATTCACCT TTATCATCAA GGTATCGAAC TAAGATACTT GCTAGAATTG CCCGTCCTCT 3660 

ACCTCCTTGA AGACCAGAAA TTGAATATTT ACCTTTATAC TTACTACCTA ATTCAGTCCA 3720 

ATCTTTAGGC ATTTCTTTTA CATCAGGCGC CCCAATTAAA ACTAATGGTT GAACAATCAC 3780 

AGGATTATAA TAATTATCTT TATCTGATAA AGATTGATCA ATTTTATCTA ACCATTTAGG 3840 

CTTGTACTGT ACTAGTAATT TTTGATCTCT AATTTTATTT GAATCAACAG CACCAATTCC 3900

AAATACCATA TCTGCAACTG CATTATTCTT CTCAGCAATA ACACGGTCTG CTAATTGAGC 3960 

GCCAGCGATA TCAACCATTT TTATATTAAA ACCAGCTTCT TTTGCTTTAG CAGTTAACCA 4020 

ATCACCACGA CCATTTGAGA CTGAGTTCGA ATAGATAACT AATTCTTGAC TTTTATCAGC 4080 
TTTTTCTTCA GATGAAGAAG CAGTCGTAGA ATTTGAACCT CCAGAGCAAG CAGCAAGTGT 4140

AGTAAgAGCA ACTCCCGTTG CAAGTACAGT AGACCAAACT TTCATTTTTT TCATGATAAG 4200 

TTCTCCTTTT TTATTATTTT ATTTAAATTT TTCGTGATAT GGAACAAATT GTCTCATATC 4260 

TTCAAATACA GTATAGTCAA TACGGTTTAC AGTAATAGTT GGAATCTTCT CTAATAAAAT 4320

TTCAGTTAAT TCTGCTCTGA CTTTAGTAAA CTCTTCTTCC TCCTCTTCGG TTAGAGGAAT 4380

CCGAAGATAC CCAATTGAAA TATGGAATTG ATATCTATCA TGATTAGGGA AACAAACACC 4440 

TGCTTTTTCT GAGACATAAG TACGAATTTC TTCTAATCTC TTTGCAGAAG CTTCATCTGC 4500 

AGGTTCAACT AGTATGTTTT GTTTTCCCAT TTCAGTTATA CGCATATGAA TTTCTTCATC 4560 

CAACAATGGA AAAATTTCAA GTTGTTTAGC AAAGTAATCA TGTATTTCCT GTAAAGGTGT 4620

ATCTAGAGGA AGATTACTGC TCCAAAACTC gtTCACGATT TTCATGGCAC AACAATTCAA 4680

TTACAGTCAT GTGAATAGAA TTCCTTGGAG TTAAAGTAAA CTTATCGATA AATGGTAATT 4740

CTCTATAACG TGATTGAATA ATATCAACAA CTTCCATCAA ATCTTGTTTA GTATAAAGAT 4800 

TTGCTACAAC TGTATTCCCA GGGAAATGAT TAAATTCCCC ATTCTCGG 4848



(2) INFORMATION FOR SEQ ID NO: 186: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3763 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 186: 

GTTATAAGCA ACACCTTCTT GCTTGCCATA AGTTGTGAAA TGGGTAGAAT CGATATCTAC 60 

AATGAGTTGG TTTAGCTGGT GAAACTGTAA AAAGAATTCG ACCAATTCAA GGTTGAGGCA 120

TCGCAAACTA TGGACTGTTT CCTCGTCAGT TCTGGAAAGA AAACGGGATA AGGTTGGCTG 180 

TGAAGCAAGC TGCCCTCCTT CCAATAATTT TGGAAAGTAG GCATCAGCTG ACAATTCTTT 240 

ACAAGCATAG TCCGTTCCAT AACCTGTTAA CAGTTGAAAG AGGAACTGGA CAAGGATATC 300 

TGAATCCGAA TAACGACAGT AGCGGCGTTG GTCATTCGTT ACTAAATACT TAGAAATCCG 360

CTCTTTTAGT TTCAACTGGG AAAAAAGTTC CTGAAAAAAG ATAAGACCAC CATACTGGGT 420
TAAATGACCT CCATCGAAAG ATAGTTGGTA AAAAGACTTG TTTTGGAAGT GATGATTTGG 480

TAAACTGTTC ATGTGAGTTT CCTTTCTTTT TGTGTTTTTT TCTACACTTA TACCATAAAG 540 

GGGAAACTCT TTTTTGTCTA GTAAAAAACA CCCATTGGGT GAAAAAAGAA ACCATCCAGG 600 
ATCTAAGCTA AGGCAAGGAT TCTGGATGGT TTTTAGATTT GGGGTGAATA ATTGGGGATT 660

TAGGAGAAAT GATGGTATCT TCCAAATCAA AATCAACTTC ACTCCATAGT CTCAACTGAT 720 

TGATTTTCCC ATCTTGATAG GTCACATCCT TGTCAAGGAT AAACTGAGTC AACACCTCAT 780 

GTTGACCTTG ACACCTGATG TCATCTACCA AGAGCCAGAC ATCCTCTACC AACATGAGGA 840 

TTTTTCTCCT GTGAAGATAA GGCAAATCAG GTTCTGCTGA CCAATAAGCC CCCTCAATAT 900 

AATGCACTCC CTCCCTTTCT TTATGGTGAC AAAACAGGGA GTGAGGATAG TATTCATATT 960 

CCCAGGATCC CGTGATTCTT TCCGGAGCTT TCCCATCTAC AATGCAGGTC GAATGACTCC 1020 

AAGCACTCTT TAAGAGATAA CGTTCATATA TCTCCCGATA AGAATAACGC CCAGCATCTA 1080 

TGAAAATAGG TTGGCCTTGA TACTGTAAGC AAAAACTATT CTCGTCACTA TGACTATGGG 1140 

CACTTCCTAG CGGACCATTT TTGAAAAATA GATAACGATG TTCATCCTTA ATGCAGACAT 1200 

GTCCAGAGTC TTCAAAGATC ATGGACTTAG GCTGCCAAGC TCTCTTTTCA AATTCCTGCA 1260 

GTCGCTTGAC CTTTTCTCGC CCCAGGAACA AGAGGCTAAG CAAATCAACT TTAACATCCA 1320 

GACCGTTAAG AAGGTCTTCC TGGTTCAAAA CCACAGCAGA CAGGCTCAAA ATTTCTGTCG 1380 

TTTCTGTAGA ATCGCTATCA CCAAAAGCCA AAGTCCGTCC ATCTAAGCCT GTCATCATTT 1440 

GAATATAGGT CGCCATCTTT TCCAGCAACT CTTGGTAACT ATCTTGCAAG TCTGGAAGCA 1500 

AGAGACACAA ATCCAGCAAG GCTTTATAAA CCTCTACATG ATAGAGAATC GACTGTTCAA 1560 

ACTGGCTTCC ATCTCCTAAA AtCTGTGTCT CAATTTGCTG TTTCAACTCC TCTGAAGCAA 1620 

AATGGTAAGC TTCTTCTAGA TCCATCTTAT CTGAAAAGAA ATGATAGATA GCAAGCATCG 1680 

GAATTGTTTG TAAAATCCCC CAGTTACTAA GGGTGTACTT GGCGCGATAG TAGCTTTTCA 1740 

TAAAGTCAAT CTGCTTTTCT AGACTGACCA AAATTTTCTC TAGTTCTTTC TCCTCTAGCA 1800 

AGTCAAATTT CAAGAGGAGC AAGAGTAGTT TCAACCAAGT AAAGGAACGA ATACCCGTAT 1860 

CCAAGGTTCT AGTCATCAAG GATTGAGGAG AAAATTCTCT CACCTGCTCA ATCCAATCAA 1920 

ATAGAAAGAA CTTGCACTTT TGAATATAGT CCTTATCTCC TTCTACCAGA TACCCTATCA 1980 

TAAACTGCAA GAGATATTCT TGTCGATTGA GCATATAAGA CCATTCTGGA TCATCTTCAA 2040 

ATACTTGATC CCATACCATC GGCTGGATTT GATGGATTTT TGAACAAGGC TCCATATCCC 2100 

AAGGACTATC AAACATAAAA CGATTGTCCA TCAAGCGTTC AAGGGAACTC TTGACTTTCT 2160 

CATAGTCTTT TGAACAGTGC GACAAGATAT AATCACGACA TTGATTTCCA TCGACTCTTT 2220 

CAAAAAATTG TCTTCTTTCT TCTTTCATTA TCTATTACCA GAAAAAGAAC TACTTAAAAA 2280 

GCAGTTCTTT TGTCTTTCCC ATTACACTTT CCTTTTCTAC ATGGATGACC ACACCTTTTG 2340 

CAATCTGCAA GGAGACCAAG TCATCTTGGA TAGAAATGAT TTTTCCATGA ATTCCAGACA 2400

[illegible] TTCATCACCA AATGTTAAAG AAGCTAAATA CTCTTGTCGT TGCTCCATCT 2460

[illegible] CAACTTTTGC TGACGAATAG AATGAAAGCT TGACAGTAAA AGGGGACTCA 2520

CTGCCAAGAC AATCACTATT CCATAAAACA ATGTTGTATC CATTAAGCTA TAATCTTAAG 2580

CCAGCTTCCG ATAATTCCGA TGATAACTGT TAAAATAACG AGTTTATATG TTGTCCATTT 2640

CTTTTCTTTG ATCAAGTAGT AAACTAAAAG TGTAAATAGG GCTGGTAGAA GAGCTGGAGC 2700

AACCTTATCA AGCATTCCCT GAATACTTAC GATACTTTGT TTAGCGTCTG CTTTAACTTC 2760

CCCTGCAGCA AAGGTAATCG GCACCATAAT CTTAACAGAT GTCGCTGCCA AACCAGCAAT 2820

TACGtTACAC CGATAATATT GGCAATACGA GAAATCGTTG CCATCTGTTC GCTTAGTTTA 2880

TCAATCACAG TTGTTCCTAG TTTGTATCCA TACAGACCAG TTGACAATTT AATCGCTGTT 2940

AAAATCGTAT TCATCGCAAG GAAGAACAAG ATTGGACCGA CAACCAAGCC TTCTTGAGCA 3000

AACGAAGCTG CGATGGTTGA GAACAATGGA GCTAAACAGA ATTGAGAAAG AGAATCCCCA 3060

ATACCTGCCA ATGGTCCCAT CAAGGCCATC TTGATGCTAC GTGTTTCTTT TGCCGGACGG 3120

CCATTTTCCA ACATTACAAG ATGCAAGCTG GTAATAAAAG GCAGGAAGTG TGGGTTGGTA 3180

TTATAGAATT CACAGTTTTC TTCCAAGGCT TGGTAGAAAC CTTCCTGATC CTCTCCATAG 3240

TGTTTTTTCA AAGCAGGATA CATCACATTG GCATATCCCA ACCCTTGATA GTTACTATAG 3300

TTAAATCCAT TTTGACAAAA GAATGCCCGC AAAGACGTTT TAAGATAATC ACGTTTTGTT 3360

AATTTGTTAG ATCCAGTCAT CGTGTGCTTC CTCCTCTACC ACATGATCCG CTGTTTTTGG 3420

CTTGTTATAA AATTCAATCA AAGCAAAGAT AGTACCTACA ATTGCAATAC CAATTGTTGG 3480

GATGTTTAGA TAAGCTGCAC AAACATATCC CAACAAGACA AAGGGAATCA ACTCTTTCTT 3540

AGCCATCACT GACAAGATCA TCGCAAAACC GATAGCTGGG AGCATTTTAC CAGCAACTGT 3600

CAAACCTGTA AGTAATACCG GTGGAATGTA GTCTACGAGT TTCAACAAGG TATCCATTGA 3660

AAGGGCACCA AGCAACCCAA GGTAAATCCA ATAAAGGCAA ACAACCAAAT TGTTGCATTT 3720

AGAGTGAACT TAAATTTCTT CAAATTATGG TTTTTCAAGT GCT 3763



(2) INFORMATION FOR SEQ ID NO: 187: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5053 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 187:

CAATCTCTGA GTATGTGCGG TCAATACTAw CAAAGGGAAT yCCTGACGTC AAGTAATGTT 60

CAATTGGmCT ATAGGTAATG GCAACCACTC CATCAACTTT ATTATGACGC AACATCTCCA 120 

GATAGTCTTG CTCTCTATTT GTACCATTGA TAGAACATAA GAGTAATTTG TTATTTCTCT 180 

TATAGACTTC ATTTTCCACA TGCATAGCAA ATTCTGAAAA GAAGGGATGC CAGATACTTG 240 

GTACAATGAT TGCAATCGTT TCTGTTCGAT TTTTTTTCAT TCCTCTAGCG TAGTAATCTG 300 

GAATGTAATT CAAAGTTTTA ATCGCTTGTT CCACTTTTTT CAAAGTTACT TCTTTAATGC 360 

CTTTTTCTTT ATTAATTACA CGTGAAACAG TTCCAACACT AACTCCTGCT TCTAAAGCAA 420 

CATCTTTCAT GGTAATTGAT TTTCTTTGTT CTACCATATT ATCACCTCCT TTCAATATAT 480 

AGTATCATGC AAATGCTTTT TAAGCAACTA TTTCTCAATC ATTTTTGGCC AGATCATTTA 540 

TCCCATCATG AATAAAATCA CTCCAATTAG CTTTTGAAAA TACTTCAATT TTCATGTGTA 600 

AACATCTACA TAAAACAGGA AAAGCCTTGG TTTCATGGCT TTTTTCGTAT CTTCTATAAA 660 

AAAAGCAAGA GTTTTAGATG GCTATAAATC TAGATGTACA TTTTGCTTAA ATGATTGAAG 720 

GTCTTTTCTT AACAAAAACA CCCCCAAAAT TAGACTTTTT CTGTCTAACT TTTGAGGTAC 780 

AGTTCAAACG CGAAATAGCG TTTTTTTGTT ATTTTTGGTT ACTCATCTAA TCGAATAAAC 840

ATCATGGCAT TTAACAAGTA TATGAGTGAG ACCGTGTTTA TATTATTTGA ATAGATGAGT 900 

CTCTTATTTT CAATAGGAGG AATAATAAAA TTAGAAATAA TGATATCATA AGGTGAATCT 960 

TCTAAAGATT CCTTTGATAA TTCTAATTCA GTvCAAACTT CCAGTTCAAA ATTATTGCTA 1020 

CAATT^TAAG AAAGTGTCTC TGCAACGAAT TTTGCATGAT ACTGATCAAA ATTACTCATA 1080 

ACTAAAACCT TTAGTTTAGG CTGATTTTGT AGCAAATTAA TCACCAAATG TTTGGTATGA 1140 

GTGATGAAGG TATAAGATAG ATGATTTACC ATCATTGAAC TAGAACAAAC CTCAAGAGTC 1200 

TCTAAATAGT GAGAAAGCTC TTTTTTTATA TCTGAAACAA ATTTTGGAAA AATATTTTGA 1260 

AAGTTCCTGA TTGTATTCCC TTTTTGATCA AATAAAATAA ACTCAGTAAA CAACTCTTGA 1320 

CGATACAGAT GTGCGGTATT ATGCAGATGC CAAATCAGAT TATCCTTATT CTCCATTTCA 1380 

ATCTGATACT TGACTGAAAT CTGATCAATA AAATCACTCA ATAGATGGTA AGATTTTTCA 1440

ACATAACTAT CCTTTTTTAC GCATTTCATA AAGAGACTTT CATCTATGAA AAACATTTTT 1500 

TGAAAGTAAG ACACAAATAA TTGGCAAACA ACTTCTTCAT CTAAAGAGAT ATTGTATTCT 1560 

GATTCAAAAC TCTGAGCAAC ACCTTCTATT CCTTCTGCCT GCATTAAAAA ATCCAAACTT 1620 

TGGTCGTTAA AAGAATCTTT ATCTACTTCC ATAAAATGAC CAAACTTTAT TCTATATAGG 1680 

TTCGTAACTA GGAGCAACTT TAGCATTCTA TGCGTTGACA AATTCATTGG AAAGCTTGTT 1740 

TCCTTATAAA CCAATTCTAA CAATTGAGAT AGTGGCTCTG ATGAAAAATT TTCAAATGGC 1800

CATTCTAGGA AATAATATTT TTCTGAAAAA TATTGTGCAA AAAAGTAACG AATGTCTCTC 1860

TCATTTCCAA TGATTTGAAC AGGGGTCAGA CTAACTTCAA ATTGAAATTG CCTTTTAATC 1920 

ACTTTATTGA TTTGGCTAAT AATACGATAG AGCGAAGATG AACTGATATA AAATTCTTTA 1980 

CAAATACTCT CAGCTTGACA ACCTTCATTA AAGAAGATGA ATTCTAAAAT CGAAAAATGA 2040 

GTTGAATGTT TAAAGAAATG ATGGTAAACC ATTTCAATAT CACTATCATC GGTATTAATA 2100 

ATGCGTATAC CATTAGTAGA AGAATGAAAA ATCAAGTCAG GAAAAGCAGA TTTAACATGG 2160 

GATAGATCAT CTTTGACTGC ACGTTCTGTA CAATTTAATA ACTCTGCTAG TTCAGAACGA 2220 

TGAAACCAAC GTTTATGTTC AAATAATAAT TCTAATAATT CTAATTGCCT ATGACTTTTT 2280 

TTAGATAATA AATCTCTCAT GAATATCTTT CTCTCTTTAT AAATTATCGG ATTAAACCTC 2340 

TTGCAATTAT ACCACAAAGA ATAGGTATAG CATGATATAA CGACTTTTCC TAAAATCTTT 2400 

TATTTCGTAT AATAACACTA CGGAGACAAT ATATAAACAA TTTTCTTATT TTACCGTCTA 2460 

TTGAGGGCGT GAATACAGAA TCAAATTCAA GTCTAAAGAT TATATTTTTA ATTTTAAAAA 2520 

TTATATAATA GCAACAATTA AAGAATTTGA TTTTTTAAAA TTATATAATA ATAACAATCG 2580

AAATAATTGA CTTTTCTATA TTAAAGTTAT ATAATAGTAA TAATCAAAGA AATTGATTTT 2640

TTGATATTAA AATAAAAAAG GAGGGTAGGC AGTGTTGTGA TCAATTATTG CTGGAGGTCT 2700 

TATTGGTCTC TTGGCAGGTA AAATCACTAA AAAAGTAGTT CTATGGGAAT CATCGCAAAT 2760 

GTATTCGCTG GTTTAGTCGG GGCATATGCA GGACAATCTC TTTTAGGTAG TTGGGGTCCA 2820 

GCAATCGCTG GAATGGCTTT GCTCCCATCT ATTGTAGGTG CAGCGATTGT GATTACTGTA 2880 

GTGTCATTCT TTACAGGTAG AAAGTAAACT TTTCGCCAGT AAAGTTAGCA AACTATTTTT 2940 

AAATCAATGA CGGGAAAAAT AGTTTAAATG TTAAATCGAA AGGATTGTAT ATGTCAAAAG 3000 

CAAAGAAAAT ATGTTTCATT ATTTTCTGTA TTTTAATCTT GACAATTTTC CTTCCTGTTT 3060 

TGATAGATTA TCATCAAGTT AGTGATCTAG GTATTCATCT ACTTAGCTGG AGACAGAACT 3120 

CCGTAGTTGA ATTCTATCTT GCTAGATATG TCTTTTGGGG GACAGTGGTT CTATCAACTT 3180 

TAGTTTTATT ATCCATTTTA GTTGTGATGT TTTATCCTAA ACGTTACTTG GAAATCCAAC 3240

TTGAAACTAA AAACGATACA TTAAAATTAA AGAATTCGGC AATCGAAGGT TTTGTTAGAA 3300 

GTTTGGTGAG TGATCATAGA TTGATCAAGA ACCCAACTGT TCATGTAAAT TTACGAAAAA 3360 

ATAAATGTTT CGTTCATGTA GAAGGTAAAA TTCTTCCTTC AGACAACATC GCTGACAGAT 3420 

GCCAAATAAT TCAAAATGAA ATAACTAATG GATTGAAGCA GTTTTTTGGT ATTGAGCGTC 3480 

AAGTAAAACT TGAAGTTGCA GTAAAAAATT ACCAACCAAA ACCTCAAAAC AAAAAGACTG 3540






TTAGTCGTGT GAAGTAAGGA AGTAAAAAAT GGAATGGCTT AAACAATATC GATATCCAAT 3600

TATCGCTGGT CTCATAGGCG TATTTCTGGC TTGTTTGATT GTCTCCTTTG GCTTCTTCAA 3660

AACAATATTT GTATTGATTT TAGGAGCACT GGGAGTTGCA GCTGGATTAT ATATCGAAAA 3720

AAACTATATA GATAAATAAA AAAATAAAAA TTACTAATTT AATTAAAGGA GTTTCATATG 3780

TCAAACGAAA AAAACACAAA CACTAACGTA GAAAAGAAAG ATGCTACTGT TGTAGCTCAC 3840

GAAATCAAAG GGGAACTTAC TTACGAAGAT AAAGTTATCC AAAAAATCAT TGGTCTTTCA 3900

CTAGAAAACG TTTCAGGTCT TTTGGGAATC GATGGTGGTT TCTTCTCAAA TCTTAAAGAA 3960

AAAATCGTTA ACAGCGATGA CGTAACAAGT GGTGTTAACG TAGAAGTTGG TAAAACACAA 4020

GTTGCAGTTG ACTTAAACGT TATTGTTGAG TACCAAAAAA ATGTTCCAGC TTTATATTCA 4080

GAAATCAGAG AAATCGTATC TTCAGAAGTT GCTAAAATGA CTGACTTGGA AATTGTTGAA 4140

ATCAACGTAA ACGTTGTCGA CATCAAAACT AAAGAACAGC ATGAAGCAGA CTCAGTAAGC 4200

CTTCAAGATC GCGTATCTGA CGTTGCTGAA TCAACAGGAG AATTCACTTC AGAACAATTC 4260

GAAAAAGCTA AATCTGGTCT TGGATCTGGT TTCTCAACTG TTCAAGAAAA AGTTAGCGAA 4320

GGTGTAGAAG CTGTTAAAGG TGCAGCAAAT GGTGTAGTAT CTCACGAAAA CACTCGTGTA 4380

AACTAAGATA AAATAAATAT AACAGGAGAA ATTATCATGT CAGTAGAAGA AAAATTAAAT 4440

CAAGCTAAAG GTTCTATTAA AGAAGGTGTT GGGAAAGCCA TCGGTGATGA AAAAATGGAA 4500

AAAGAAGGTG CAGCTGAAAA AGTTGTT-TCt aaAGtaaaag AAGTTGCCGA AGACGCTAAA 4560

GACGCTGTAG AAGGTGCTGT AGAAGGTGTT AAAAACATGT TGAGTGGCGA CGATAAATAA 4620

GGTTAAAAGT TACTTTATCT TTTTAGTAAT ATTAGTCAAA AGAGTCTGAG TCT^GATGAT 4680

TCTCAGAAAA CAAAAAGCTA GAGATTCCCA ATTGCGGAAC TCTAGCTTTT TAATTTTGCC 4740

TCTTTCTCTT ATTATATTTC AGCAGGTTGT TGGCCATGAG TACGAATCCC ATGTCAATTC 4800

TCACTTGACG CTTACCTCTC AGATGACATC TCTTATAACC CAAACAAACC TTTATCTGCC 4860

CAAAGACAGA TTTCATATCA ATCTTACGTT TAGCGAAAAT TTGTCTACCC TTGGAAGATA 4920

AAAGTGCCTG ATATTCTTTA GTTTTTAAAC ACTGGTAACG TTCATTCATA TACAGTCTCT 4980

TTTGAGGGGC TGATTCAGGT TCATAATCGC AGTCAACATT GATTTCAAGG CTGTTTGCTT 5040

TCTATCTCCC CGG 5053


(2) INFORMATION FOR SEQ ID NO: 188: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6492 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 188:



AATTCTCTTT TTTCCAACAA AATGTATGAC CTGCACTTGA ATACTTCTCA TTGTTTGTAC 60

ATTCATCTAC TTTCATATAA TCTTTTACAA AATCATAATA TGACATAACA CACTATCCCT 120

TTTAGACAAT ATTCCAATTA GCCTTATTAA TTCAAAACTA TTGTATTAGT AATTATAACA 180

GATGTATAAT AGAAAAGCAA TGATAGATAT TATCAATTAA GCGAATTTAT ATCTAAAAGG 240

GATATTAAAG AAAGGAGATA TGCTTATGAA GATTTACAAA AAACTATTTG CTTATGTCCA 300

AGATAAGAAA TATCTTGGGG TTTTGGCCAT AATTTTTTCT GCTATATCTG CTGCACTTAC 360

AGTATATGGA TATTATTTAA TCTACAAATT TCTAGATAAG TTAATAATTA ATTCAAACTT 420

ATCCGGTGCA GAGAGTATAG CATTAAAATC TGTTATTACA CTAACAAGTG GAGCGATATT 480

TTATTTTGTC TCAGGAATGT TTTCACATAT CTTGGGATTC AGGCTTGAAA CAAATTTAAG 540

AAAAAGGGaA TCGATGGTCT GGAAAAAGCA AGTTTTAGGT TCTTTGACTT AAATCCATCT 600

GGTCAAATAA GAAAGATTAT AGATGPCAAT GCTGCACAAA CTCATCAGGT GGTAGCACAC 660

ATGATTCCCG ATAGTTCTCA GGCAATAATC ACACCCGTAC TTGTACTTGC ACTTGGCTTT 720

ATAGTAAGTA TAAGAGTTGG CATAATTTTG CTTGCTCTTA CTATAATTGG TGGCTTAATT 780

TTAGGGGCAA TGA-rOGGCGA GCAAGAATTT ATGAAGATAT ACCAAGAATC CCTATCTAAA 840

CTAAGTGCTG AAACTGTTGA GTACGTGAGA GGAATGCAAG TTGTAAAAAT ATTTAAAGCA 900

AATGTAGAGT CTTTTAAAAG CTTTTATAAG GCGATAAAAG ATTACTCAAA GTATGCTTAT 960

GATTATTCCC TATCTTGTAA AAGGCCTTAT GTTTTGTATC AATGGTTATT TTTTGGACTG 1020

ATTGCAATTT TAATTATTCC TATAGTTTAT TTTATGACTA GCTTAGCTAG CGCAAAGGTG 1080

ATTTTACT'aO AGCTTATCAT A'TTTTATTT TTATCAGGAG TTCTCTTTGT TTCATTCATG 1140

AGAATGATGT GtACTCCATG TATATTTCTC AAGGAAATTA TGCAGTAGAT ACTTTAGAGG 1200

CGCTTTACGA AGATATGCAA AAAGACAAAT TAGTGCATGG TAATGTCAAT AATTTTAAAA 1260

ACTATAATAT AGAATTTGAG AATGTTAGCT TTGCTTATAA TGATAAAGCT GTCATTGAAA 1320

ATTTATCCTT TAATTTAGAA GAAGGAAAGT CCTACGCACT TGTCGGTTCA TCTGGATCAG 1380

GCAAATCAAC AGTAGCAAAA CTTATATCAG GTTTTTACAA TGTTAATAAA GGAAGCATAA 1440

AGATAGGCGG GATAGCAATA AGTGAATATT CTGACGAAGC CTTAATTAAA GCCATTTCCT 1500

TTGTTTTTCA AGATTCAAAA TTATTCAAGA AGAGCATTTA TGATAATGTA GCGTTAGCTA 1560

ATAAAGATGC GACGAAAGAT GACGTTATGA GAGCCTTAAA ATTAGCAGGA TGCGATTTAA 1620

TATTAGACAA ATTCCCAGAA AGAGAAAATA CAATCATAGG CTCAAAAGGT GTTTATTTAT 1680

CCGGTGGAGA AAAACAAAGA ATTGCAATTG CTAGAGCAAT TTTAAAGGAT TCCAAAATTA 1740

TTATTATGGA TGAAGCATCA GCATCTATTG ACCCAGATAA CGAGTTTGAA TTGCAAAAAG 1800

CTTTTAAAAA TCTTATGAAG GATAAAACAG TTATCATGAT TGCACACAGG CTATCTACAA 1860

TTAAAGACCT TGATGAAATT ATTGTCATGG ATAGTGGAAA AATTATAGAA AGAGGGTCTG 1920

ACAAAGAATT AATGTCAAAA GATACAAGGT ATAAGAGCCT GCAAGAGATG TTTAACAGTG 1980

CGAATGAATG GAGGGTTTCA AATGAAAGAG TTTTATAAAA AAAGATTTGC TCTTACAGAT 2040

GGAGGAGCAA GAAATTTAAG TAAAGCAACA CTGGCTTCAT TTTTCGTTTA TTGTATAAAC 2100

ATGCTTCCTG CCATATTACT TATGATTTTT GCTCAGGAAG TTTTGGAAAA TATGGGCAAA 2160

AGCAATGGCT TTTATATAGT ATTCTCAGTT TTGATTTTGA TAGCAATGTA TATTTTGCTT 2220

TCTATCGAAT ACGATAAATT ATATAACACA ACCTATCAAG AAAGTGCAGA TTTAAGAATA 2280

AGGACAGCGG AGAATTTATC AAAATTACCT CTATCTTACT TTTCTAAACA TGACATTTCC 2340

GACATTTCAC AAACAATCAT GGCTGATATT GAAGGCATAG AGCATGCAAT GAGCCACTCA 2400

ATACCAAAGG TGGGCGGCAT GGTACTGTTT TTCCCATTAA TATCTGTAAT GATGCTAGCG 2460

GGCAATGTCA AGATGGGTTT AGCTGTAATT ATTCCATCTA TTTTAAGCTT TATATTTATA 2520

CCTTTATCTA AAAAATATCA GGTTAATGGA CAGAATAGAT ATTATGATGT CTTAAGAAAA 2580

AACTCAGAAa GCTTTCAAGA AAATATCGAA ATGCAAATGG AGATTAAAGC ATATAATTTA 2640

TCGAAGGATA TTAAAGATGA CTTATATAAA AAAATGGAAG ATAGTGAGAA AGTACACTTA 2700

AAGGCGGAAG TAACTACAAT TTTAACTTTG TCTATATCTT CAATATTTAG CTTTATATCT 2760

CTTGCTGTTG TGATATTTGT CGGCGTAAAT CTAATTATTA ATAAAGAGAT AAATTCTCTC 2820

TACCTTATAG GATATTTACT AGCTGCTATG AAGATAACAG ACTCTTTAGA TGCATCTAAA 2880

GAGGGCTTGA TGGAAATATT TTATTTATCG CCCAAAATAG AAAGATTAAA AGAAATTCAA 2940

AATCAAGATT TACAAGAAGG CGATGACTAT AGCTTAAAAA AATTTGATAT TGATCTAAAA 3000

GATGTTGAGT TTGCCTACAA TAAAGACGCA AAAGTTTTAA ATGGTGTAAG TTTTAAAGCT 3060

AAGCAGGGAG AGGTCACTGC TTTGGTAGGT GCAAGTGGCT GCGGTAAAAC AACTATCTTG 3120

AAACTTATAT CAAGACTTTA TGATTATGAC AAGGGACAAA TCTTAATCGA TGGCAAAGAT 3180

ATAAAGGAAA TATCAACAGA ATCCCTTTTT GATAAGGTGT CTATTGTTTT CCAAGATGTG 3240

GTTCTCTTTA ATCAAAGCGT TATGGAAAAT ATTAGAATCG GTAAGCAAGA TGCAAGTGAC 3300

GAAGAGGTTA AAAGAGCAGC AAAACTTGCA AATTGCACAG ATTTTATAGA AAAAATGGAT 3360

AAAGGTTTCG ATACAGTTAT TGGTGAAAAC GGAGCTGAGC TATCAGGAGG AGAAAGACAA 3420

AGATTATCAA TAGCCAGAGC CTTCTTAAAA GATGCGCCGA TATTGATCTT AGATGAGATA 3480

ACAGCAAGCC TTGATGTTAA CAACGAGAAA AAGATTCAAG AGTCTTTAAA TAATTTAGTT 3540

AAAGATAAAA CTGTTGTAAT CATTTCACAT AGAATGAAAT CCATAGAAAA TGCAGACAAG 3600

ATAGTAGTTC TTCAAAACGG AAGAGTAGAA AGCGAAGGTA AGCATGAAGA GCTTTTACAA 3660

AAATCAAAAA TTTACAAAAA TTTAATAGAA AAGACAAAAA TGGCAGAAGA ATTTATTTAT 3720

TAGGAGGACT ACAATGGATA ATAAAAAATT AAAAGTAAAA GATTTAGTAA GCATCGGTGT 3780

TTTTGGCGTA ATTTATTTTG CCTTCATGTT TGGAGTTGGT ATGATGGGCT TGATTCCAAT 3840

ATTGTTCTTA ATATACCCGA CAGTATTAGC CATAGTTGCA GGAACTGTTG TTATGTTATT 3900

TATGGCTAAG GTTCAAAAGC CATGGGCACT ATTTATATTT GGTATGATAT CACCACTTGT 3960

GATGTTTGCA GCTGGTCATA CCTACGTAGT TGTGGTTTTA TCACTTATAG TAATGATAAT 4020

AGCAGAATTA ATTAGAAAGA TTGGTAATTA TAATTCATTT AAATACAATA TGCTTTCTTA 4080

TGCAATCTTC AGCACATgGA TATGTAGCTC TTTAATGCAA ATGCTTTTAG CAAT^GAAAA 4140

ATATATGGAG TGGTCTTTGA TGACTATGGG AAAAGATTAT GTTGATGTAT TAGAAAAGTT 4200

AATAACTTAT CCTCACATGG CTTTAGTAGC CTTAGGTGCT TTCTTAGGAG GAATTCTTGG 4260

AGCATATATA GGCAAGGCTC TATTGAAAAA ACACTTTTCA AATGGATTAT ATTGTGTGGG 4320

ATACTTTACT CCTTGCCTAA TTTTATGGTG CTATCTGAAT TAAACCCTAT AGTTAAGATG 4380

TTTTTGAGTA TACCTATTGT TATTAGAATG TTTATTTTAC CATTTATGGC AGCAAGCTTT 4440

ATGATAAAGA CCTCGGATGT AGGCGCAATA ATTTCATCGA TGGATAAGCT TAAGATTTCA 4500

AAGAATGTAT CCATACCTAT TGCGGTTATG TTTAGATTCT TCCCATCTTT TAAGGAGGAG 4560

AAGAAAAACA TCAAAATGGC TATGAGAGTA AGAGGGATAA ATTTTAAAAA CCCAGTCAAA 4620

TATCTTGAAT ATGTTTCTGT GCCACTACTC ATTATATCAT CTAATATATC AGATGACATT 4680

GCAAAAGCGG CAGAAACAAA GGCAATAGAA AATCCAATTG CCAAGACCAG ATACATTCGC 4740

GTAAAGATAC AGCTAATTGA TTTTGTTTAT GTTTTAGCGG TTGCTGGACT TATTGTGGGA 4800

GGCTTAATAT GGTTGAAATA AAAAATTTAA GTCTTGATTA TGGTGAAGAG CATATATTAG 4860

ATGATATATC ACTATCCATA GCCGAGGGAG AGTGCGTGCT ATTTACAGGA AAAAGTGGAA 4920

ATGGTAAGTC ATCTTTAATA AATTCAATCA ATGGACTAGC TGTAAGGTAT GATAACGCAA 4980

AGACAAAGGG CGAAATAATT ATTGATGGTA AGAATATAAA AAATTTGGAA CTTTATCAAA 5040

TCTCAATGCT TGTTTCAACT GTTTTTCAAA ATCCTAAGAC ATATTTTTTT AATGTCAATA 5100

CGACATTAGA ATTATTATTT TATTTGGAAA ATATCGGTCT TGCAAGAGAA GAGATGGACA 5160

GGCGTTTGAA GGATATACTT GAGATATTCC CGATAAAAAA TCTTTTGAAC AGAAATATAT 5220

TTAATCTATC CGGCGGTGAA AAACAAATTC TTTGCATTGC AGCTTCTTAT ATAGCAGGTA 5280 

CAAAGATTAT AGTTATGGAT GAGCCTTCAT CGAATTTAGA TATTAAAAGC ATAAGTGTTT 5340

TGGCAAAGAT GCTAAAGATA TTAAAAGAGA AAGGCATAAG CATAATTGTT GCAGAGCATA 5400 

GAATTTATTA TTTGATGGAC ATAGTTGACC GTGTATTTTT AATAGATAAA GGAAAGCTTA 5460

AAAAAACTTA TACTAGAAGT GAATTTTTAA AGCTAGATAA AAATGAATTA AATGCTTTAA 5520 

GTTTAAGAGA TAAAGAATTA AGTAAATTAA AAGTTCCTTA TTTAAAAGAA GGTGGAGAGT 5580 

ATCAGATAAA AAATCTTAGT TACAAATTTA CTGATGATGA GTGTTTAAGC TTAAAAGATA 5640 

TTTCGTTCAA GCTTGGGAAA ATTTATGGCA TAATAGGATC CAACGGACGA GGAAAATCAA 5700 

CGCTTTTAAG ATGTTTAATA GGTCTTGAGA AAAAATCAAA AGAAGAAATT TATTTTAAGG 5760 

GAGAGAAGCT ATCTAAAAAA GAAAGACTCA AAAACTCTTC ACTTGTTATG CAAGATGTAA 5820 

ATCATCAATT ATTCACAGAT GAAGTATTCA ACGAGCTTAG ATTAGGAGTA AAGAATTTTG 5880 

ATGAAGAAAA GGCGAAAATC ATTTTAAACC CCAATTATTC ACCCCAAATC TAAAAACCAT 5940 

CCAGAATCCT TGCCTTAGCT TAGATCCTGG ATGGTTTCTT TTTTCACCCA ATGGGTGTTT 6000 

TTTACTAGAC AAAAAAGAGT TTCCCCTTTA TGGTATAAGT GTAGAAAAAA ACACAAAAAG 6060 

AAAGGAAACT CACATGAACA GTTTACCAAA TCATCACTTC CAAAACAAGT CTTTTTACCA 6120 

ACTATCTTTC GATGGAGGTC ATTTAACCCA GTATGGTGGT CTTATCTTTT TTCAGGAACT 6180

TTTTTCCCAG TTGAAACTAA AAGAGCGGAT TTCTAAGTAT TTAGTAACGA ATGACCAACG 6240 

CCGCTACTGT CGTTATTCGG ATTCAGATAT CCTTGTCCAG TTCCTCTTTC AACTGTTAAC 6300 

AGGTTATGGA ACGGACTATG CTTGTAAAGA ATTGTCAGCT GATGCCTACT TTCCAAAATT 6360 

GTTGGAAGGA GGGCAGCTTG TTCACAGCCA ACCTTATCCC GTTTTCTTTC CAGAACTGAC 6420 

GAGGAAACAG TCCATAGTTT GCGATGCCTC AACCTTGAAT TGGTCGAATT CTTTTTACAT 6480 

GTTCACCAGC TG 6492
(2) INFORMATION FOR SEQ ID NO: 189: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7174 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 189: 

AACTGAAGGT AAAGGCTTCG ACGCAGAACG TGACGCTGCC CAAGCTGCCC TTGATGACCT 60

TAAGAAAGCT CAAGAAGACA ACAACTTGGA CGACATGAAA ACAAAACTTG AAGCATTGAA 120

CGAAAAAGCT CAAGGACTTG CTGTTAAACT CTACGAACAA GCCGCAGCAG CGCAACAAGC 180 

TCAAGAAGGA GCAGAAGGCG CACAAGCAAC AGGGAACGCA GGCGATGACG TCGTAGACGG 240 

AGAGTTTACG GAAAAGTAAG ATGAGTGTAT TGGATGAAGA GTATCTAAAA AATACACGAA 300 

AAGTTTATAA TGATTTTTGT AATCAAGCTG ATAACTATAG AACATCAAAA GATTTTATTG 360 

ATAATATTCC AATAGAATAT TTAGCTAGAT ATAGAGAATT ATATTAGCTG AACATGATAG 420 

TTGTATCAAA AATGATGAAG CGGTAAGGAA TTTTGTTACC TCAGTATTGT TGTCTGCATT 480 

TGTATCGGCG ATGGTACCAG CTATGATATC ATTAGAAATA CAAACATATA AATTTGTAAT 540 

ACCGTTCATA ATTGGTATGA TTTGGACAGT AGTTGTATTT CTTATGATCA ATTGGAATTA 600 

TATAGGCAAA TACTAAGAAG AGACAAAAAT ATATAAATAT TTCTGTACTT ATAGGATATT 660 

TAAAATCAAA ATAAAGTTAA TTTACTTATT TGCAGAGGTT GCAACCCAGC CTCTGTTTTT 720 

CGATAAAAAG GGACGGAATC TCATTTGTTT GGGTTTTGTC TCATCAATAG AAAGGAACAA 780 

AGAGTGTTCG TAACTGAACA CGGGTTTCAG AATTTCTTAC TAAATATAAA AGAAAGGAAT 840 

TGAACCCGAC CTAAATGGTG GTTCGATTCA GAACATCAAT AGAAAGGAAT AAGGGTGTTC 900 

GTAACTGAAC ACGGGCTATG GACTGTGCCA AAAAGATAGT TTTTTCTAGG ACGTAAGCGT 960

CCGTCGTCAA AACTCCTAGA TGGCTGTGTC CGTTTGACGC CCTTTGTATC TTGAATTATG 1020

AACAATACTG AATTTTATGA TCGTCTGGGG GTATCCAAAA ACGCTTCGGC AGACGAAATC 1080 

AAAAAGGCTT ATCGTAAGCT TTCCAAAAAA TATCACCCAG ATATCAACAA GGAGCCTGGT 1140 

GCTGAGGACA AGTACAAGGA AGTTCAAGAA GCCTATGAGA CTTTGAGTGA CGACCAAAAA 1200 

CGTGCTGCCT ATGACCAGTA TGGTGCTGCA GGCGCCAATG GTGGTTTTGG TGGAGCTGGT 1260

GGTTTCGGCG GTTTCAATGG GGCAGGTGGC TTCGGTGGTT TTGAGGATAT TTTCTCAAGT 1320 

TTCTTCGGCG GAGGCGGTTC TTCGCGCAAT CCAAACGCTC CTCGCCAAGG AGATGATCTC 1380 

CAGTATCGTG TCAATTTGAC CTTTGAAGAA GCTATCTTCG GAACTGAGAA GGAAGTTAAG 1440 

TATCATCGTG AAGCTGGCTG TCGTACATGT AATGGATCTG GTGCTAAGCC AGGGACAAGT 1500 

CCAGTCACTT GTGGACGCTG TCATGGCGCT GGTGTCATTA ACGTCGATAC GCAGACTCCT 1560 

CTTGGTATGA TGCGTCGCCA AGTAACCTGT GATGTCTGTC ACGGTCGAGG AAAAGAAATC 1620 

AAATATCCAT GTACAACCTG TCATGGAACA GGTCATGAGA AACAAGCTCA TAGCGTACAT 1680 

GTGAAAATCC CTGCTGGTGT GGAAACAGGT CAACAAATTC GCCTCGCTGG TCAAGGTGAA 1740 

GCAGGCTTTA ACGGTGGACC TTATGGTGAC TTGTATGTAG TAGTTTCTGT GGAAGCTAGC 1800

GACAAGTTTG AACGTGAAGG AACGACTATC TTCTACAATC TCAACCTCAA CTTTGTCCAA 1860

GCGGCTCTTG GTGATACAGT AGATATTCCA ACTGTTCACG GTGATGTTGA ATTGGTTATT 1920

CCAGAGGGAA CTCAGACTGG TAAGAAGTTC CGCCTACGTA GTAAGGGGGC ACCGAGCCTT 1980

CGTGGCGGTG CAGTTGGTGA CCAATACGTT ACTGTTAATG TCGTAACACC GACAGGCTTG 2040

AACGACCGCC AAAAAGTAGC CTTGAAAGAA TTCGCGGCTG CTGGTGACTT GAAAGTAAAT 2100

CCAAAGAAAA AAGGCTTCTT TGACCATATT AAAGATGCCT TTGATGGAGA ATAATACTCT 2160

TCGAAAATCT CTTCAAACCA CGTCAGCGTT GCCTTGCCGT ATATATGTGA CTGACTTCGT 2220

CAGTCGTATC TACAACCTCA AAACAGTGTT TTGAGCAGCC CGTGGCTAGT TTCCTAGTTT 2280

GCTTTTTACT TTATAGATTT TTTAAGACTT TCCTAAGTAA TGACGGACGG TAGTGACCTC 2340

CTTCGAAGTT CCATACCTAA ACTTTGAACC TAAGTTTTAA AGTTTCCGGA CAGCTGAAAC 2400

CAAGCTGTTT CAGGTGTTTT CATTACGGCA GAAAGTCTTC GATTTAGTTG TGAAATGGTG 2460

AATGATACTC TTCAAAAATT TCTTCAAACC ACGTCAGCGT CGGCTTGTCA TGGGTATGGT 2520

TACTGACTTC GTCAGTTCTA TCCACAACCT CAAAACAGTG TTTGAGCTGA CTTCGTCAGT 2580

TCTATCCACA ACCTTAAAAC GGTGTTTTGA GCAGTCTGTG CCTAGCTTTC TAGTTTGCTT 2640

TTTGATTTTT ATTGAGTATG AATTACCTAA ATTATGATGC ATAGTTGATG GGATATATAT 2700

AATAGATTGA AATAGAATAT GAACAAATTG ATAAGAGGAT TTTAAAGTAA TCTCTAACAA 2760

TGCT'i"l"At;AA ACTATGGTGT GCTATTCTAA ATTCAATTCA CTATAACTTG TTTACGTTTT 2820

AAAAAAGAGC CGTCGGGCTC TTTTTACTTA TCTTCAGTTC CCTGCATTTC TTTTATCACA 2880

GCTAGTCTAG TCTGGATATC CTTTTCCAAG ACCTTAAACT TGTAAGTCAA GTCTTCTTGG 2940

TATTCCTTGA TAAGTTCTTT TTGCTGGTTA ATGATTTGCA GGCTGTTTTG GATAATATCC 3000

ACATCGTCCT TGATAGCTTG AACGCX5GTCA GTGGTATTCA AGACTTCATC TGTGATGGTT 3060

TGGCGATTTT TTGTAACCAG ATAACTTCCG GCTGCAGCTC CTGCAAATAG CAGTAGGTTG 3120

GATAATTTCA TAGCAACTCC TTAAGCGTTT TTGATGGTTT CAGCGACTTG AGCAAGTTTG 3180

TCAAAGTCTG GTTCGTGGGC GATAAAATCA ATCTTGAGGT CATCGTCAGC ACTGTAGCGA 3240

GGCACAAGGT GAACGTGAGT ATGAAAAACT GTTTGACCAG CGACTTCTTC ACAGTTGGAA 3300

ATGATATTCA TACCAGCAGC CTTAGTGACT TTCATGACTT TTTGAGCTAC TTTTGGTACT 3360

TGGGCAAAGA GTTGGcTGGC GCTCGTAGCA TCCATCTCCA AAAGATTGCG ATAGTGTTCT 3420

TTTGGCACGA CCAAGGTGTG TCCTAGTGTT ACTTGAGAGA TATCAAGAAA GGCAAGGACC 3480

TGCTCATCTT CATATACTTT TGAAGCAGGA ATTTCCCCTG CGATGATTTT ACAAAAAATG 3540

CAATCTGACA TAAAATCTAC CTCTACTGTA CTGAATTTTG ATATAATATA GCTACATTAT 3600

ACCAGATTTG GAGAAAATAT GTTAGAAATT AAAAACCTGA CAGGTGGCTA TGTTCATGTT 3660

CCTGTTTTGA AAGATGTGTC CTTTACTGTT GAAAGTGGGC AGTTGGTCGG TTTGATTGGT 3720 

CTCAATGGTG CTGGGAAATC AACGACGATC AATGAGATTA TCGGTCTGTT GGCACCTTAT 3780

AGTGGCTCCA TCAATATCAA TGGCCTGACT CTGCAAGGAG ATGCGACTAG CTACCGCAAG 3840 

CAGATTGGCT ACATTCCTGA GACGCCTAGT CTGTATGAGG AATTGACCCT CAGAGAGCAT 3900 

ATCGAAACGG TTGCTATGGC TTACGGTATT GAGCAAAAAG TGGCTTTCGA ACGAGTAGAG 3960 

CCCTTGTTAA AAATGTTCCG TTTGGAACAG AAATTAGACT GGTTCCCTGT TCATTTTTCA 4020 

AAAGGGATGA AGCAGAAGGT CATGATTATC TGTGCTTTTG TGGTGGATCC AAGTCTTTTC 4080 

ATCGTGGATG AGCCTTTCCT TGGTCTTGAT CCGCTGGCTA TTTCTGATTT GATTCAGCTT 4140

TTGGAAGTGG AGAAGCAAAA GGGCAAGTCT ATTCTCATGA GTACCCACGT GCTGGATTCG 4200 

GCGGAGAAGA TGTGTGATGC CTTTGTCATT CTTCACAAGG GAGAGGTGCG TTCCAAAGGC 4260 

AATCTCCTGC AACTACGTGA AGCCTTTGAT ATGCCTGAGG CTAGTTTGAA TGATATTTAC 4320 

TTGGCTCTGA CCAAAGAGGA GGATCTATGA AAGACTTGTT TTTAAAGAGA AAGCAGGCCT 4380

TTCGTAAGGA GTGTCTTGGT TATCTGCGCT ATGTGCTCAA TGACCACTTT GTCTTGTTCC 4440 

TGCTTGTCCT GTTGGGCTTT CTAGCCTACC AGTACAGTCA ACTCTTACAA CATTTTCCTG 4500 

AAAATCATTG GCCTATCCTT TTGTTTGTAG GAATTACGTC TGTTTTACTT TTACTTTnGG 4560 

GAGGAACTGC CACCTATATG GAGGCTCCAG ACAAGCTCTT TCTCTTAGTT GGAGAAGAGG 4620 

AAATTAAGCT CCATCTCAAG CGTCAAACTG GCATTTCCCT AGTCTTTTGG CTCTTTGTAC 4680 

AGACCCTTTT CTTGCTGTTA TTTGCGCCTT TATTTTTAGC AATGGGTTAT GGCTTGCCAG 4740 

TTTTTCTGCT CTATGTGCTT TTATTGGGGG TAGGAAAATA TTTCCACTTT TGTCAAAAGG 4800 

CCAGCAAATT TTTCACTGAA ACTGGACTGG ACTGGGACTA TGTTATTTCT CAAGAAAGCA 4860 

AGCGTAAGCA AGTCTTGCTT CGTTTCTTTG CCCTCTTTAC GCAGGTCAAG GGAATTTCAA 4920 

ACAGCGTTAA GCGTCGTGCC TATCTGGACT TTATTTTAAA GGCTGTTCAG AAGGTGCCTG 4980 

GGAAGATTTG GCAAAATCTC TATCTGCGTT CTTATCTGCG AAATGGCGAC CTCTTTGCTC 5040 

TCAGTCTTCG TCTTCTCTTG CTTTCCTTGC TGGCGCAGGT TTTTATCGAG CAAGCTTGGA 5100 

TTGCGACAGC AGTGGTAGTT CTCTTTAACT ACCTCTTGCT CTTCCAGTTG CTGGCCCTCT 5160 

ATCATGCCTT TGACTACCAG TATTTGACCC AACTCTTTCC GCTGGACAAG GGGCAAAAGG 5220 

AAAAAGGCTT ACAGGAGGTA GTTCGAGGAT TGACCAGTTT TGTTTTACTT GTGGAATTAG 5280 

TTGTTGGGTT GATTACCTTC CAAGAAAAAC TAGCCCTTCT AGCCTTACTA GGAGCTGGTT 5340

TGGTTTTACT AGTCTTGTAT TTGCCTTATC AGGTAAAACG TCAGATGCAG GACTAACATT 5400

GCTGATACGA CACTAAAAAA GAAGTTGAGT TCAGTCTGTC TCAACTTCTT TTTTGTTACT 5460

ACAGGATAAT GGTTGGTCCG TAGAGACTTA TACTCTTCGA AAATCTCTTC AAACCACGTC 5520 

AGCGTCGTCT TACCGTACTC AAGTACAGCT TGCGGCTAGC TTCCTAGTTT GCTCTTTGAT 5580 

TTTCATTGAG TATTAACTTG GTCTTGACTT GGTCAAAGTG GAAGCGGTCA TAGGCCCGCC 5640

AAGCGGCGCG AGTTGGAGCA TCTGGATCAA GAGCGCTGAG TCCCATGAGA AGACTGGAAG 5700 

TCTGGTAAAA TTTTTCTAGT TCAATCAAGA ATCGATTATC CACTGTTTCA GCCTTGGCTA 5760 

GAAAACCAAG AATAGAGTTT AATTGCTCCT GTU^GCGGAC GTCGTCAGCG CTTGCCTGTT 5820 

TGCATGCTTG GTAGGCTTTG TTTAAGTCAG TAATCAAAGT ATGAGCTCTT TTGATGGGGT 5880 

CTGTATCTGT CATGGGAATG CCTCCTTTAA TCTGGGTGCC AGTCTTACTT CTGGCAACTG 5940 

TGTTTTGATA CTGTTAGTTT ATCACTTTTA ATTCTTTTTT TTTATTCAAA TCTTTAATTG 6000 

TCATTGAAAT GTCTTGAATT GCGCTGAGTG AATTTTATGA TAAAATAGTT GTAAGCTCAT 6060 

CATGATGTTG TAGAAAATAA TCCTTTTAGG AGTTTTCAAA GACTGTTTAG GATTGGGTGT 6120 

GCTTGGGCTA GACCTTTTCT GTTATTCTTT TCTTAGGAGG AGAATCCAAT GAAATATATG 6180 

ATTATTCAGA CGCAGAAAAC AGTCTATAAA GTAAACATCG ACGATATCTA CTATATCCAA 6240 

ACACATCCAA CTAAAGCCCA TACCGTACAG ATTGTTACAG AAGAAGCTAG TTTTAATATG 6300 

CTTCAAAATT TAagTAATCT TGAGAACCAA TGTGGGGAAA CCTTGATGAG ATGTCATCGA 6360

AATTGTTTGG TTAATCTTGA TAAATTAAAA TCGATTGATT TTCAAGAAAG AATCCTTTTT 6420 

CTCGGAGAAG AAGGTCAATA CGCTGTCAAG TATGCCAGAC GTCGCTATAG AGAAATTCGT 6480

CAAAAATGGT TGAAAGAGGG AGAGTAAGAA GATGAGAATA TTTGTTTTAG AGGATGATTT 6540 

TTCCCAACAG ACTAGAATTG AAACGACGAT TGAGAAACTT TTGAAAGCAC ATCATATCAT 6600

TCCTAGCTCT TTTGAGGTAT TTGGCAAGCC GGACCAACTG CTGGCTGAAG TGCATGAGAA 6660 

GGGGGCCCAT CAGCTATTCT TTTTGGATAT TGAGATTCGA AATGAAGAGA TGAAGGGACT 6720 

GGAAGTGGCT AGAAAGATTC GGGATCGGGA TCCTTATGCC CTGATTGTCT TTGTGACGAC 6780 

TCACTCGGAG TTTATGCCCC TGTCTTTTCG CTACCAAGTG TCTGCTTTGG ACTACATTGA 6840 

TAAGGCCTTG TCAGCAGAGG AGTTTGAATC TCGGATCGAG ACAGCCCTCC TCTATGCCAA 6900 

TAGTCAAGAT AGTAAAAGTC TGGCGGAAGA TTGCTTTTAC TTTAAATCAA AATTTGCCCA 6960 

ATTTCAGTAT CCTTTTAAAG AGGTTTACTA TCTCGAAACG TCGCCCAGAG CCCATCGTGT 7020 

TATTCTCTAT ACCAAGACAG ACAGGCTGGA ATTTACAGCG AGTTTAGAGG AGGTTTTCAA 7080 

GCAGGAGCCC CGTCTCTTGC AGTGCCACCG CTCTTTTCTC ATCAATCCTG CAAATGTGGT 7140

GCATTTGGAT AAGAAAGAAA AACTGCTTTT CTTT 7174
(2) INFORMATION FOR SEQ ID NO: 190: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3207 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 190: 

CCACCAGGGA AAATCATTGA AGTTGGTAGT CACCAAGAGT TAATGCAGGC GCAAAGTTTC 60 

TACCATCATC TATTCAATAA ATAAGGAGAA TGTCATGAAT CCTAATCTTT TTAGAAGCGT 120 

CGAGTTTTAT CAGAGACGTT ACCATAACTA TGCGACAGTG TTAATTATAC CTCTTTCATT 180 

ACTATTTACT TTCATCTTGA TTTTCTCCCT TGTTGCCACA AAAGAAATTA CTGTTACTTC 240 

CCAAGGAGAA ATCGCCCCTA CAgTGTCATT GCCTCCATTC AGTCAACCAG TGATAATCCT 300 

ATCCTAGCTA ATCATTTAGT GGCAAATCAA GTAGTTGAAA AAGGGGACTT ACTCATCAAA 360 

TACTCTGAAA CAATGGAAGA AAGTCAGAAA ACTGCCTTAG CAACTCAATT ACAAAGACTT 420 

GAGAAGCAAA AAGAAGGACT TGGAATTTTG AAACAAAGCT TAGAAAAAGC GACTGATCTT 480 

TTTTCTGGCG AGGATGAATT TGGCTACCAT AATACCTTTA TGAATTTTAC TAAACAATCC 540 

CATGATATTG AACTGGGTAT CACAAAGACT AACACCGAAG TTTCAAATCA AGCTAATCTT 600 

TCCAATAGCA GTTCATCAGC TATTGAACAA GAAATTACAA AAGTTCAACA ACAAATTGGA 660 

GAATATCAAG AGTTGAGAGA TGCTATCATA AATAACAGAG CACGCTTACC AACTGGCAAT 720 

CCGCACCAGT CAATTTTGAA TCGTTATCTT GTAGCCTCAC AAGGACAAAC ACAAGGAACT 780 

GCAGAGGAGC CATTTTTATC TCAAATTAAT CAAAGTATTG CAGGTCTTGA ATCATCTATC 840 

GCAAGCCTCA AAATTCAGCA AGCTGGTATC GGAAGTGTAG CAACTTATGA TAACAGTTTA 900 

GCAACCAAAA TTGAAGTACT CCGCACTCAG TTTTTACAGA CAGCCTCACA GCAACAACTA 960 

ACTGTGGAGA ATCAATTAAC AGAATTAAAA GTACAACTAG ATCAAGCCAC ACAGCGTTTG 1020 

GAAAACAATA CCTTAACCTC CCCAAGTAAA GGTATCGTTC ATCTGAACAG CGAATTTGAA 1080 

GGTAAAAATA GAATTCCAAC TGGTACAGAA ATTGCTCAAA TATTCCCTGT CATCACAGAT 1140 

ACAAGAGAAG TACTAATCAC TTACTACGTA TCTTCTGACT ATCTACCTCT ACTAGATAAA 1200 

GGACAAACTG TAAGATTAAA ACTGGAGAAG ATTGGAAATC ACGGCACCAC CATCATCGGC 1260 

CAACTTCAGA CAATTGATCA AACTCCTACC AGAACAGAGC AAGGAAATCT CTTTAAATTA 1320

ACCGCTCTTG CAAAACTATC TAACGAGGAT AGTAAACTCA TCCAATATGG CTTACAAGGT 1380

CGCGTCACTA GTGTAACTAC AAAGAAAACA TATTTTGATT ATTTCAAAGA TAAAATTTTA 1440 

ACACATTCTG ATTAATTTTC AGATAACACT CTATAACTAT TTATTATCTT ATCAAAAAGG 1500 

AGAATCATAA CATGGATAAG AAACAAAACC TAACTTCATT TCAAGAACTA ACAACTACCG 1560 

AACTCAATCA AATTACAGGT GGAGGATTGT GGGAAGATTT ATTATATAAC ATTAATAGAT 1620 

ATGCTCATTA CATCACATAA GAACTTCATC ATCCAATACA ACTATAAAAA AATAAGACCG 1680 

AGAAACAAGT ACTCTCGGTC TTATTTTTCA TCATTCTGTA TGTATCACAG TAAGTACCTG 1740 

ACGAAAGACT TGATTTTGAC AGGTGGTATT TAGACTGGTA TTAGGATGGC TTTCCACAAT 1800 

CTTCATGACG GTATAGAGAC CAACTCCTCT CTCCTCCCCT TTAGAACTGG CTCCAAAGGA 1860 

GAAGATTTCA GAAATATCGA TGCCCTCTTC TTTGATGGAG TTTTCGATGA TAAAGGTCTC 1920 

CTGTGCTCCA TTTTTTAAAA AGGCGATTGA AACATGAGGT TGACTAGCTT CCACACTGGC 1980 

TTCAATAGCA TTGTCACAAA GGATAGACAC AATGGTTAGA AAATCAAGTA GACTCATCCC 2040 

CTCGACCTGA ATCTCCTCAG GAACTTCGAC ATTAAAGACA ATGTTCTTAT CTCTGGCTTT 2100 

TAAAAATTTC CCTGCTAGAA GACTTTTGAG GGCTTTATCA CGAATATTTA CCAATCTGCC 2160 

CAGGTCATAT TTATTGTTCT GCAATTTCTG ACTGGAATCC TTTAAGACGG AGCCATAGAC 2220 

CTCTTTTATC TGCTCCATAT CCTCCTCTTC AATGCCCAGA CGTAAGCTAG TCAAGAGGTT 2280

GGTATAATCA TGACGAAAGC TCCGTACTTC CTTGTAAAGC TCCTCTATAT GCCGACTATA 2340 

GCGTTCCATA TCTCTATAGC GCAGGGCCTG CTCTTGTTCC AATCTCTCAT AGAGTTTTTC 2400 

CTTCAAATAG GTATCCAATT TCTTGATAAC CCCCATAAAA AAGAGTAGGT AAAAGACTAG 2460 

GATGAGATGG CGAACAGTCT TTGATTGAAT ACTTTGTTCA TATTCAAAAA AAGACAGACT 2520 

TTCCATGACT AGATAGTAGC CACCCATTAT CCAGTTAATC TGAGTCAGGG ACTTTTGAAA 2580 

GGCTTTATCG AGAATCTCCT TTCTCAAGCT AGTAAAATCG TAGTCCAACC ATTTCAAAAA 2640 

AGCTAGAGAA ATGAAGAAAT TGAAAATTAT TATACATAAC CCAGTAAATG AGTAGCCATC 2700 

ATATACTTGC CCTTGTCCCA AAAATGGAAG CACAAAATAG GAGACTCCTC TATAAAAGAG 2760 

ATTCACCAAT ATCATTGGAA AGAGACCATA AAAGAAAAGG AGTTTTTTAG GAAGCCCTCT 2820 

CAATAATAAG AAAGATAAGC CTATGCCGTA CAAGGGTTCC ATAAAATAAG ATAGGTAAAC 2880 

ATTTCCTACT ATATAGCTAA TCATCACAAA AACAAAGGCC AACAGTATCT TCAAAAGAAA 2940 

GGCCTTAAAA ATCCTCTCGA AAGTAAGATC AATTCCATCC ACCTTAAAGA AGATGACAAT 3000 

TTCTAGTCCA TTAGTAACAA GTGTATACAA CAATATCCAA GCAATGTTCA TAAATTCTCC 3060 

TAGCTCAGTG TAATTTATTG ATGGCCTCAG ACACTTCCCT GACCTTATAA CGGGCGATTA 3120 
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GACAACTTCC ACCATTGGGA GAGAAGAGCA GTTTTTCTTT CTTATCCAAA TGCACCACAT 3180 
TTGCAGGATT GATGAGAAAA GAGCGGT 3207

(2) INFORMATION FOR SEQ ID NO: 191: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10357 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 191: 



CTGAATCAAG TGTACTGCAC CAGTTCGTGC ATCAGGCATA ACAACATCTA CAGATATAAT 60

ATTGTTTTCT GAGTCCGCCT CATAAGTTAA AATCATAAAT TTTTCGATAT TCGAATTTTT 120

AGTAGCTTGT TCAATTTCTT GAATCATTTC ATCAGAAACT AACTCCATCT GAATTGGAAA 180

GGAATGACTA TTTTCATCAT TTTTGTAGGA AGAATGTTGA TTAAGATAAA GTGTATTCAT 240

CTGAGCATAT TCAAATAAGT AGCCACTCTT ATTTTTTTGT ACCAAAGGAA ATTGGTTTGT 300

AAGTCGCTTC TTACCCTTTA TAATTAACAA TACTTTCCCA TATTTTTCTG TATTTGTTTC 360

AAATTCTAAA TATCCCCAAG TCTGTCCTGC TAATTGTAAT TTATACTCAA ACAAATCTGC 420

TGATGCAAAT GCAGTATCAA TATGATTAGG TCGCGTCCAT GCATAACCAT TCGACACTAT 480

CATTGTCTCT CTTTTTTCTA GACGTTCATC TACATAATCT TTTTGCCCTT TCATCAAAGT 540

ATCTACAATT TTTTGTGCCT CAAGCGAATC AAAGAGATCC TGATTCAACA TAATTCTTCC 600

TCCTCCAAAT ACTTTTTAAT GAATTATACC ATTTTCTTAA AGAAATTACT ACAATAATTA 660

TCTTTTTCTT AAAGTTCTGT GTCAGAGTAA TTTAGAAAAT TATATCTTCT ATAGTAAAAT 720

CAATTAAAAA CTGAACAAAT TTATTGGGAA ATTCAAATCG CTTTCTGAAA ATATTTTAGG 780

AACCGTAGTG TAATATTCCA GATTCAATTC ACTATAAAAC TGACCTTTCT CCTGCAAAAG 840

AAAAAGGAAA GACTTCCTTT CGTGCCTTTC CTCTTACTTG CTACTTGTTT GATTATTTTT 900

GGTAAGCTAC TGCTTGTCTG ATAAAATCCT GAATCGGCTC TCCTTGGTGG AGAGCTTTTA 960

CTATTTTCGA ACCGACGATA ACACCATCTG ACACCGCATT GAAGCGTTCC AGATCGGCTT 1020

GACTAGATAC ACCAAAACCT GTCAAGACTG GGATGTCGGC CACTTGATGA AGTTGCGCCA 1080

AGTGCTTGTC CAAATCTGCA CGGTAATTGC CTGATTTCCC TGTCACTCCA TTGATGgCAA 1140

CGGCATAGAT GAATCCCTCC GCCCCTTCAA TCAACTCTTT CTGGCGCTCA ATTCCTGTGG 1200

TCAAGCTTAC TAAAGGAATC AAGGCGATAT CTGTATTTGC CAAAAATGGT TCTACAAAGT 1260
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TGGCATGTTC ATGAGGCAGG TCTGGGATAA TCAAGCCCTT CACAGCTGTA TCAGCCAGAT 1320 

CTTTGACAAA GTTCTCCACA CCGTACTGAA AGAGGGGGTT GAAGTAGGTC ATGATGACCA 1380 

GTGGAATCTC TGTTTCAATG GTTTTCAAGG TTTCAACTAA AGCCTGGGTA GAGGTCCCGT 1440 

GGGCTAAACT GCGCAAGCCA GCTTCTTCGA TAACAGGTCC ATCTGCAACA GGGTCTGAAA 1500 

AGGGAATACC CACTTCAATT GCAGAGACAC CCAAATCTTC TAAAAAGTGA ATTGTTTCAG 1560 

CAAGACCGTC CAAACCTTTC TCGTGGTCAC CAGCCATGAT ATAGGGAACA AAAATTCCTT 1620 

TTCCAGCTGC TTTAATAGCA TTTAATTTTT CTGTTAGTGT CTTAGGCATG AGCTTCTCCC 1680 

TTCTTTGCTG CATCTGCTTC CAAGCGGTCC TTGACTTGAA CCACATCCTT GTCCCCACGA 1740 

CCTGATAGGC AGACAATCAT AGACTTTTCT GGTCCAAGTT CTTTGGCCAA TTTCACCGCA 1800 

AAGGCGATAG CATGGCTAGA TTCCAAGGCT GGGATAATCC CTTCCACACG AGACAAGAGT 1860 

TGGAATCCTT CCAAGGCTTC TTCGTCTGTC ACAGGGACAT AGCTGGCACG TTTAATATCG 1920 

TGGTAGTGAG AATGCTCTGG ACCGATACCA GGATAGTCCA AACCTGCTGA GATAGAGAAG 1980 

GCTTCAAGAA TTTGACCATG GGCATCTTGG AGCACATCCA TGAGGGAACC GTGAAGGACA 2040

CCTGGACGAC CCTTGGTCAA GGTAGCTGCG TGGTGCTCTG TATCCACACC AAGCCCTGCT 2100 

GCTTCAGTTC CATACATAGC TACTGACTCA TCTTCTACAA AGGGATGGAA GAGCCCGATA 2160 

GCATTCGACC CACCACCAAC ACAGGCTACT AGGGCATCTG GCAGATCTCG ACCTGTCAAG 2220 

tCACGGTACt GlTGTTTAGC CTCTCGACCG ATGACACTTT GGAAGTCACG AACGATTTCT 2280 

GGAAATGGAT GAGGCCCCAA GGCAGAACCA AGGATATAGT GGGTATCGTC GATATTAGCC 2340 

ACCCATGAAC GAAGGGCTGC ATTGACCGCA TCCTTGAGCA CGCGCGAACC ATCTGTTACA 2400 

GCCTCGACCT TGGCTCCCAA AAGCTCCATG CGGAAGACAT TGAGGGCTTG GCGTTTGACA 2460 

TCTTCCTCAC CCATGTAGAT GGTACATTCC ATGTTAAAGA GGGCTGCAGC AGTTGCAGTT 2520

GCCACACCGT GCTGACCAGC ACCCGTTTCT GCGATAATTT TCTTTTTACC CATGCGTTTG 2580

GCAAGCCAAA CTTGTCCTAA GGCATTGTTA ATCTTGTGGG CTCCTGTATG GTTAAGGTCT 2640

TCCCGTTTGA GATAAATCTT GGCTCCGCCA ATATGCTGGG TCAAGTTTTT TGCGTAATAA 2700 

AGAGGAGTTT CACGTCCTAC GTACTGGCGC AAAAGCTGGT TTAATTCCTC TTGGAAACTT 2760 

GGGTCTGCCT GACTTTCACG GTAGGCCTTC TCCAACTCCA AAACTGCTGT CATCAATGTT 2820 

TCTGGGACAA AACGTCCGCC GAATTTTCCG TAAAATCCAT CTTTATTTGG TTCCTGATAT 2880 

GCCATGCTTT ACCCTCTCTA TAAATCTTCT AATCTTTTCA TGATCTTTTT GTCCATCTGT 2940 

CTCCACTCCG CTCGATACAT CTACTGCATA GGGAGTAAAG TGTTGAATTG CTTTTACTAC 3000 

ATTATCTTCA TTAAGGCCAC CTGCGATAAA GAAGGGCTGT GCTAGTCCAG TCGTATCCAG 3060 
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TTGACCCCAA TCAAAGGGCT GGCCACTTCC TGCCACAGGG GCATCAAAGA GTAGATAATC 3120 

TGCCTGAGAA TTGGGGACAT GCCCATTTCC ATCTACCTGC ACAGCCTGAA TACTGGCACA 3180 

AGGCAAATTC TCAAATAAAT CATCTGCCAC CTGACCGTGA ACTTGAACCA AGTCCAAGCC 3240 

AACTTTGTCA ATCGCTTCCA GCAGTTCTAC CCGACTTGGT GAAACAAATA CTCCAACCTT 3300 

TTTCACATCT GCAGGAATAA GCTTTGCCAA CTCAGCTGCC TCTTCTAAAG TCACCTGTCT 3360 

TTTACTAGGT GCAAAGACAA AACCGATATA GTCGGCTCCT GCTGAAACGG CTGTTTCCAC 3420 

CGCTTCTTTG GTCGATAGTC CACAAATTTT AACCTTTGTC AATCTGCAAC TCCTTGATTC 3480 

TCTGGGCCAC ATTTTCTGCC TGCATAAGAG CTGTCCCTAC CAAAATTCCG TTAAAGTATG 3540 

GGGCTAGTCG TTCCGCATCC TGCCCTGTGA AAATGGCAGA TTCAGAAATG TAATAGCGAC 3600 

CTTCCTCTAA GTAAGGGGCT AAATCTACAC TGGTCTGCAA GTCGACCTCA AAGGTAGTCA 3660 

AGTTGCGGTT GTTGACCCCG ATAATCTCAG CACCAAGTCT GTGGGCTACC TCTAGTTCAG 3720 

CTAGATTGTG AGTCTCCACT AAGACTTCCA GACCAAGCTC TGTCGCGTAG TCATACAGTT 3780 

CCTTGAGGCG TTCTTCGGAC AAGGCTGCCA CAATGAGCAA GATAACTGTC GCACCTGCAT 3840 

TGCGAGCGCG GATGATTTGC TTTTCATCGA TGATAAAGTC TTTGTTGAGC GTCGGAATCT 3900 

CTACCTGACT GG/U^TTTCC CGTAGATAAT CCAAATGCCC TTTAAAGAAA ACCTCATCTG 3960 

TCAACACCGA AATCATCACT GCTCCGTTTT CTTCATAAGT CTGGGCCTGT TGCACAATAT 4020 

CCACATCGAG ATTGATATCT CCCAAACTAG GGCTAGCTTT CTTGACCTCA GCGATTACCT 4080 

GCAAGCGGTC CTGATGATTC TTCAAAAATT CTGCCAAGCG ATAGGTCTGG CGCAGAGGCT 4140 

GGATTTGCTC CAGCTTCATC TGCTCCACCT CACGCGCCTT CTGCTCTAAG ATTCGTGCTA 4200 

AAAATTCCTG ACTCATTTTT GGTACTCCTG TAACAGTCTG AGTTTTTCAA GGGCCTTGCC 4260

TCTAGCAATC ACTTGACGGG CCAAGGCAAC CCCTTCCTTG ATGCTATCAA TCTTACCATT 4320

AGCATAGAAA CCAAGACCAG CATTCAAGAC TGTCGTTTCC AAGAATGGAC TTGCTTCGTT 4380 

TTTCAGAACG CTAAGCAAAA TTTCTGCATT TTCCTGAGCA TTCCCACCAC GAATATCTTC 4440 

CATAGCATAG CCTTCCATTC CCAAATCCTC TGGAGTAAAG CTTGACAAGC TGATTTCGCC 4500 

ATTTTCAAGA AGTGCAATCT TGGTTGTTCC GTTCAAGCCA GCTTCATCCA ACCCTTCTGG 4560 

TCCAGCAACC ACGATGGCAC GTTTGCGACC CATATTTTTC AAAACCTGAG CTGTACTTTC 4620 

TAGGAGTTCT GGACGACTAA TTCCAAGAAG CTGTGTTTCT AAAGCCATTG GATGAATCAG 4680

TGGACCAGTC AAGTTCATAA TCGTTGGAAT TCCCAATTCC AAACGAGCTG GCATGATGTA 4740 

TTTCATAGCT GGGTGCATAT TTTTAGCGAA GAGAAAGACG ATTCCAGTTT TATCAAAGAC 4800
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CTTACCTAGT TCAGCTGGTT TGAGGTCAAG ATTGATTCCC AAGGCTTCGA GGACATCTGC 4860

GGAACCAGAT TTAGAAGATA TCGAGCGGTT ACCGTGTTTG GCCATGTGAA TACCGCCACC 4920 

AGCCAAGACA AAGGCTGCAG TTGTGGAAAT ATTAAAACTG AAAGACTTGT CCCCACCTGT 4980 

ACCACAGTTG TCCATGGCAT CATGAATCTC AGTTGGAATA TGCTGGGCAT GTCCTCTCAT 5040 

GACTTGGGCA ATGGCTGTGC GTTCTTCAGG TGTTTCCCCC TTCATCTTAA GAGCTAAGAG 5100 

GAGAGAAGCA ATCTGCGCTT CAGTTACACG CCCAGTTACG ATACGCTCAA TGACATCCGT 5160 

CATTTCCACA CCTGATAAAT TTTCAAATTT TGCTAGTTTT TCAATAATCT CTTTCATCCT 5220 

AGTTTCCTCA CTTTACAACC TCCTCGATAA AATTCCGAAT AGAAGACAAG CCGTCTGGCG 5280 

TTCCAATGCT CTCTGGATGG TACTGGAAGC CATAAATCGG TAGGTTTTTA TGTTGAATCC 5340 

CCATGATGGC TTGGTCATCA GTCGAACGAG CTGTCACTTC AAAGTCTTCT GGCATTTCCT 5400 

CAATCAAAAT ACTGTGATAA CGCATGACCG CACGGCCATC CTCAATACCT TGATACAAAA 5460 

CAGATGGCGC TTCAAAGTTG ATATTGCTCT GTTTCCCATG CATGACTTTT GGAGCCAAAC 5520 

CTAGCTTACC ACCAAAGACT TCTGCAATGG CTTGGTGGCC CAAACAAATC CCAAGAATCG 5580 

GCTTCTTGCC TGCAAAATCA CGAATCATGT CTTCCATCTT TCCAGCATCA ACTGGCCAAC 5640 

CAGGACCAGG AGAAAAGACC AGACCATCTG CTTTTTCAGC TTCTTCATAC AGCTTGGAAT 5700 

CATCATTTCT CAGAACCTGA ACTTCTGCAA AATTCCGAAT GTATTGGGCC AAGTTATAGG 5760 

TAAAAGAATC ATAG'l-lXi'l'CA ATCAATAAAA TCATGGTCTT AGTTCTCCAA TTCTAGTCAT 5820 

AGATTTTGCT TTGTTAATGG TTTCTTGGTA TTCGTTTTGG GCGATAGAGT CGTAGACAAT 5880 

CCCTGCCCCA GCCTGCACAT AGGCTCTTTG ATTTTTGAGA ATCATGGTTC GGATGGCGAT 5940 

GGCCAAATCC ATATCACCCG TCGCAGACAA GTAGCCGATT GCCCCAGCGT ATACTCCCCG 6000 

TTTTTCCGTT TCCAGTTCAT AGATACGTCT CATCGCTCGA ATCTTTGGTG CTCCAGAAAC 6060 

GGTTCCAGCA GGAAGCGTTG CTTTCAAGGC ATCCATGGCA GTGAGTTCTG GAAGCAAACG 6120 

CCCCTTGACT ACGCTGGTCA AATGCATGAC GTAGCGGAAG AGCTCCACTT CCATATACTT 6180 

AGTGACTTGG ACACTGGTCG TTTCAGAGAT GCGGCCAATA TCGTTACGCC CCAAGTCTAC 6240 

CAACATTCGA TGTTCTGCTG TTTCCTTCTC ATCAGAGAGG AGGTCAGTCG CCAAGGCCTT 6300 

GTCTTCTTCA TCCGTAGCCC CTCTTGGTCG CGTCCCTGCA ATCGGATTGG TTGTCACGAT 6360 

GCCATTTTTG ACAGAAACCA AACTTTCTGG ACTAGCTCCG ATGATTTGAT AATCCCCAAA 6420 

ATCATAGAAA TAAAGGTAAT TAGAAGGATT AGTCACGCGG AGATTTCTGT AGAAGTCAAA 6480 

TGGATTTCCA GTAACTTCTG CTGAAAAACG CTGGCTGAGT ACACATTGGA ACATATCTCC 6540 

GTTACGAATC AAGTCACGAG CTGTTTCTAC CATTCCCTCA AACTTATGTG GAGCGATATG 6600 
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CGGTTTGAAG TCTAACGGAG ATAGATCCAA ATCTTCAAAT TCATTTGGAG CAGGAATGCG 6660 

TAATTCCTCA AGCACTTGGT TCAAGGATTT TTCCAAGGCC TCTTGACTGC GCTCACTATA 6720 

AAGTGCATCC TCTATGACAT GTATCTTCTC CTTCTTGTGG TCAAAGACCA TATAGCTCTC 6780 

ATAGACAAAG AAATGCATGT CTGGCGTCCC AATTGTATCC TCAGGGATTT GACCAATTTC 6840 

TTCATAAAGC GAAATCATAT CGTAACCCAC AAAACCAATG GCTCCACCAC CAAAAGGTAG 6900 

CTCTGAGTGG TGCTGACTCT TATGAATCAC TTCATAAAGG AAATCCAAGG GATCCCGATC 6960 

AATCACTTGA CCATTTTGAT AGAGAACCCC ATTTTCAAAC TTAATCTCAA AAACTGGATT 7020 

ATAGGCTAGG ATAGAAAAAC GAGCTGTTTC CTTGTCTCTC GGAATACTCT CTAAAATAAC 7080 

CTTATGTTGC CCCTTTAAGC GCATATAAGC CAAGATTGGT GATAAGACAT CTCCATGAAT 7140 

GATTCGTTCC ATTGTAATTT CCCTTTCAGT TCTACTTCTA GTCCGTGGTG ACTGTATGAA 7200 

AAATCCCCAC GCAAAATAAC TTGCGTGAGG ACGAAATTCG CGGTGCCACC TCAATTATAG 7260 

GATTTCTCCT ATCTCTCATT CCTGTCTCAG ATATCTCCTG TAACAGGCTG TGCGATAAAG 7320 

GGCACTCCCT TGAGAATGAT GTTTTCTTCT CTCGTTTCAG ATGAACCCAA CTTTACAGCT 7380 

TTCTCTGCTT GTTTTCAGCA ACCACAAGCT CTCTGTGAGA GAAAGAACTG TAATTTTTCC 7440 

ATCTATTATT TTTTAGCTTC TAGTAGTCTG CAATCGCAGC TAGGTCCTTG CCTCCACGAC 7500 

CAGAGACATT GATGAAGAGA TGTTCATCTC GGTACACCTT TATACTCTTC GAAAATCTCT 7560 

TCAAACCGCG TCAACGTCGC CTTGCCGTAG GTATGGTTAC TGACTTCGTC AGTTCTATCT 7620 

GCAACCTCAA AACAGTGTTT TGAGCTGACT TCGTCAGTTC TATCCACAAC CTCAAAACAG 7680 

TGTTTTGAGC TGACTTCGTC AGTTCTATCC ACAACCTCAA AACAGTGTTT TGAGCTGACT 7740 

TCGTCAGTTC TATCCACAAC CTCAAAACAG TGTTTTGAGC AGCCTGCGGC TAGTTTCCTA 7800 

GTTTGCTCTT TGATTTTCAT TGAGTATTAC TAGCTTTTTT CGTATTAGTC CAGCCTTTTT 7860

GTTTGCTTTT AGTAGTAGGC ATGGAGCTGT AGATAGAACT CAAGTTCATC AAAGCGACTT 7920 

AAGGCCCTAA TAAAAGATAA ACCAAACGAC GGATAGAAAA AAGCCCACAC ACAGAATATA 7980

CTTCCGTGTG AGGGCGTTGG TAACGCGGTG CCACCTCAAT TATAAAGGGA CTATCCCTTT 8040 

ACATCTCTGC CTTGTTTAAC AACAAGCTGC ACTGTAAGGT GTGCGCACCG AATTTTCATT 8100 

GTTTCAAATT CATTTTCAAA ATCAGCCCAC TTTCACTACT TCCAACCACC TATTCACAAT 8160 

CACCACAGGC TCCCTGAAGA TCAAAAATAG TTACTTTTCT GATTTGTTGA ACTTATTTTA 8220 

ATACTTTGTT TTTTCTTTGT CAAGACTTTT TTACGATTTT TTTGAAAATA TCATTCGAAT 8280 

ATGACCATGT CTTCCTTAGA TCGAACATGA ACATGTCCCA CTTCTTAGAA ATTGGATCCA 8340 
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ACTCAATAGA AACTGAATGG AGGCTAAACA GAACTTATTT TAGAACACTC CATCTTTTCC 8400 

ACTAGGATTT TCAAGAATTA AACAATACTA GAAACTCTGT CTCCTAACAA ATTTAGGAGA 8460

AACTTCAACA GATGTGACAC TTTCCCCTTT AATT^TTGCT AAAACACCTT CTATCATTTC 8520 

TTTAGCCAAT TTAACATAAT TGGGAGCAAT TGTAGACAAA GCTGGAGTAT AATACTGAGA 8580 

AATAGGAATA TTATCAAATC CAATGATAGA AATATCATCT GGAATAAGAA TTCCTTTCTC 8640 

ATAGCACGCA CGAATCAAGC CCTGAACCTT TTCATCTCCT GAAACAAAAA TAATGTCCGG 8700 

AT7VATTTTGG GTAGTCAAGT GCTGCATTGC ATAAGAATAA ACTGAATCAA TTGTAGATAA 8760 

GCCATAAATG ACTTTTAAAT CCATAAAGTA ATTTTTATCA TTCAGAAAAG AACGCACACC 8820 

TCTTTCACGA TCCTTATTAA CATGGGATTC TCCTCCCATA AGCAACCACA TATTTTTAAA 8880 

TTTTTCTTCA GTTACAGCTT TCATCATATC ATAAGTAGCT TGAAAATTAT TATTAGATAC 8940 

ATAGACTACT CCAGACGTTT GAGATTCACC GAAAACAAGA AAAGGCATAT GGTTCTTCTT 9000 

TAAATACTGA ATTCTGATAT CATCTACACT TTCATAAAAA ACAATAACAC CATCTACTAG 9060 

GCTACCTGTG CTTGATATAA TTGAATTACT AATTGTATCC TCCTCTCCAA AGTACTCAAC 9120 

TATAGCATTA ACACCAAATT CTTTACACGT CCGTAACACT TTATCTAACA GCGTATGAAA 9180 

CCAAATTAAA GGAAAAGAGT CGATTTTTTT TACAGAAATC AATATATTTA TAGCTTCTTT 9240 

TTTAGTTAAA TTTTTTGCAT ACGCATTTGG AATATACGAC AATTCCTCTA TAACTTTTTG 9300 

AATCGCTtGA tAagTTTCTT CTTTAACATT TACTCCACCA TTAATAACTC GTGAAACTGT 9360 

TTTTGGAGAA AAACCTGATA AACGTGCAAT ATCATAAATA GTTACCTTTT TCCCATTTAT 9420 

ATTTTTCATT TCAGTCCTCC ATTACGAACA TTCTAATATT ACTATACAAT ATTTAATTTT 9480 

TTTTAACAAG AGAATTTAGT AAATTATTTA AGATCCACAA ATTCACAAAA TTAATTTTAC 9540 

AAATATTCTT CCCCTTCAAA AAAGTTTAAA TTGCATTTCA CACCTTTATT TTTAAGAATG 9600 

TTTCCAACTT CACGACAAAT AAATTCATAT GAGAAAAAAC TGCCATAAAA TTGTAGATTA 9660 

ACTTTTTCAG TAAAATGTGT AGGATTTATA AAAACATATA ATAGCCTGTC AATGTAACAT 9720 

TTTAACATAG AGTTAATTTT TTCTTTAAAG ATAACATTTG TTATCAACTC ATCAGGAGGT 9780 

AAATGAAAGG CAAACACCAT TTCACAAATA TCATAAAAAG AAATAAATTT GTATACTTGT 9840 

ATCAAACAAT TATTATCAAA ATATTCTATT TTACCTAAAT CAA/^TTGAT TTTATAATCT 9900 

TTCATAAAAA CCTCTGAGCA AAAATCTACT CAAAAATTAG ATGATTAAAA CATCTAAAAA 9960 

GCAAAAGGAC AAAAACATCT GTCCCTTTGT TTACTAAATT TCAGCTAATT TCTTCGACAT 10020 

AAATAACACC TACAATATTA GCAATTTCTT CCATCAGTCG AAGATGTTCA AATCTACCTG 10080 

ATAATTCCAG AGTAATAAAT GACGCTATTT TTTTGTCCGG AACATCAAAG TATTCAATTC 10140 
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TGTCAGAATT AACATCTCCA AACGCTGTTC TTGAATCGGT CATTCTGATA CCATTTTCTG 10200 

CACAATAAAC CAATACACGA TTATAGGCTT CTGTAGATTT AACCACTATA TACAATTCAA 10260

TCATTTTAGA ACGATTTTGC AGATATTTTT TTAGTGGTTG GAACATGGAT ATCACACCCC 10320 

AAACAGAAAT GGCTACTAAA AGAGCTCCCT CATAAGG 10357 
(2) INFORMATION FOR SEQ ID NO: 192: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6867 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 192: 

CGGGACATTC TCAATCTTCT GTCTTTTGTT TTTCTCTTCT TTCTATGATA CAATGGAAAA 60 

AATAAATTCA AAAGGAGTTT TTTTATGACT TATCCAAATC TCTTGGACCG CTTCTTAACC 120 

TATGTTAAGG TCAACACGCG CTCTGATGAA CACTCTACTA CTACTCCAAG TACACAGAGT 180 

CAGGTTGACT TCGCAACAAA TGTCCTAATT CCTGAAATGA AACGTGTTGG ACTGCTAAAT 240

GTTTACTATC TACCGAATGG TTTTGCTATT GGAACCTTGC CAGCCAACGA TCCGTCTTTA 300 

ACACGTAAGA TTGGTTTTAT ATCGCACATG GATACTGCTG ATTTTAATGC TGAAGGAGTC 360 

AATCCACAGG TAATTGAAAA CTACGATGGT GGTGTGATTG AACTAGGGAA TTCTGGTTTC 420 

AAACTCGATC CAGCTGACTT CAAGAGTCTT GAAAAATATC CAGGACAAAC GCTCATCACA 480 

ACAGATGGAA CAACCTTGCT AGGTGCTGAT GACAAGTCAG GAATTGCTGA AATTATGACA 540

GCCATTGAAT ATCTAACTGC TCATCCTGAA ATTAAGCACT GTGAGATTCG TGTTGGTTTT 600 

GGTCCAGATG AAGAAATCGG TGTTGGTGCC AATAAATTTG ATGCAGAAGA TTTTGATGTG 660 

GATTTTGCCT ACACTGTTGA TGGTGGTCCA CTAGGTGAAC TTCAGTACGA GACTTTCTCA 720 

GCCGCTGGTG CTGAATTGCA TTTCCAAGGT CGTAATGTCC ACCCTGGTAC TGCCAAAGGG 780 

CAGATGGTCA ATGCCCTTCA GCTAGCAATT GATTTTCATA ATCAACTTCC AGAAAATGAC 840 

CGACCTGAGT TAACTGAAGG TTACCAAGGT TTTTACCATC TAATGGATGT GACAGGTAGT 900 

GTTGAGGAGG CGCGTGCAAG CTACATCATT CGTGATTTTG AAAAAGATGC CTTTGAAGCG 960 

CGTAAAGCAT CCATGCAATC TATCGCTGAT AAGATGAATG AAGAACTTGG GAGCGACCGT 1020 

GTCACTCTCA ACTTGACAGA CCAGTACTAC AATATGAAAG AAGTCATTGA AAAAGATATG 1080 

ACTCCAATTA CCATTGCTAA AGCCGTTATG GAAGATCTAG GTATCACGCC TATTATCGAA 1140 
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CCAATCCGGG GTGGAACAGA CGGCTCTAAG ATTTCCTTTA TGGGAATCCC AACTCCGAAT 1200

ATCTTTGCAG GTGGCGAAAA TATGCACGGA CGTTTTGAAT ACGTTAGCCT TCAGACTATG 1260

GAACGTGCAG TTGATACCAT CATTGGCATC GTAGCTTATA AAGGCTAAAA AGACGAGGTA 1320

GCTCAGCTAC TTCGCCTTTC TTTTTATTCT ACTGGTTTTT CTTGATTTCC AGTAGTTGTA 1380

GAAGATTCTG TTGTTTCATT TTCTGAAGTT GATTCAGCAG GTTTAGAATC TCTTGTATTG 1440

CTTGGTTTGT TTTCX3TCGCT AGCAGTTTCA ATGTTAGATT CTGCAGTTGC GTTTGGTTGG 1500

TTCTCAGCAC TGGTGTTATC ACCATTTGCT TCAGCATTTC TTGCTGGACT TGTTTCTTCA 1560

CTTGCGCTAG CTTTTGACTG GATTTGATGA TTCAAAACTA GAATAGCTTT TGTCGATTCA 1620

AGTAAAGCTG TTTTGTCTTT ACTCTTAGCA GAAAGTTGAT CTAATAATGC ATCCACCTTA 1680

TCAAAGTCCG CATCAGATCC ATTATTACTT TCTAAATAAG AGTGAAGCGA CATGAGAATA 1740

TCGTAGAGTT TTTGATAGAG TACAAGTGTC TGAGGATCTT GCTCAGCATT TTCCTTTTCT 1800

TGTTGAAGGG CGCTAGCGAT ACGAGTCAAG ACATCTTTTA CCTGACTGTT TACTTCATCC 1860

AAGTCTGCAT CAGCCTTGTT TGTGGCAGCT TTTAGATTTT CTACTTCTTC TGCCAAGGAT 1920

TGTCTGATTC CTTCTTCATG GATTTGTTCC AAGAGTTGAT TTGCCTTGCT CAAAAGACTT 1980

TCTACTTCTT CCTTGCTATC TGTCGCAGAT TATTGGTTGC TATCTACCAT GTACTCCTAA 2040

AACAGGAGAG TTATAATCCA AGATTACAAG GCCTTACAGA AATAAGAAAT CCAGATAAGA 2100

CaAtOttCGT CCAAGACGCT ATTCGCTTCG CACAGCAGCA CGGATTCAAT ATGCTTTAAT 2160

TTTAAAGTTT AGGTGTCAAG ACCTCTTTTT AGTGTGCCCA AAATTTAGAG AAGTAATCAA 2220

TCAACTAACT TTTATTTTTT TCAAACTTTC AGTAAACTGA CCTAAAGCTA ACTCAATCTG 2280

TCTTTGTAGA TGCTTCTGCT ATCAGCTAGA AGTTGATCTA CTTTTGCCAA GACTGCCTTC 2340

TCATCAAAAG TTCCAGGTTG ATAGTTGGAT TGCAGGGATG GAATCTTGTT TTTCAAAGCC 2400

GCTTCATATC CCTTAGTTTG AACCTTGATG TAGTGATTGT GGTCGCCATG AGGAATCACA 2460

AAACCTTCTG AATCTTCACT TATAATTCGA TTGGCATCAA AACCATGACC ATCTTCTTCC 2520

TCATGATGGA CATGTAGTGA CGGATTACTT AATACAGAAC TAGAAGAACT TCCTACCTCT 2580

TCCGTGTTAG AGTGTGATGG GGGATTGTTA AGAGATGACT TAGGAATATA GTGATAGTGA 2640

TCCCCATGTC TTACTATATA AGCATCACCT GTATCTCTGA CAATATCATT AGGGTTAAAG 2700

ACATATGTGG CTGCTAATTC ACCTGCCGAC AAGTCACTCT CAGGAATGAA ATGATAGTGA 2760

CCACCATGTG GTACTATAGT AGATTGAAAT AGAATATGAG CAAATTGATA AGGGGATTTT 2820

AAAGTAATTT CTAACAATGA TTTAGAAACT ATGATGTGCT ATTCTAAATT CAACTCACTA 2880

TATATAACCA TCATCGGTAG TATAACGTCC CTGTAATTTT GCTACAGATA CTTCTGCACT 2940
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AGCTCCTTTA TCGTCTTTAC CATGTTCTTG TTTTTGGCGA TTGATTTCAT CTTTTGTTCG 3000 

TACATTTTCT GCATGAGCTT GATCTTTAAG GTAAACATAA TACTTTCCAT CTACCTTAAT 3060 

AATATATCCT CCCTTAACCT AACTGACGAT ATCTTGATCT TTCGGCTGAT AGTTGGGGGC 3120 

TTTCATTAAT AGCTCTTCAC TAAAGAGCGC ATCAAAAGGA ACTTTACCAT TATAGTAGTG 3180 

ATAATGATCG CCATGAGAAG TTACATAACC TTGATCTGTA ATCTTAATAA CAATTTGTTT 3240 

TGCTTGAATT CCTTCTTTTT GACTAACCTA GTCTGGAGTC AAATTTTCAG TCTTCTTAGT 3300 

GTCTTTATTA CTGTTTACAT ATGAAACACG ATTTTTATCT GTATTGGCCT GTTAGCTATG 3360

TTGGTTCAGA GCATAAACAC ACAGACTTAA GGAAAGGATA ACAACAGATC CAGCTGCTAT 3420 

ATATTTCTTT TTAAATTTCA TAATTACCTC ATTTCTATAA TTATTTATAT GATGTCTTCA 3480

TTATTAAATG ATTAAATAAA TTAATTAACC AATTAATTAA CTAGTAAATA TTCCACCTCT 3540 

TTTTAAGTTG TATGTCAAGA AATTTTATAT ATTAATAATA AAATGAAATT CTCCCAAAGT 3600 

CAGAGTTTTA TTTCTAACTT TTGAGAGAAC TTCATTTTTG ATTCAGACTT TTTCTACTGC 3660

TATTCCTTAC GCTATGAGAT CAGATAAATT CTTTTTTATC ACTTCTCCAC TTGGCAATCT 3720 

TAATTCAATC GTTCCATCCA TATTGAATAT AACACTATCT AAGCCTAATC CGTAACTAGC 3780

TGTAAATTTT TCTAATTTTT CTTGTACAGG ATCTACTGCT GGAGCTTCCT CTAATGCTGG 3840 

ATCTAACATA GGGTCACTCC CCACATTCCC TTCTGGATTC AACATTCCAT TATCCGTTGA 3900

GTTTTCTGGT TTTACAGGTT TTTCGTTTGG TGCCTCTGGT AAAGAATCTG CTGGTTTATT 3960

TTCTGTTGGT TGGTTCTCAA CTGTTCCAGT AGATACTTTT CCATTTTCAG ATGGTTTATT 4020 

TTCACCATTT CCTTGAGGTG CTTCTCCTGT AAAATCTGCC ATATTCTTTT TAATGACTTC 4080 

TCCCGATGGT AAATATAATT CAATTGTTCC GTCCATATTA AACAAGACAT TTTCTAGCTT 4140

CATCCCATAA CTTTCAGCAA ATTTTGCTAC TTTTTCTTGT ACAGGATCCA CTGTAGGAAC 4200 

TTCTTCTAAC GTTGAATTAC TAGTACTATT CCCAGTTTCA GAAAGTTTTT CTTTTTCTAC 4260 

CTTCTCACTA GTCTTTGGTT CTTCTACCTT TTCATCAAGT TTTAAGTTTT CTTGTGCTTT 4320 

ATTCCTTTTA AATTGTGGTA GAATACTTGG TTTATCAGTT TGATTTTCTT TTTCCAAGAT 4380 

AGGTACTTCC ACAATATAAG TCGATTGATT GTCCAAATAA GCATTTGCCA TGAAGGTTAC 4440 

AGGAATTTTA TTTCCGGCCG TTCTGGTTGT TCCTTGGTTT AATTTCGGAA TCGGTAATTT 4500 

GATTTCACCA ACTTTATAGT TATTTTCTAA ATAAGCATTT CCATGAAATT CATCAAACAC 4560 

TCTGACTAAA GCATCAGTTC CTTTAGGCAC TGCAAATTGA GGGTTCACTC TTAAATAAGT 4620 

ATCCCCTGCA TGGAAAGGAT AGAAAATCGT TTGACTGGCC ATTTTGTAAG CTAAAGAGGT 4680 
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TGGAACTGTA AATGTACCAT CATAACTTAC TTCTGGATAA TCTTTTGAAG CGATAGTATA 4740

CTTAAATGTT TGTCCTGGTA AATAAGGTTG ATCTAATTCA AAGTTTGCAA TATTCCCTAC 4800 

TCCTTCTCCA AATACTTTAC CAGATACTTT CTCCAATACT TTTCCATCTG GTGTTATTAA 4860 

TTTTACTAGC ATATTGATAC CTAATTTTTT CTCCAATTCA GGCGGAAAAC TAAAAGAAAC 4920 

GCGTTTTTGA CCATTGGCTA GAGTAAAGTT TTGATTATTA AACGTACTAT TTTTTAACAA 4980 

ATTAACAACA TTCGTTAATT CTTCTCCAGT ATAAACTTTA TTCCCTTCTT TTTTAGCAAC 5040 

TCCTTCTTCG GGTTTAAACA GTTCATAGTT ACTGTGAGAA TGACCAATTC CAACCGGTTT 5100 

ATGTTCATCA ATCGGATCTG CATGATGGTG ATCTCCATGC GGATAAATAA TCGCATTTTT 5160 

TTCTTTATTC ACGACAATAC TTTCACGTTT GACACCATAT TGTTTCATAA TGCCAGCAAT 5220 

TTTTTCTTCG ATTTTTTTAT CTAAATCTTT CATTTCTTTG GCATTACTTG GATAATCCTG 5280 

TTCATGAGAT GACAAAGAAT CTAATCCATT ATGACTAGTT TTAACTTCCT CTAAATGTTT 5340

TTGCGCAsCT TAATTTGCTC TTCTGTCAAG TCCTTCTTGA AGAAATAATG ATTGTGGTCT 5400 

CCGTGACTCA TGACAAAACC TGATTCATCT TCAGCGATAA TACGATTAGC ATCAAATCCG 5460 

TATCCATCTT CTTCATGTTT CTCATGTGAA GTTCCTGGAT TGATTGGAAG AGATGGAGAA 5520 

GGTGTTGCTA GACTATTGTT TGGAAGAGTC GGTTGCCCAA TTTGATTTGA TTTTGGAATG 5580 

TAATGGAAAT GATCACCATG TCTTACAATA TAAGCTGTAG CCGTTTCTTC AACGATATCT 5640 

TTT-GGA'ri'AA AAATATAACC ATCAGATGCT GAAGAGAGCT CCTTACTTGT CGTTAAAGAA 5700 

GAAGGATTGC TTGAAAGACT GCCTAGACTA GACACTACTT CATTAGGTTT TGCATTTGTA 5760

GAAACTGTAG AACCAGTTCC ACTGATAGGC ACCATTCTGG CAATCTTTTC TTCTAAGGCA 5820 

GAAAGCTTGC TGTAAGGAAT AAAGTGGTAA TGGTCGCCAT GCGGAATCGC AACTCCATTT 5880 

GGTGTACGAC TGATAATCTT AGCAGGGTCA AAGACCAGGC CATCTGATTC ACTGTAACGT 5940 

TGGGCGCTAG GTGAATCATA GAGTTCCTTC AAAAGACTCT GGAGATTTTC AGATTTATTT 6000 

GCTGGCTTGC TAGTTGATCC TTTTGCTACA GATTGCGTGT TATTGTCACT AGCTGTTGAA 6060 

GAATAGCTTA ACTGACTCGG TTGCATATTT TTTCCAGCCA GATGTGCTTT AGCTGCTGCT 6120 

AATTCACTAG CAGATAAATC GCTTTTGGGA ATGTAGTGAT AGTGACCTCC ATGAGGAACG 6180 

ATATAAGCAT TACCCGTATC TTCGATAATA TCAGCTGGAT TAAAGACATA ACCATCATTT 6240 

GTCGTATATC GTCCCTGAGA CCTTGCTACA GC7UVCATTAG AGTTAACCTT CTCATTATCT 6300 

TTGACATGTT CTTGTTTTTG ACGATTGATT TCATCTTTAG TTCGAACATT ATCAGCATGA 6360 

GCTGCATCTT TCAGGTAGAC ATAATATTTT CCATCGACCT TGATGATATA ACCACCCTTG 6420 

ACTTCATTGA CAATATCAGC GTCTTTAAGT TGATAGTTTG GATCCTTCAT CAAGAGTTCT 6480
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TCACTAAAGA GGGCATCATA AGGAACTTTC CCATTATAGT AATGATAGTG GTCACCGTGT 6540 

GACGTTACAT AGCCCTGATC TGTAATTTTG ATTACAATTT GCTCAGCCTG AATTCCTTCT 6600 

TTCTGGCTAA CCTGGTCTGG TGTCAAGTTT TCACTTTTCT GACTTGACTG GCTGCCATCC 6660 

ACATAAGAGA CACGATTATT GTCCTTATTT TCCTGCGAAC GATGCTGGTT TAGTGCATAG 6720 

GCACATAGAC TCAAGGATAC GATAACAGCT GATCCAGCTG CTATATATTT TTTACTAAAT 6780 

TTCATAAATC CCTCATTTCA ATAAATGATG AAGTTTTTTC TCAACTTCTT TTACTTTATT 6840 

AAATAGTTTT CTAAACCCGG GGGTACC 6867 



(2) INFORMATION FOR SEQ ID NO: 193: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 999 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 193: 



CGTTCTAAAA ATGCAGTACG TTTGATTGAG AAATCAGTTA AAGGTATGCT TCCACACAAT 60

ACACTTGGAC GCGCTCAAGG TATGAAGTTG AAAGTATTTG TTGGAGCTGA GCACACTCAC 120

GCTGCACAAC AACCAGAAGT TCTTGACATT TCAGGACTTA TCTAAGGAAA 180

AGTATGTCAC AAGCACAATA TGCAGGTACT GGACGTCGTA AAAACGCTGT TGCACGCGTT 240

CGCCTTGTTC CAGGAACTGG TAAAATCACT GTTAACAAAA AAGATGTTGA AGAGTACATC 300

CCACACGCTG ACCTTCGTCT TGTCATCAAC CAACCATTCG CAGTTACTTC AACTGTAGGT 360

TCATACGACG TTTTCGTTAA CGTTATAGGT GGTGGATACG CTGGTCAATC AGGAGCTATC 420

CGTCACGGTA TCGCTCGTGC CCTTCTTCAA GTAGACCCAG ACTTCCGCGA TTCATTGAAA 480

CGCGCAGGAC TTCTTACACG TGACTCACGT AAAGTTGAAC GTAAGAAACC AGGTCTTAAG 540

AAAGCTCGTA AAGCATCACA ATTTAGTAAA CGTTAATTCG AAAGAATTAC TATACTTATA 600

CAGAGCACCT TTCGGGGTGT TCTTTTTTTA TACTTTCTTA CTAAATTGGT GCAATTGACA 660

CAGTTGTTGC GACTTTAGTC GCTTACAAAT GTGGCTGCAA CCTGACATGG TCAGTTGCCl' 720

CAAAACGTTA ATCAATACGA TTATATCAAC GTTTCAAAGC ACTCAAGGGT TTACCCTATG 780

GGTGCTTTTT TCTATACTTT CTAAAAAAGT TTACCCTAAA ATTTGCCCTA AAATTACCCT 840

ACTTATTTTT AAGATGTTGG TAGGCAACTT GTCCAGCAGA TAATGGAACT ATGTTTGAAG 900

TATTAACATA AGTCTTAGTT GTAACGGTAT CGCTATGAGT TAATGCTTCA GAAATGGCTT 960
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CTAAGCTCAT TCCTGCTTTT TTAGCAAGTG TCGCTCCTG 999 
(2) INFORMATION FOR SEQ ID NO: 194: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2315 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 194: 

AATATTATCA CTGTTCTTGA AGGCAGAACA CAAGCTGTCA TCCGAAATCA CTTTCTTCGC 60 

TACGATAGAG CCGTTCGTTG TCAAGTGAAA ATCATTACGA TGGATATGTT TAGTCCTTAC 120 

TATGACTTGG CTAAACAGCT TTTTCCGTGT GCTAAAATCG TTCTAGATCG TTTCCATATT 180 

ATCCAACATC TCAGCCGTGC CATGAGTCGT TTTCGTGTTC AAATTATGAA TCAGTTTGAA 240 

CGAAAATCTC ATGAATACAA GGCTATCAAG CGTTACTGGA AACTCATCCA ACAGGATAGT 300 

CGTAAACTCA GCGATAAACG TTTTTATCGC CCTACTTTTC GCATGCACTT AACAAATAAA 360 

GAAATTCTTG ACAAGATTTT AAGCTATTCA GAAGACTTGA AACACCACTA TCAGATCTAT 420 

CAACTCTTAC TTTTTCACTT TCAGAACAAA GACCCTGAGA AATTTTTCGG ACTCATTGAG 480 

GACAATCTGA AGCAGGTTCA TCCTCTTTTT CAGACTGTCT TTAAAACCTT TCTAAAGAAC 540 

AAAGAGAAAA TCCiTCAACGC CCTTCAACTA CCCTATTCAA ACGCCAAATT GGAAGCGACC 600 

AATAATCTCA TCAAACTTAT CAAACGCAAT GCCTTTGGTT TTCGAAACTT TGAAAACTTC 660 

AAAAAACGGA TTTTTATCGC TCTGAACATC AAAAAAGAAA GGACGAAATT TGTCCTTTCT 720 

CAAGCTTAGC TTTTCTTCAA CCCACTACAG TTGACAAAGA GCCTATTTTC GCTGATTCTC 780 

CACTACATTT GACTGGATTC TAATTTTTTA GAGAAATACA AAAGAGCTAG CTTTAGCTAG 840 

CTCTTTTCCT ATGCGGAGAG AGGGACTTGA ACCCTCACGA CCTAAAGCGG TCACAGGATC 900 

CTTAGTCCTG CGCGTCTGCC AATTCCGCCA TCCCCGCGTC GATTACTTTA CTAGTATATC 960 

AACTTTTGGG ATGCTTGTCA ACACTTTTTT TCAAATTTTT TCATTTTCAC CAACCAGGTT 1020 

ACTCAAAAAG TTCATTTAGA TTTTCATCTA CTAACTTAGC TCCGAGTGTA TTTTTGAAAT 1080 

GACCTAGGGC AAATTGATGA TTTTCAGGCC AGATGGAAGC AACAGCTGGT TTAACAATCT 1140 

CGATGTCATA TCCTAGATTA TAGGCATCTA TAGCTGTATG TAGGACACAG ATATCCGTCA 1200 

AGACACCTGT TAAGATAACG GTAGACACTC TACGCTCTCT CAAACGAATA TCTAGGTCAG 1260 

TCCCTGAAAA AGCTGAGTAA TGGCGTTTAT CCATCCAAAA GACACGACTG TCTGAACCAT 1320 

GCTCTTGATA AAAGATCCCC AAATCTCCAT ATAAATTCCG TCCACTCGTC CCAATCAGAT 1380 







TATGAGGAGG AAATAACTTA CTTTCCGGAT GGAAACAATC GTTTTCTTCA TGAGCATCAA 1440
TAGTAAAGAA GATATAATCT CCTCGTTCAA AAGCTAATCG AGTTACCTTG CTGATGGCAT 1500
CCGAAATCGC CTGAGCTGGA GCACCTGCTG TTAGTTTCCC ACTATCAGCA ACAAAATCTT 1560
CTGTATAATC AATCGAAATT AAAGCCTTTG TCATTAGTAA TCTCTTTTCT TCACTTCTTC 1620
AAAAATATCT GAAATCAAGA CCTTAAGATA GGTTCCCTTC ATTCCAAGTG AGCGACTTTC 1680
AATAATCCCC GCAGACTCAA GTTTACGAAG AGCATTGACA ATCACAGAGC GAGTGATTCC 1740
GATACGATCT GCAATCACTG ACGCAGTCAA CTTCCCTTCA TTTCCATTTA ATTCCCCTAA 1800
AATTGCTGAA ACAGCACGGA GTTCGGAGTA AGAAAGGGTA TTGACCGCCA TGGTGACAGC 1860
AGTACGACGA CGAATATTTT TCTCATCTTC TTCACGTTGG AAGTTAAGAA GCTGAATCCC 1920
AACAACGGTA CTGGCAATCT CAACAAGAAC CAAGTCCTCA TCTTCGAATT TTTTATCATT 1980
ACGCCAAATA ATCAAAGAAC CAAGGCGAAT CCCCGATACA TGAATCGGTG CAATAGTCGT 2040
CAAGCCATCT GGAAAATCAT CTCTACTCTC AATAGGGAAA ATACTCATAT CATGCTCAAC 2100
AGGCAAGTTT GCTTCTGTTT CGTAAATCAT ATTAGCCCCT TGAACGTAGT CATCTGGGAA 2160
AATCTTAGTT TGGAAGAATT GCTtACGCGA TCTGTATTTG TTTTATAACG CATAAAATAG 2220
CCAAGCAGAC GTCCCTTACT ATTGATAATG CAGGCATTGC AATGAATAAT ATCCGCTAAC 2280
TGACGCGTAA TAGCGTTGTA AGGGAGCTCA TCTCG 2315
(2) INFORMATION FOR SEQ ID NO: 195: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6693 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 195: 
CGATTTCTTC CATTTCTTCA AATAAGAATA CTTCATCTGA CATATGTGTT ACCTTCTTCA 60
TCAAAAATTA TTTTGTAATC GATTACATTG CAGATCGTAA CATAAAGAAA AACAGATGTC 120
AAATATTAAA CGTAAAAACA TGGTCACTAA AGAACTATAA GAGAAAAGGT AAACCTAGCG 180
ACGCGATGAA CGCTGGGTCG TTTGGTTTCG ATTGCTCTCT TCCTCTTGTT TTTTCTGTTC 240
TTCTTCTTGT TTTTTCTCAG CTTCCTTGGC CTCTTGTTTG GCTTTTTCCT CAGCTTCCAT 300
AATTAATTTA TCCGCCACAG TGTAGCTGTA GATTCCAGCT TCCATGTCGA CCACACTCGG 360
TTCTGACAAT TGAGGCTTAA TCTTACTGTA ATATGGCAGT TTCTTACTCA TTTCAGATAG 420



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AGGAACCAAG ACTTCGTCCG AATCATTCAT GGTCAATCGA ATTAAATCGG ATGTCACCTT 480 

GCTTGGGGCT AATTCCACCT TTTGGATAGC CGCCTTGAGT TCTGGGCTAA TTTGAGCAAG 540 

TTCTGAGACA AAAACTTTGA TTTGTTCACT ATCATTAAAG AGAACTGATA AATAAGTTTC 600 

TGGTAAACTG TTCAGACTCA CAGAACTAGT CTCAAGCTGA CCACTGGAAA GAATAGGATA 660 

ATGATTTTCA CCAGAAATAT AGTAGGCCAC AATATCATAT TCCTTGACCT TAATAGTGAA 720 

CTTAGTTGGA AATTGATAGA CAAGTTGAGC TGATTCAACC CAATAGTTAG ACTTAATCTG 780 

CTTTTCATAT TTTGCCTTGT CTAGCAGAAG GTTAATCGTA TAATCCGAAT CCTGAATGCC 840 

TGAAGCCTGT CGAATATCAT CAGCTGTAGT TTGCACCGTT CCCTCAACAC GAATATCTTT 900 

CATGGTCGCA TAAGGACTGA GCAAGTAGGC AGAGACAAAC AATAGAAGCA GACTTGGAAA 960 

TAAAATCGTG AAGGCTCGCA AGATATGGAT ACCAGGAATC TTTGCTTTGG CTGGTTTTTC 1020 

CTTTGTAGCC TTTTTAGCAA GCTTTTTATC CTGTTCCTCC TTCTCTTTAG ACTCTGGTTC 1080 

TTCTTTCTCT TCTTTCTCTT TGTCAGCCTC TGAGGATGCT ACTTTTTCTT CAGACTCTTC 1140 

CTTAGCTGAT TCTGAATCTT CCTGGTCTGT TTCACTCTCC TGGTCCTGTT TATCCTCTGA 1200 

CTTCTCAGAT TCTTCTCCCA TTCGAGCTTG TCTTTCCTTT TCCTTCTCCT CAGCTAGAGC 1260 

CGCCTCTTCT TCAGCCTTCT TTTTTAGATA TTCTTGGTTT CGTTTCTGCC ATTCTGATAA 1320 

CTCTTTCAAT TCTTCGAGGG TTTCTTTGTC CTCATTTTTC TTATCTTTTG ACATTTACTT 1380 

TCCtI-ATGAT AAATCTTTTT TCAACAATTG ATAAAAATCT GCTAGAGATT TCAATTCCTT 1440 

AGAAGCTTTC ATCTTAGCTT GGTAATCTTC CTTGTGACTT AGTAAGTGAG AAAGCTTCTC 1500 

TTCCAAACTA TCCAAGGTCA AATCGCTTTC TTGAAGGTCT TCTGCATAGC CTTTCTTAAC 1560 

AAAGTAAGCT GCATTTTCAA TCTGGTCACC ACGACTAGCT TCACGACCAA GCGGCACAAT 1620 

GACATGCAAT TTTGCTATCG CCAAGAGCTC AAAAATCGTA TTGGCACCAC CTCGTGTCAC 1680 

AACAATATCA GCCAATTCCA TCAAGGGTTG ATAGAGATCG GTCACATAGT CAACACGAAA 1740 

AAGATTTTGC CTCAACTCAT TCAGACTAGA ATCTCCAGTT AGATTGATAA TATTGTAGCG 1800 

CTCTGTTAGT TCTTTCTTAT GGTCTGTCAC CAATTGGTTA AAGACACGAG CGCCTGCAGA 1860 

ACCGCCAACA AACAATACAG TTGGCAATTT GGGATTAAAG TGGGTTTGAA TATCCACCAA 1920 

TTCATCTGGT TCTGGAGTGT TTTTGTCCGA AACCTTGGTC ACCGCTCCCA CATGCTCAAC 1980 

CTTAGCCAAA CTCGAAGCTT GTTCAAAGGT TGAATACATC TTAGTCGCAA ATTTATAGGC 2040 

GATTTTATTG GCCAAGCCCA TAGACAGGTC AGATTCGTGA ATAAAGACAG GCACTCCTGA 2100 

CACACGCGCA GCGATAACAG GCGGTACTGA GACAAAGCCC CCCTTTGAAA AAAGGGTCTG 2160 

TGGACGCAGT CGCAACATGA TAAAGAGCGA TTGGACAATT CCCCAACCAA CTTTGAAGAC 2220 
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GTCCAGCATA TTTTGCCAAG AGAAATAGCG ACGCAATTTT CCAGTCGCAA TAGAATGGAA 2280 

GGTGACATCC AAACCTGACT TAAGGATTTC TTGGTGTTCG ATACCACACT TGTCCCCGAT 2340 

ATAGTGGACT TCCCAACCAT CTTCGATGAA CTTGGGCATT AACAAAAGAT TGAGGGTAAC 2400 

GTGTCCAACC GTCCCCCCAC CTGTAAAGAC AATTTTTTTC ATATTATTCT TTTAACTCCG 2460

CTACTGTGTC GATAAAGAGG TCGCCACGTA CTTCAAAGTT AGCATACATA TCCCAGCTAG 2520 

CATTGGCAGG ACTAAGAAGA ACCACATCTC CTTGAGTCGC AAGCTCATAG GCCTTGCGGG 2580 

TCGCATCTGC AATATCTGTC GCCTCCACAT AAGCGACACC AGCCTTGTCT GCTGCCCGTT 2640 

TGACACGTTC TGCAGATTGA CCCAGGATGA CCATCTTCTT GAGTCCAGTA ATGTCTGGCA 2700 

CCAATTCGTC AAACTCATTG CCACGGTCCA AACCACCTGC AATCAAGACG ACCTTGCTGT 2760 

TGTCAAATCC TGACAAGGCT TTTTGAGTAG CCAAGATATT AGTTGATTTA CTGTCGTTAT 2820

AGAATTTAAC ACCCTTGATG TCATCCACAA ACTGGAGACG GTGTTTGACA CCACCGAAGG 2880

CTGAAAGAGT TTCCTTGATG GTTTGATTGT CCACATCACG AAGCTTGGCT ACAGCAATAG 2940 

TCGCAAGGGC ATTTTCCACA TTGTGGCTAC CTGGAACACC GATTTCATTC GCTGCCATGA 3000 

CTACTTCACC ACGGAAGTAG AGTTGACCAT CTTCCAGATA AGCTCCATCA ACCTTTTCAA 3060 

GTGTTGAAAA TGGTACAACA GTGGCTTCTG TCTTGGAAGT CAAGTCTTTT GCCAAGTCTT 3120 

GATTAAAGTT CAAGACAAGG AAATCAGCTG CTGTCATCTT GTTCTGGATA TTCCACTTGG 3180 

CTGCTACATA TTCCGAAAAT GACCCATGGT AGTCGATATG AGTTGGCATG AGGTTGGTAA 3240 

TAACCGCAAT CTCTGGATGG AATTCTTGAA CACCCATGAG TTGGAAAGAA GAAAGTTCCA 3300 

TAACAAGCGT GTCCTTATCT GATGCTATTT GAGCAACCTG ACTAGCTGGA TAGCCGATAT 3360 

TCCCTGATAA AAGACCATGT TGGCCAGCAG CAGTCAAAAC TTCCCCAATC ATAGTCGTTG 3420 

TGGTTGTCTT ACCGTTCGAT CCTGTGATAC CAATAATCGG TGCTTCTGAA ATCAAATAAG 3480 

CCAATTCCAC CTCAGTCAAG ACTGGAATTC CCTTGGCCAA AGCCTTTTCA ATCATGGGAT 3540

TGTTGTAGGG GATACCTGGA TTTTTCACCA TAAGGGCAAA CTCTTCATCC AAGAGTTCCA 3600 

AAGGATGGCC ACCTGTAATG ACCTTGATCC CTTCTTCCAG CAAACTTTGG GCAGCTGGAT 3660 

TGTCCTCGAA AGGTTTCCCA TCATTTACTG TCACAATGGC ACCTAGCTTG TCCAACAAAC 3720 

GAGCTGCAGA TTCACCAGAC TTGGCCAAAC CTAAAACAAG GACTTTCTTA TTTTTAAATT 3780 

GATCTATTAC TTTCATGTCT CGAACTCCAT TTCTACTCCT ACTATTTTAC CATTTTTATG 3840 

GAAATAAAAA AGCCACAAAG TGTGTTTGTG ACTCTTTCTT CTAACTGAAT CTTACCATAT 3900 

CATCTATGTG ATAAATCGGT AACTCGAATG ACCTGATCCA CTTGCTCCCA AATCAGAGGA 3960

TTATGGGTCG CAATAATAAT GGTCCGATTC GGATTTTTTA AAGATTCTAG GATGGAAAGT 4020

AATTCCTCAG AGTTTTTGGG GTCTAAGGAA GCGGTTGGTT CATCTGCGAG GATCAAAGGT 4080

GGATCCTTTA AAATTATCTT CGCTAGTGCA ACACGTTGTG CTTCTCCTCC TGATAACTCA 4140 

AATATAGGTT GCTTCAAATC CAAATAAGAG AGGTTTACAC GGTTTAGAGC TTGTTTCATC 4200 

AAAGAGATTT TCTCTTTTTC CTTCAACTTT TTACCAACTA AACCCAGATT GAGATTCTCT 4260

TTGACGGTTT GGCTTTCAAT TAAGCCAAAA TCTTGAAATA AGTATCCTAA GTAATCTCTA 4320

AAGAAAACAG AAGGCTTGAT GTCCTTAAGA GAAGTGCCAT CATAGATGAT TTGCCCTTTG 4380

TCATATGGCT CCAATCGTCC AATCATATTC AAGAGTGTTG TCTTACCACA GCCACTTGTA 4440 

CCGATTAAGG CATAAATTTT CCCACCTTCA AAATGAAGAT TCATATCTGA AAATAGCTGA 4500 

CGGCTTCCAA ATTTTTTAGA TATATTCTTT AGTTCAATCA TCCTATTTTC CTTTCATAAT 4560 

TGTCATAGAA ACACGAGATT CTTTCTGCGC TTGACGGTAA AGCGTCAAAA CTGCACTAGC 4620

TAGAAAGACC AATAAAGTGA GCAAGCCAAT CACCAAGTCT CGACTGCTTA AAATAAAGAG 4680 

ACTAGCACCA AATACAAAAC TAGCAAATTG GCTAACCATA TACTGAGCAT GTGTTTCAAA 4740

AAATCGTAAA CCTGAAATTC GTTTAATCAA GATATCTCGG CGGAATTGCT CGAAATATAG 4800 

AAGATTGACA GAATAAAAGA GTAACAAGGA ACTGGCTATT CCAACAATAG CTCCTAAGAT 4860 

TAAAGTTGCT GTTTCAGTTT GAACTTCATT ATAACGAGTT AGATAAACAC TTCTTCCTTC 4920

TTTAAGATAG GATACTTGC-T CataaATTCC AGCTTTCTTC AAGAGTTCTA GCCCACTCTC 4980

ATATCCTTTG ATAAAGAGTT GTTTTCCAGC ATTGATAGAC CAACTAGATA AGGATATAAA 5040

ACTATCACCT GTAGAAGTCG GCGTGAATAC CACTAAAATC GGATCAGTCA AATACTGAGT 5100 

AGATACGGGA TTCTCACCGT TATTATAAAC AAACCGCTTT TCTCCCATTG AAAGATAACT 5160 

AACGTGCGCT TTCATCTCAT AATCCAAAGG AGCACTTGCC TCCTCACCAG ATTTTCCATA 5220 

ATAACTCAAT CTTTCTTCAA AAACTTTCTT AAGTTCTGCT TCTCGAGAGC GCAAATGTTC 5280 

TGGGAGCAAG AGGATAAACT CACCTTTTTG GAGATGGGCT AACTTCTGTT TGGTCTCAGC 5340 

ATCTACCACG ACCTTTTCCT TGTCCAAATA ACTGGGACTA ACATAGAGCG TATTAGCATC 5400 

TGAACTATAG GTATCCAGTG TCTCTCCCTG TTCATTTTTT CCTTGTGGAT TGGCAAAATG 5460 

GAGCAGATTA TCCTTTACAT AAAGAGCTTG TTCTTCTTCG ATTGCTTCCT TGGCAAAGGC 5520 

ATACCACTTG CTCTGATTTT CTGTATCTTT TCCTCTATCA CCTAAGCCAA AGGAAATCTG 5580 

GTAATAGTCT GCTCTGTCCT GCCATGCTTG TTTTGAAATT TCAAGTTCTT TCAATCGTTG 5640 

GTAAGACGTC AAACCTGTCT TAACAGCGTA GCCTACTGTA AAAACAGCTA CTAACTGACA 5700 

CAATAGGGTT AAAGCCATCA AGCGTTTAAG GGGTAATCTT CCCTTAATAA CGGGAACTAA 5760

TGCTTTGTAA CTCAAACTCA TTAGGTAAAG GAGCATTAGT AAAATTGAAA TCGCCAATAA 5820

AAACAACAGA TAGAAACTAA TCCCAAAACC ATAGGTGGCT AACAAGATAG GATAAAACAA 5880

ACCTTGACTA AAAAGAACGA CTCCCCCACC TAGGAAGGAA AGGAGGGCTG ATAGAAGGAG 5940

CCATTTGATA TCAGTAGATA AAGAATGCCC CATGATGGAT AAGAGAGTCT GACCAGAAAA 6000

GAGTTTTATA CCTGCTGCTC TCATTTCCTT AATCCGAGTG ATAATCACTA AAGCAAAGAA 6060

AGATAAGCCA AATATTGCTA AACTAATTAA AATAAGGGGA TTTAGTAATA TTCGAAAAGC 6120

AAGAAAATAG GGCGGTATCT TTCGGTCAGC ACTTGCTTTA TAACCCAAAT CTCCTAATTT 6180

ATCGGCAAGC TTTTCTTTCG TCAAGGAGCC TGACAAAAGG AGATAACTAT TTAGCGGAnT 6240

AtACGTTCAC GACTTTCTTG GCTAGCTTCT TGGAATTCTT TTGGTAAAGT TCCCTGACCA 6300

TAAGTTGCAT AAGTAAAGTG AGTCGTCCCA TCCTTACTCG GCTCTACAAT TCTTCTAGCT 6360

ATTAAACTCT GTTCTGAGTT TGCAAAATTC TCCAATTCCT GTTCAAATAC CTCACGCGTC 6420

GGTTCCTGAG TATCTTTTTT GACACGAAGT AAAGAAACGG AATCATAGCT TGCATATAAA 6480

TATTGTGGCG CACGTAAGAC AATAATCCAA GCAAGGAAGA AGCTGAGAAA AAAAGTTGAT 6540

AATAATATGA ATAGTTTCTT CATAGTAGAC TCCTTGTAAA CAAAATTCCC CCTGTAATTT 6600

CTTACAAGGG GAACGATTTA AATCAATGAA CGATTAGTCA TAATCACAGT AAAATGCTAC 6660

TTGTTCTCCC CATTTAGTCC AAATCCATGC AGG 6693



(2) INFORMATION FOR SEQ ID NO: 196: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1847 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 196: 

CCGGTCTATG TACCCACTAC TTTGGGACAA TATGGGGATC AGCTACCCAA AACTAATCGA 60 

GCGTTTGGTT GACCTTGCCA AGGAAAGTTT TGACAAGCGC GACGATTTGA TATAAAATGA 120 

AAGAGAGGGT AGAAGCCAGA ACCATCACTG CACGGTGACT AGAGTTCTCG GACTTCAGCC 180 

CTTTTTAAAG GAGTAGAAAT GAAATTAACA ATCCATGAAA TTGCCCAAGT TGTTGGAGCC 240 

AAAAATGATA TCAGTATCTT TGAGGACACC CAGTTAGAAA AAGCTGAGTT TGATAGTCGT 300 

TTGATTGGAA CTGGAGATTT ATTTGTGCCA CTTAAAGGTG CGCGTGATGG CCATGACTTT 360 

ATTGAAACAG CCTTTGAAAA TGGTGCAGCA GTAACCTTGT CTGAGAAAGA GGTCTCAAAT 420

CATCCTTACA TTCTAGTAGA TGATGTTTTG ACAGCCTTTC AATCCTTAGC ATCCTACTAT 480

CTTGAAAAAA CGACTGTTGA TGTCTTTGCT GTTACAGGTT CAAATGGCAA GACAACGACT 540

AAGGATATGT TGGCGCATTT ACTGTCAACA AGATACAAGA CCTACAAAAC ACAAGGCAAT 600 

TACAATAATG AGATTGGCCT TCCTTACACA GTTCTTCATA TGCCTGAAGG AACAGAAAAG 660 

TTGGTTTTGG AGATGGGACA GGATCACTTG GGCGATATTC ATCTCTTGTC TGAATTGGCT 720 

CGTCCAAAAA CAGCCATCGT GACCTTGGTT GGAGAAGCCC ATTTGGCCTT TTTCAAAGAC 780 

CGTTCAGAGA TTGCTAAGGG AAAAATGCAA ATTGCAGACG GAATGGCTTC AGGTTCCTTG 840 

CTTTTAGCGC CGGCTGACCC TATCGTAGAG GACTATTTGC CAACTGATAA AAAGGTGGTT 900 

CGTTTTGGGC AAGGGGCAGA GCTGGAAATT ACTGACTTGG TTGAGCGCAA AGATAGTCTG 960 

ACCTTCAAGG CCAATTTCTT AGAGCAAGCC CTTGATTTGC CAGTAACTGG CAAGTACAAT 1020 

GCGACAAATG CTATGATTGC ATCCTATGTT GCCTTGCAAG AAGGAGTTTC AGAGGAGCAA 1080 

ATTCGTTTGG CCTTCCAAGA TCTTGAATTG ACGCGTAACC GTACCGAGTG GAAGAAAGCA 1140 

GCCAATGGAG CAGATATCCT GTCAGATGTT TACAATGCCA ATCCAACTGC TATGAAACTG 1200 

ATTTTAGAGA CTTTCTCTGC CATTCCAGCC AATGAAGGTG GCAAGAAAAT TGCAGTGTTG 1260

GCGGATATGA AGGAGCTTGG TGACCAGTCT GTTCAACTTC ATAATCAGAT GATTTTGAGC 1320 

CTTTCTCCAG ATGTGCTTGA TACCGTGATT TTCTATGGAG AAAATATTGC TGAATTAGCC 1380 

CAA-i-ruuCCA GTCAAATGTT CCCAATCGGC CACGTTTACT ACTTCAAGAA AACAGAAGAC 1440 

CAGGATCAAT TTGAAGACCT AGTCAAGCAG GTCAAGGAAA GCCTTGGAGC CCATGACCAA 1500 

ATCCTGCTCA AAGGCTCTAA CTCTATGAAT CTAGCCAAGT TGGTAGAAAG TTTAGAAAAT 1560 

GAAGACAAGT GATTTTGTCA AGTATTTGCA AAGAATGATT GCCATTACAG ATACTGGCTT 1620 

AACCTTTACA AAAGATCCGT TTGACCGTGA GCGCTACGAA GACTTGCGAA GTCTGTTATC 1680 

TGAAATGTTG AATCAAGCAT CAGACCTTGA TTCCGAAGAA GTGGCAGAAG TCTTGAAGCC 1740 

AACTTCTGCT TATGCGACTC CGTTAATGGA CGTCCGTGCT TGGATTGTTG AGGATGAGAA 1800 

GATTTGTCTG GTTAGGGGAC AAGGAGAGGA TAGTTGGGCT TTGCCGG 1847 
(2) INFORMATION FOR SEQ ID NO: 197: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1062 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 197:

CAAGCGAAAA CATTTTTTAT TCCAAATAAA CAGAGCATTT TAGGAGAACA AGAGATTTTG 60

AATGCCAAGT CGATCTTGGC CTTGCTAGAC GGTTTGGAGT CACATAGCTA TGATGTAGTC 120

TATCTCCGTC AGCCTCTTAA TCGTCTCGAA TATATCGAGT GTGCGATAGT GGGGCAATCA 180

CAATTTCTCT TTAAGGTCAG TTATGCTGAT GGTCAAAAGG CTTACCGTGT CGATCTTCCT 240

GACCTACTAA CAAAGACAGA CTGGCAGATT ATCAAGTCAT TTTTAGATGC TTTGCTTGCT 300

TATACAGGGA CTGATATTGA AGGGCTAGAT GGTTTTGATT TTG/^GCTTA TTTCCAAGCA 360

AGTATTCAAG CCTATCTAGC AGACCCTGTA GCTCGTTTTA CGATTTGCCA AGGAATTTTT 420

AATCCTATTT TCTTTAGTCG TGAGAACTTG AAAAGCTTTT TAGAGGCAGA TGGCTTGGCT 480

CAGTTTGAAG CGCGTGTGCG TGCGGTTCAA GAGACAGATG CCTACTTTGC GAGAGTTTCC 540

TTCTATCAGG ATGGAGAAGG AAAAGTGCAT GGCGTTTACC ATCTAGCTCA AGGAGTCAAG 600

ACAGTTTTAC CGAGAGAACC GTTTGTTCCT GCAGCCTATA TTGAGCAATT GGTGGATAAG 660

GAAGTCCAGT GGGAGATTGA CTTGGTTCAA ATCACAGGAG ATGGCTCTAA ACCAGAAGAC 720

TATGAAGCCA TTGCTCGCTT GGACTATGCA AAATTCTTAG AGGTATTACC CCCATCTTTT 780

TACCACCAAC TAGACGCCAA TCAAATAGAA GTGCAACCCA TATTAGACAA AGATTTTAAA 840

ACATTAGCAC AAGAAAAGTA AAGCAGAAGC AGGTCAATCG ACTTGCTTTT TTGACATAGA 900

AAAAATCCTG CCAAGaTGAC AGGATTGCTA CTCAATGAAA ATCAAAGAGC AAACTAGGAA 960

GCTAGCCGCA GCTGTACTTG AGTACGGTAA GGCGAAGCTG ACGTGGTTTG AATTTGATTT 1020

TTGAAGAGTA TGAAGTTTAA AGAAAAGCCA AGATACGAAG AT 1062



(2) INFORMATION FOR SEQ ID NO: 198: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6846 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 198: 

TATCTACAAC CTCAAAAACA TGTTTTGawG gCTCGTCAGT cTATCTACAA CCTCAAAAAC 60 

ATGTTTTgAa kGCtcGTCAG tTCTATCTAC AACCTCAAAA ACATGTTTTG AcaGCcTcGT 120 

CAGTTCTATC TACAACCTCA AAAACATGTT TTGAGCTGAC TTCGTTAGTT TCATCTACAA 180 

CCTCAAAAAC ATGTTTTGAG CTGACTTCGT TAGTTTCATC TACAACCTCA AAAACATGTT 240 

TTGangnCnT CGTCAGTTCT ATCTGCAACC TCAAAGCAGT GCTTTgagcG CTTCGTCAGT 300

TCTATCTACA ACCTCAAAAC AGTGTGTTGC GCAGCCTTTA ATCAGCCGCC TAGTCCGCTC 360

TATGGTATTC ATTAAGTCAA CATCTCTTGT TTAAGAGCAC CAAATCAGGA AATCTTCTCG 420 

ATTCCCTGAT TTTTTCTATT TACGTTTTCG TGTTGAGCTA CGTTCTGTCA AACCATGAGG 480 

TAAGAGAACT TCACGTTCTT CCAACTCTTC CTTATGCATA ATCTTGGTCA ACATACGCAT 540 

ACT/IATGGCA CCAAGGTCAT AAAGAGGTTG GGC7VATCGTT GTCAAGTTTG GACGGGTAAA 600 

GCGTGAGATT TGTGAATCAT CACTAGTAAT AATTTCAAAA TCTTCTGGCA CAGAAACACC 660 

CTTATCAGCC AAACCGTTCA AGACTCCTGC TGCCAACTCA TCACCTGTCA CAACTGCTGC 720 

AGTTGCATTT GATGAAATCA AACGCTCTGC TAAGGCGTAA CCATCATCAT AGCTATATTT 780 

AGATTCAAAT ACCAAACCCT CACTATAAGT GATTCCTGCT TTTTTCAAGG TTTCCTTGTA 840 

GCCAACTAAA CGAACCTTAC CATTGATGTC ATCCACTAGC GGACCGCTAA CGAAAGCAAT 900 

ACGCTCATTT TCTTTAGCAA GGTAACTCAC TGCATCAATT GTTGCTTGCT TATAGTCAAT 960 

ATTGACACTT GGCAACTGGT GCTCAACATC GACAGTTCCT GCGAGAACAA TCGGAGTACG 1020 

TGAACGCGAA AATTCTGAGC GAATTTTATC TGTCAAGTGA TACCCCATAT AGATAATGCC 1080 

ATCTACCTGC TTTGAAAAGA GGGTATTGAC AACAGAAACT TCTTTCTCGT TATCTTCATC 1140 

GCTATTAGCT AGGACAATAT TGTACTTGTA CATTTCTGCA ATATCATCAA TCCCCTTAGC 1200

CAAACTCGAA AAATAACCAT TGGTAATATT TGGAATCACG ACACCGACAG TGGTTGTCTT 1260

TTTACTTGCA AGACCACGCG CAACTGCATT TGGACGATAA TCCAAACGAT CAATTACCTC 1320 

TAGCACTTTT TTACGGGTAT TCTCTTTTAC ATTTTTATTG CCATTGACCA CACGGCTGAC 1380 

CGTCGCCATG GAAACACCTG CTTCACGAGC GACATCATAA ATGGTTACTG TATCATCTGC 1440 

ATTCATTCCT TTTCCTGTCC TTTCTATCTC ACACATTCTT TTACAAGTAG AGGTACTGAT 1500 

TGAAGCTCTA TATCTACTTA CAAAAGTGAA GATGTGAAAA TTTCGTTTTC ATATTTCTAC 1560 

TTATTCCATT CTATCACTAA TTGTAAACAC TTTCAAGTGT TTTTTGAAGA TTGATTGAAA 1620 

AAATTTCATA GAAAACCTAG GTTTAGCTCC TTGCTACCAC CTTAGACTAA ACAAAAAGGA 1680 

GGAAACTAAG CCCTCCTAAA GTTATAGTAA AATGAAATAA GAACAGGATA AATCGATCAG 1740 

GACAGTCAAA TCGATTTCTA ACAATGTTTT AGAAGTAGAG GTGTACTATT CTAGTTTCAA 1800 

TCTACTATAG GTATTGTTCC ATTCACTACC GTCAATTTTA GCACATAGTC TTCATGAAAA 1860 

TATTATATCA TCATAACCAA CCAGATTCTT TCGCGATATT AGCTGCCTCT GTTCGATTAC 1920 

CTGCATCTAG TTTCGAAAGA ATATTGGTGA CATAGTTTCG GACTGTTCCG TTGGATAGAT 1980 

AAAGTTTGTC TGCAATTTCT TGGTTAGAGA AGCCCTGAGC AATTCCCTTT AAAACTGCGA 2040 

TTTCTTGCTC CGTTAATGGA TTGGGATGCA TCATCACCAC TTCCATCAAT TCAGGCGAAT 2100

ACTCCTTGCG TCCTTCGAGG ACGGTGTGCA AGGTTTGCAT GAGGTCTGCA ATGTTTCTTT 2160

CTTTTAATAC ATAAGCATCT ACTCCAGCCT TGACCGCACG TTCAAAATAC CCAGGACGCT 2220 

TGAAGGTCGT CACCACAACC ACCTTTGTTT CAAGCTTTTC TGCTCGTATC CACTCCAAGA 2280

CTTCAAGACC TGTCTTAACA GGCATTTCTA CGTCAAGGAT GGCGATATCT ACAGACTCCT 2340 

TTTCTAATAG TTGGATTGCT TCTTGCCCAT TCTTGGCTTG AAAGACAGAC TCTACATCCG 2400 

GTTGAAGCAT GAGCAACTGG CACATGGCAT CTCGCAACAT ACTTTGATCT TCTGCGACTA 2460 

ATACTTTCAT CTACTTTCTC TCCTTATAAA GTAGTCGAAC CTGCACTTCA GTTGGATGTT 2520 

TCTGACTGAT TACACTTACT TCTCCTGAAA ATGGAAAAAC ACGATTTCGG ACTGTATGGA 2580 

GCTCATCCCC GCTTATAGAG GCAAAGCCAC AGCCATCATC TCTCACTGTT AGAATGAGTT 2640 

CTTTCTCTGT CCGTTCTAAT TTCAAGTAGA CTTTAGACGC TTTAGCATGT TTGATGATAT 2700 

TGGTCACTAA TTCAAGCAAA ATCATGGAAG CCGTTGACTC CAATTCCTGA GTTAAGCTAG 2760 

ACTTGTCCAA GTGATTCTCA ACTTGAACCT CAATTCCAGC AATTTCTAAC ATCTTTTTCA 2820 

CAGTCTCTAG TTCGGATGTC AAAGTTCTAG ACTTAAGATT TTCCACAATG GTTCGCACTT 2880 

CATTCATGGA tCCTTGCTGA TCTGGTGAAT TTCTTTTAAT TCCTTTTCCA CCTGTGGATA 2940 

AGCCTCCATC TGAAATAACT GCAAGGCTAA ATCTGTCTTG ACACTCAGCA TAGCAAAGGT 3000

ATGTCCCAGA CTATCATGCA AATCCTGACC GATACGACTA CGTTCATTTT CAGCAAGCAA 3060 

TAGATTTATC TGAGCATTTT GCTTGACCTG AGCTTCTTTC AAATCCTCX3A CAATACGAAT 3120 

CCGAACCAAT CCAAAAGTCA TTAAATCGAC AAAAGTAAGA ATTACAAGTA GATAGAATAG 3180 

AAACTCAACT TCGATTCTCT GAAAAATCAA CAGTTGCCCC ACAACAAGGA CTTGAGCAAG 3240 

AAGAAAAGTC CAGACATGTA AAGACTTTAA ACTACGTACG CTGAAATGAT AACTTAAGAG 3300

ATTGGATAGG AAAAAGAAAA ACCAGATATA ATTAACAGCA ACAAAGGCAG TATTCCCAAC 3360

TACATAAGTC AGCATGAGGC CCCAATATAG CCAAGATAGG CGCTGGCTCT TAGTTGTTAA 3420 

AACACCCAAA TATGCCACTA CAAATAGAAT ATCAATCAAT AAATGCCAGG CAGAAAGCCA 3480

CCCAGTCACT ACAGACAGGA TGGGGAAAAT CATAAAAATT AAACTGATCC AAAACATATA 3540 

ATGTATTCTT TTCAGTCTTT CAAGCATTAA GCATTCTCCT TATGACCTTG AAGGTAAATG 3600

GTCAAACCAA ACAAAACTAC TGAAAAAACA AGTAAATAAA CTGTGGCTGA TAGATTGATG 3660 

CCACCCTCAT TTAAGAAGGT CTTGAGCAAC TCCATCAACT GATAGGTCGG GAGACACTTA 3720 

CCTACTACTT GCATCCAGTC TGGAAATAAA GAGATAGGCA TCCAGAGTCC ACCTAAAACA 3780 

GCCAACCCTA GATAAAGAAG ATTGCCCACG ACAGACATCA ACTGACTAGT TGGTAAGAGA 3840

GTCAAGGTCA AACCAAGCGC TACGAAGGCA ATACTTCCTA CTATCAGCAA AAGTGCAGGC 3900

CCAATCCAAT TTCCAAGAGA CATGTCCACA CCTCTTACAA AATGCCCAAC TGAGAAAACC 3960 

ACCAAGATTG AAACCAAATA ATCAACCAGC ATACTTGTTA TCTTTGATAG ATAATATTCT 4020 

ACCATATTTA CAGGGCTATG ACGCAATGTT TTCTGCCAGT TGTTGATCTT GTCGGTATGT 4080 

AAAACAACTG GGAATGAGAA GATAGCTGTT GACATCATGG AAAATGCAGT CATGGAGATA 4140 

AGATAATCAC GCATAAAATT CGCGAGTTCA CCTGGTGTGT CCTGATAGAT ACCAGAAAAA 4200 

AATAAATAGA AAGCCGTCGG CATCCCTACT GACAATAGAT AATAGATCAA TTGTCGTTTG 4260 

GTCAATAAAA ATTCTATCTT ACTAAGTGCT AGCCATCGTT TCATCTTAGT TATCTCCCTT 4320 

CTGCGTTTCT TCAAAGATTG TATCCAACAA ACTACGATTA TTAACTTCAA TTTCTTGTAT 4380 

GCCACATCCT GCTTGAACTA ACAGTTCCCA AAAAGCATCT GCTTCGCGTG TGACTACTTG 4440 

TAGAGCATCC TGTTTTTGTG ACCAGTTTTC AACCAAGTTA GACTGCTCAA TGACTTCCTT 4500 

GTATGCCAGA GGAAGGATAA AATGCTTTTC AATTCCCTCA CTACGCATAG CTAGAGGCGT 4560 

CGTATCACGA ATCAACTCTC CCTTATTTAA AACCAAAATC CGGTCAGCCG TATGCTCTAC 4620 

CTCTTCAATA TAATGAGACG AATAGAGAAT CGTGACTCCT TGCGCTTTTA GGTCCCGAAC 4680

GATTTCCCAA AAGCGTTGAC GAGTTGAAGT ATCCATGGCA GCAGTTGGTT CATCTAAAAA 4740 

GACAAGCTTT GGTCGCCCAA TCAAGGTCAA GACAAAAGAG AAGAGACGCT TTTGCCCGCC 4800

TQACAA-rrrr TCTGCGAATT GCTCTTTTTG TTGCTGGTCA AACTGCAATA GTTGATCGAT 4860

TTCCTGATCG CTCAAGGAAT TTGGATAGAT ACGTTGAAAG AAAGCAATCA ACTCTTTGAC 4920

CTTTAATTTC TGAACGATGA CATTTTCTTG AGGCAGATAA CCTCTAATAT AGTCTAACTG 4980

AGAACTCGTC ACTGACAAGC CTTGGATGGA TACTTGACCG CTTGTGACCA GTTTATCTCC 5040 

AAGCAGACAG TCCAAGAGTG TGGTCTTCCC AGCACCATTG GGCCCAATCA AGGCGACGCA 5100 

TTCACCTTCA GCTACCTCAA AGGAAATACC CTTCAAAATA GCCTTGCCCT TGATGTTTTT 5160 

ATTTAGGCTT TCTACCTTAA TCATATTCAT GATATTCTCC TTTCAACCAC TCCATTCTCA 5220 

TAAGGAAAAC GACGAAAATC ATAAATCCAA ACCCCAAAGC ACCACGAATG AATTGGCGAA 5280 

gCAAGGTTTG GTCAAACCAA CCTGTAAACA TTTCCACTAA CCATACCAAG AGTGACAGGC 5340 

CGATAAAGAA ATAGATGATC CCTCTCTTCA TTCCTCAAGC TCCTTTTTCA CATCTCCGAC 5400 

TAATTTCAAA CCTTCTCTAA CAAGCCAAGA CATCATTCCA AAGCCAGCAA AGAGCTCCCA 5460 

AGGAAAATGA TAGAAACTCT CATCCAATCC CGAAAACATG AGTTAGGTCA TAACTCCTGC 5520 

TACTACTAAA CTCACTGCGA TAATCATTTT ATTTCTCATC TCTTCTTCCT CCATTTCATA 5580 

CTACAATTAT AGTCTTTTGA AATCAGAGGA GACAGAAGCT TCTGTCACTA GAAAATATGA 5640

CAAATGTCAT AAAAAATTCT GTTCAAAACA AGCAAGATAC ACTATACAAT AAAACACAAT 5700

TAGAAAAATC TAAGGCAACT TCCTCAAAAG AGATATCAAA CCCAATTCAC ACCATAATGT 5760 

AAACTAATAC TTATTTAAAA TCAAAAAGAG TAGAAATTTT TATCAGACAA ACACATATAT 5820 

AGTGTATTGA ATCTATAACA GTAGGCCTTA AATACTAAAA TATTTCTATA AATTAATTTA 5880 

ACTTTCCTGA TAGAGCTGTT CATATCTTAT TTCAATTCTC TAAATTATAC GTTGAACAAA 5940 

ACCCTTCTAT TTCTTTCTTA AAGATTTATA AGAGTTATAA AATCTGTTAA ATTTCAATGT 6000 

GTATACCTAA ACTACGGTAT TTATTGAAAA GACTGGAGAC AAAAAGTATA CGCTGCCAAA 6060 

ATGAATTACT GAAAATCAAA AAAGAGAGAA CCAAACTGAT TCCCTCTTAA TGTATATAAT 6120 

ATCTAGTTTT AAAAATACAC ACTCACATAT CTCTGTAATG AATCGGGAAG ACAGGATTCG 6180 

AACCTGCGAC ACCTTGGTCC CAAACCAAGC ACTCTACCAA GCTGAGCTAC TTCCCGAGTT 6240 

AAATAGAAAA ATGCACCCTA GAGGAGTCGA ACCTCTAACC GCCTGATTCG TAGTCAGGTA 6300 

CTCTATCCAG TTGAGCTAAG GGTGCTCCAT ATTATGCCGA GGACCGGAAT CGAACCGGTA 6360 

CGATCGTTAC CAATCGCAGG ATTTTAAGTC CTGTGCGTCT GCCAGTTCCG CCACCCCGGC 6420 

CTCTCTAAGC GAACGACGGG ATTCGAACCC GCGACCCCCA CCTTGGCAAG GTGGTGTTCT 6480

ACCACTGAAC TACGTTCGCA CTGTTTTCTT CTATCTAAAA ATGCCGGCTA CATGACTTGA 6540 

ACACGCGACC CTCTGATTAC AAATCAGATG CTCTACCAAC TGAGCTAAGC CGGCTCATTT 6600

GTTATATCTT AATGCGGGTT AAGGGACTTG AACCCCCACG CCGTTAAGCG CCAGATCCTA 6660 

AATCTGGTGC GTCTGCCAAT TCCGCCAAAC CCGCATATAT GACCCGTACT GGGCTCGAAC 6720 

CAGTGACCCA TTGATTAAAA GTCAATTGCT CTACCAACTG AGCTAACGAG TCTAAAATAA 6780 

cTTGCGTTAC CTTAAACGGT CCCGACGGGA ATCGAACCCG CGATCTcGCC GTGACAAGGC 6840 

GACGTG 6846
(2) INFORMATION FOR SEQ ID NO: 199: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2911 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 199: 

GAATTCATTT TAAATAAAGA TACGGGAGAG GTAAGTGAAT TAAAACCTCA TAGGGTAACT 60 

GTGACCATTC AAAATGGAAA AGAAATGAGT TCAACGATAG TGTCGGAAGA AGATTTTATT 120

TTACCTGTTT ATAAGGGTGA ATTAGAAAAA GGATACCAAT TTGATGGTTG GGAAATTTCT 180

GGTTTCGAAG GTAAAAAAGA CGCTGGCTAT GTTATTAATC TATCAAAAGA TACCTTTATA 240

AAACCTGTAT TCAAGAAAAT AGAGGAGAAA AAGGAGGAAG AAAATAAACC TACTTTTGAT 300

GTATCGAAAA AGAAAGATAA CCCACAAGTA AACCATAGTC AATTAAATGA AAGTCACAGA 360

AAAGAGGATT TACAAAGAGA AGAGCATTCA CAAAAATCTG ATTCAACTAA GGATGTTACA 420

GCTACAGTTC TTGATAAAAA CAATATCAGT AGTAAATCAA CTACTAACAA TCCTAATAAG 480

TTGCCAAAAA CTGGAACAGC AAGCGGAGCC CAGACACTAT TAGCTGCCGG AATAATGTTT 540

ATAGTAGGAA TTTTTCTTGG ATTGAAGAAA AAAAATCAAG ATTAAGATAA AAGCTATAGA 600

AAAAAATGGT TTATGTACTG AGATTAGATA GTGAGGTGAT GACATAGTTT TGTGAAAATA 660

GCCATTTATA ACTCAATTAT TTAGTTTACT TTACTTTACT AGTGATACTA TTTGGAGTTA 720

TTAATGGACT TAGTTTATAT AACTAATGAA TTGATTGAAA GGGTTAGTAT TGACAATATT 780

GGTCATATTG ACTAGAAAAT AGAGTCTATC AAAATTTAAA GGCTAATAGA GGTGATGAGA 840

CAATTTCGGC TCTTTGTCAA CTGTAGTGGG TTGAAGTCAG CTAAGCTCGA GAAAGGACAA 900 

ATTTTGTCCT TTCTTTTTTG ATATTCAGAG CGATAAAAAT CCGTTTTTTG AAGTTTTCAA 960 

AGTTTCGAAA ACCAAAGGCA TTGCGCTTGA TAAGTTTGAT GAGATTATTG GTCGCTTCCA 1020 

GTTTGGCATT AGAATAGTGT AGTTGAAGGG CATTGACAAT CTTCTCTTTA TCTTTGAGGA 1080 

AGGTTTTAGA GUATGAACTT GATTCAGATT GTCCTCAATG AGTCCGAAAA ATTTGTCAGG 1140 

CTCCTTATTC TGAAAGTGAA AAAGCAAGAG TTGATAGAGA TTATAGTGGT GTTTCAAGTC 1200 

TTCTGAATAG CTCAAAAGTT TATCTATAGT AGATTGAAAC TAGAATAGTA CACCTCTGCT 1260

TCTAAAACAT TGTTAGAAAT CGATTTGACT GTCCTGAATG ATTTGTCCTG TTATTATTTC 1320 

ATTTTACTAT AAATCCACGT TTACGAATCT CTTTCCACAC TTGTTCAATG GGGTTCATCT 1380 

CTGGTGTGTA TGGAGGAATA AATGCAAAAC CAATATTAGT CGGAATCTTT AAGGTACTTG 1440 

ATTTATGCCA TATAGCATTG TCCATAACGA GTAAAAGATA ATCATCTGGA TAAGCTTGTG 1500 

AAAGCTCCTA TTCCTAAAGC CCCTTTATAA CCTCTTGCGA GAGAGACTAT TGACTCAGCC 1560 

CTTACTTCAT GCGGATGAAA CTTCTTATCG GGTTCTAGAG AGTCATAGCC ATCTGACCTA 1620 

CTATTGGACC TTTTTGTCTG GGAAAGTTGA GAATCAAGCA ATCACGCTGT ACCATCATGA 1680 

TCAGAGTCGG AGTGGTTCGG TAGTACAAGA ATTCCTAGGA GATTATTCTG GCTATGTTCA 1740 

TTGTGATATG TTGCGGCAGT AACTTAGGAC TTTAGTCCTC TAGTTCTGCC TATGCGATAG 1800 

CAGTCCAAGG TTTAGGAGCA AGGCGACGCT AAGCTTGGTA AACTGCGAAC CGCTAGAAGC 1860 

TTATCGTCAA CTGGAAGAAG CTGAACTTGT TGGATGTTGG GCGCATGTGA GAAGGAAATT 1920

TTTTGAAGCG ACCCCCAAGC AAGCAGATAA ATCATCCTTA GGAGCTAAAG GTTTAGCTTA 1980

TTGTGATCAG TTATTTTCCT TGGAAAkAGA CTGGGAGGCT TTGCCAGCTG ATGAACGACT 2040

ACAGAAACGT CAAGAACATC TCCAGCCCCT AATGGAAGAC TTCTTTGCTT GGTGCCGCCG 2100

TCAGTCAGTT TTAGCAGGTT CAAAACTAGG AAGGGCAATT GAATACAGCC TCAAGTATGA 2160

AGAAACCTTT AAGACTATTT TGAAAGACGG ACATCTGGTC CTTTCCAATA ATCTAGCTGA 2220

ACGCGCCATT AAATCATTGG TTATGGGACG GAGTAAAAGA GTCCAGTGGA CTCTTTTAGC 2280

CTGAGCTCAG TTTAAAAAAG CGAGGGTGGT TATTTTCTCA AAGTTTTGAA GGAGCTAAAG 2340

CAAGAGCTAT TGTTATGAGC TTGTTGGAAA CAGCTAAACG TCATCAATTA TAGTGCGTTG 2400

AATCTATAAC AGTACGCATC GACTGCTAAA ACATTTCTAT AAATCAATTT TCCTTTCCTA 2460

ATCGATTTGT TCATATCTTA TTTCAATCCA TTATAAATAG CGAGAAATAT CTATCCTATC 2520

TTCTAGAATG TCTTCCAAAC GAGGAAACTC TCGTAAACAA AGAGGTTTTA GAGGTTTATT 2580

TACCATGGAC TAAAGTTGTA CAAGAAAAGT GCAAATAAGA AATCTCCAGA TTAGGAACTA 2640

TCCGTGAGTT CACTAATCTG GAGATTTTTC AATAGAtTCG TTATTGGGCG GTTACGATAT 2700

GATcACTACT TCGTCAGTCT TATCTACAAC CTCAAAACAG TGTTTTGAGC AACCTGCGAC 2760

TAGCTTCCTA GTTTACTCTT TGATTTTCAT TGAATATTAG AACAGAAAAA ATGCTTGGAG 2820

TATTTGTTTG TGTGTTTATT TTTATATAAC AAACTATAAA CAAAATAAAA ATATAAAAAA 2880

AGAGACAAAA AAGAACAGAA AGTAATTGAC A 2911



(2) INFORMATION FOR SEQ ID NO: 200: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6854 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 200:

GAAAATAAGT CTTGACAGAA AGCGCTATCA ATGATAGAAT GAATTCAGAT AAAAAGATTT 60 

ATTTTTAAAA CAAAAATGAA ACGTTTCAAA AAAAGAAATA AAGAGACAGC GCCAAGCGCT 120 

ATCTTTTCTA GAAAAAAATG AAACGTTTCA AAAAAGGAGG TTGCTATGAA TAGCAAAGCG 180 

AAGCAAGTTT CTCTTTGGGA AAGAATCAAG AAACAAAAAC TCTTGTTATT GATGACTGTC 240 

CCCGGTTTAG TTTTAACCTT TATCTTTAAA TACATCCCTA TGTATGGGGT TTTAATCGCA 300 

TTTAAAGATT ACAATCCTTT AAAAGGAATT TTAGGGAGTG ATTGGATTGG TTTTTCTGAG 360

TTTACAAAAT TCATATCCTC TCCCAACTTT GGTATCTTGT TAGCCAACAC ATTAAAATTA 420

AGTATCTATG GTTTATTGCT TGGCTTTTTA CCACCAATCA TTCTCGCGAT TATGCTCAAT 480 

CAACTCTTGA GTGAAAAAGT C7VAAAAACGA ATTCAGCTCA TTTTATACGC ACCAAACTTT 540 

ATCTCAGTCG TTGTTATTGT CGGTATGATT TTCCTCTTCT TTTCAGTGGG AGGACCAATC 600 

AACAATTTTC TTTCTATGTT TGGAATGAAG GCTGACTTCT TGACAAATCC AGACTTCTTT 660 

AGACCTTTAT ACATCTTTAG TGGTATCTGG CAAGGAATGG GCTGGGCTTC AACGCTCTAC 720 

ACGGCAACAT TGGTAAATGT AGATCCAGCC TTAGTAGAAG CAGCCCGACT GGATGGAGCC 780 

AATATCTTCC AACGAATCTG GCACATTGAT ATTCCAGCTC TTAAGCCTAT TATGGTTATC 840 

CAATTTGTTT TAGCTGCAGG TGGAATTATG AATGTCGGAT ATGAAAAAGC ATTCTTGATG 900 

CAGACATCGT TAAATTTGCC AACTTCTGAA ATTATCTCGA CATATGTCTA TAAAGTTGGT 960 

CTTGTATCAG GAGACTATTC TTACTCAACA GCGGTTGGTT TGTTTAATGC AGTGATTAAC 1020 

GTAGTATTGC TTGTTGCAGT TAACCAAATC GTTAAACGCA TGAATAATGG TGAAGGAATT 1080 

TAAGGAGGAA AGTATGAAAA ATTCGATTAT GGATACAAAA TTTGATAGAC GTATCTTACT 1140 

CTTAAATAAA ATCATTATTG TCTTTATCGT TTTGATGACT TTGCTTCCTT TACTTTATAT 1200 

CGTCGTAGCA TCCTTTATGG ATCCTAAGGT TCTGGTTAGT AGAGGGATTA GCTTTAATCC 1260

AGCCGATTGG ACTGTAGAAG GTTACCAGCG TGTATTCAGT GACCAATCTA TTCTAAGAGG 1320 

TT-TTATCAAt tCtL'TACTAT ACTCTTTTGG ATTTGCAGCT TTAACAGTCT TGCTATCTGT 1380

GTTTACAGCT TATCCTCTTT CTAAGAAAGA CTTGGTTGGA CGTCGTTGGA TTAACTACTT 1440 

CTTGATTGTA ACTATGTTCT TTGGTGGTGG TTTAGTCCCA ACTTACTTGC TCGTAAAAGA 1500 

ATTGGGAATG CTCAATACTC CATGGGCTAT CATTGTTCCA GGTGCTGTTA ACGTTTGGAA 1560 

TATTATTCTT GCTAGGGCCT ATTTCCAAGG ATTGCCTGAA GAATTAGTTG AAGCTGCTGT 1620 

CATTGATGGT GCAAATGATT TACAGATTTT CTTCAAAATC ATGCTTCCTC TTGCAAAACC 1680 

AATTATGTTT GTTCTCTTCC TTTATGCTTT TGTAGGACAG TGGAACTCAT ACTTTGATGC 1740 

AATGATTTAT ATCAAGGATC CAAACTTGGA ACCATTGCAA CTTGTACTTC GTAAAATTCT 1800 

CATTCAGAGC CAACCAGGTC AAGACATGAT TGGAGCACAA GCGGCTATGA ATGAAATGAA 1860 

ACGTTTAGCT GAATTGATTA AATACGCAAC TATTGTCATT TCCAGCTTGC CATTGATTGT 1920 

TATGTATCCA TTCTTCCAAA AATACTTTGA TAAAGGAATT ATGGCTGGTT CACTTAAAGG 1980 

ATAAAAAAAG AAAAAATAAA AGGAGTTTTC TCATGAAATT CAAAACATTC TCAAAATCAG 2040 

CAGTTTTGTT GACAGCTAGT TTAGCAGTAC TTGCAGCCTG TGGCTCAAAA AATACAGCTT 2100 

CAAGTCCAGA TTATAAGTTG GAAGGTGTAA CATTCCCGCT TCAAGAAAAG AAAACATTGA 2160

AGTTTATGAC AGCCAGTTCA CCGTTATCTC CTAAAGACCC AAATGAAAAG TTAATTTTGC 2220

AACGTTTGGA GAAGGAAACT GGCGTTCATA TTGACTGGAC CAACTACCAA TCCGACTTTG 2280 

CAGAAAAACG TAACTTGGAT ATTTCTAGTG GTGATTTACC AGATGCTATC CACAACGACG 2340 

GAGCTTCAGA TGTGGACTTG ATGAACTGGG CTAAAAAAGG TGTTATTATT CCAGTTGAAG 2400 

ATTTGATTGA TAAATACATG CCAAATCTTA AGAAAATTTT GGATGAGAAA CCAGAGTACA 2460 

AGGCCTTGAT GACAGCACCT GATGGGCACA TTTACTCATT TCCATGGATT GAAGAGCTTG 2520 

GAGATGGTAA AGAGTCTATT CACAGTGTCA ACGATATGGC TTGGATTAAC AAAGATTGGC 2580 

TTAAGAAACT TGGTCTTGAA ATGCCAAAAA CTACTGATGA TTTGATTAAA GTCCTAGAAG 2640 

CTTTCAAAAA CGGGGATCCA AATGGAAATG GAGAGGCTGA TGAAATTCCA TTTTCATTTA 2700 

TTAGTGGTAA CGGAAACGAA GATTTTAAAT TCCTATTTGC TGCATTTGGT ATAGGGGATA 2760

ACGATGATCA TTTAGTAGTA GGAAATGATG GCAAAGTTGA CTTCACAGCA GATAACGATA 2820 

ACTATAAAGA AGGTGTCAAA TTTATCCGTC AATTGCAAGA AAAAGGCCTG ATTGATAAAG 2880 

AAGCTTTCGA ACATGATTGG AATAGTTACA TTGCTAAAGG TCATGATCAG AAATTTGGTG 2940 

TTTACTTTAC ATGGGATAAG AATAATGTTA CTGGAAGTAA CGAAAGTTAT GATGTTTTAC 3000 

CAGTACTTGC TGGACCAAGT GGTCAAAAAC ACGTAGCTCG TACAAACGGT ATGGGATTTG 3060 

CACGTGACAA GATGGTTATT ACCAGTGTAA ACAAAAACCT AGAATTGACA RCTAAATGGA 3120 

TTGATGCACA ATACGCTCCA CTCCAATCTG TGCAAAATAA CTGGGGAACT TACGGAGATG 3180 

ACAAACAACA AAACATCTTT GAATTGGATC AAGCGTCAAA TAGTCTAAAA CACTTACCAC 3240 

TAAACGGAAC TGCACCAGCA GAACTTCGTC AAAAGACTGA AGTAGGAGGA CCACTAGCTA 3300

TCCTAGATTC ATACTATGGT AAAGTAACAA CCATGCCTGA TGATGCCAAA TGGCGTTTGG 3360

ATCTTATCAA AGAATATTAT GTTCCTTACA TGAGCAATGT CAATAACTAT CCAAGAGTCT 3420 

TTATGACACA GGAAGATTTG GACAAGATTG CCCATATCGA AGCAGATATG AATGACTATA 3480 

TCTACCGTAA ACGTGCTGAA TGGATTGTAA ATGGCAATAT TGATACTGAG TGGGATGATT 3540 

ACAAGAAAGA ACTTGAAAAA TACGGACTTT CTGATTACCT CGCTATTAAA CAAAAATACT 3600 

ACGACCAATA CCAAGCAAAC AAAAACTAGA GGTTGATTAT GGGAGATAAG AAATACACAC 3660 

TAGAAAAAGC CAATCGTTTT ATAGCAGAAA ATAAACATCT CGTTTUVTACT CAATATAAGC 3720 

CTGAAGAACA TTTTTCAGCT GAGATTGGTT GGATCAATGA TCCAAATGGA TTTGTCTATT 3780 

TTCGTGGAGA ATACCATCTC TTTTATCAAT TCTATCCATA TGATAGTGTT TGGGGGCCTA 3840 

TGCACTGGGG ACATGCTAAA AGTAAGGACT TGGTGACTTG GGAGCACTTG CCAGTGGCAC 3900

TTGCTCCTGA CCAAGATTAT GACCGAAATG GTTGTTTCTC AGGCTCTGCC ATTGTCAAGG 3960

ATGATCGCCT CTGGCTCATG TACACTGGAC ATATCGAAGA AGAAACCGGT GTCCGCCAAG 4020

TGCAAAATAT GGTATTTTCA GATGACGGGA TTCACTTTGA AAAGATTTCC CAAAATCCAG 4080 

TTGCAACTGG ATCAGACTTA CCAGATGAGT TGATTGCTGC TGATTTCCGT GATCCAAAAC 4140 

TCTTTGAAAA AGATGGACGC TATTACTCCG TAGTAGCTGC CAAACACAAG GATAATGTGG 4200 

GCTGTATCGT TCTACTAGGG TCCGATAACC TAGTAGAATG GCAGTTCGAA TCCATCTTTT 4260 

TAAAAGGGGG AGAACACCAA GGTTTTATGT GGGAATGCCC AGATTACTTC GAGTTAGATG 4320

GGAAAGATTG CCTTATTATG TCACCCATGC GTTATCAGCG TGAGGGAGAC TCATATCATA 4380

ACATCAACTC ATCGCTTTTG TTCACGGGTA AGGTAGATTG GAGAGAAAAA CGTTTTATCC 4440 

CAGAATCAGT TCAAGAAATT GATCATGGCC AAGACTTCTA TGCGCCTCAA ACATTGTTGG 4500 

ACGATCAAAA TCGTCGTATC CTGATTGCTT GGATGCAGAC ATGGGGGCGT ACCCTTCCAA 4560 

CCCATGACCA AGAACACAAG TGGGCATGTG CCATGACTCT ACCTAGAATT CTAAGATTGG 4620

AAGATGGCAA ACTAAGACAA TTCCCTGTTA AAAAAGGCCA ATATCAAATC CAAATAGATA 4680

AAGATTGTCA TTACCACTTA GGAAATGATA TAGATTATCT TGAATTTGGT TATGACAGTA 4740 

ATGCGCAGCA AGTTTACATT GATCGTAGCC ATCTTATTCA AAAAATTCTA GGTGAAGAAG 4800

AACAGGACAC TAGTCGACGG TATGTAGATA TTGAAGCTAA AGAATTGGAA GTTGTTCTAG 4860

ATAAAAATTC CATCGAGATT TTTGTCAATC AAGGTGAAGC AAGCTTGACT GCAACTTATT 4920 

ACTTAACGGT GCCAGCTGAG CTATCACGAA TTGATTAAAA ATTAAGTTAT TTCTCCTAAA 4980

GAAAAAGTTC TCTTTCTAAA ATAGTGGAAA GAGGACTTTT TGTGTTTTGG GTATATAAGC 5040 

TTAGTTTATG GTATTTGTAA AATTGGTGTT GGATTATGAT TTAAGCTAGT TTTCTAAAGA 5100 

ATTTGAAAAA AATTTTATTT AAGCAAAAAA ACCTTGGTTC CAAGGCTTTT CCTGTTGTAT 5160 

TTAGATGCCC CCTACAGGGA TTGTAGGAGA TATGTTGCTT AGATGTTCTT GATTTTCTGG 5220 

TGTTTTGTAA CGTTTAAATG AGTTTTTTGA GTTTGTTGGT GGGGCGTTGC CCGGCAATTG 5280

CCCGACTTAT TGCTTGAAAA AGAATTTAAA ATATAGTATA GTTAATTATA GATTAACACT 5340 

TGCTTGGAGG AACTGATGAA GAACAATGAA AGATTAGGTA TTAAATTAAG TAGAGATAGC 5400 

GTTTTAGGAT TGAGGGAAGT TAGAAGGCTT TATTTAGGCA GTTCAGATAT CCCAGTTTCT 5460 

GATGGCTATG TGATTGAAGT TGCTTATAAC CAGATATCAC ATGAGATTGA TATTATTGAT 5520 

TGGGTAGAGT TGAACAAGTC AAAAATTAAG ATAAGTGAAA TTAGTGAAAG CGTGGATATA 5580 

GATGCCACTA GCTTGAGAAC AACTTTGACT TTAGACACAT TAGTATATGA AGGTATGAGA 5640 

GATATACAGT TAAAGTTGAG AGAGCTTACA AAGGGGAGAG TATTCTTTTC ATTTGTAGTG 5700

AAGTTAGTTT TGTTTGCTTC TATTTTAAAG AAAAAAGATT TACTAGAAAA ATTTCAAGAA 5760

AAGTGTTAAT CAAGTATTGA CACTTTATCT GGATTTCGGT ATAATATGCT TAGAAAGGAA 5820

TCTTTCTAAA TTTTTTTCGT CCTTATGTGT TAATCAAAGA CGAATACAAA AACATATTTT 5880

TTTACTCTAA AAAGTGTTAA TCAATGATGT ATTTGTTAGA GAGGTAGATA AATGGAATTG 5940

AGAGCACCAC CAGTTATAAT AGTATAAAAC GTATAATAAA AATATTTTAA CTTGAATTAT 6000

AGAAAAGGAG AAACAAATCA TGAAACAAAA ACAACCGATT GTTTCTAGAA CGAAACAACA 6060

TACATTTGAA GAGCTTATTC AAGACCAAAA GTTAGAAAGA TTGGCTAAGT TGTCGCCCGA 6120

TTTGGTTGGA AGGTATGGTT TTACTGCTAG CTGTGCGTCT TCATTTGCGA ACTTGATTAA 6180

AGAAGCGTAT GGGGGTAAAA ATCTAAACGT AGTTTATGCG AGTCGGATGT TGGCTCTCTG 6240

GAATATTGCT TGCAGTTGTT ATCATAAGGC TGATGGGTAT TCTTTAGCAG ATGCGCTTTT 6300

TAGTGATAAA AAAATTTGTC TAGATTCTTA CTATTACCAC AAGAATACCT CTAATACCAT 6360

AACTAGTGAT GTGATAAAAG ATGTTTACGA TAATTATAAT AATTATATGG TTTTAACTCG 6420

AGAAGCGACA CCTGAATACA TTTATGTTGT ACAAACTGAA ATGCCAAAAG ATTCAGATTT 6480

ATATTTTTAT ATTAGAGAAG TTCTGGGATT ATCGTTTAGT ACCATGCATT ATGCATTTTT 6540

AGTCAAGGTT CTTGCAGGAG CGCTTGCTAG AAAATATAAG CCATATCGAA ATTGAATTAT 6600

TTAAATTTAT ACTCTTCGAA AATCAAATTC AAACCAAGTC AGCTTCGCCT 6660

AAGTGCTGTC TGTGGCTAGC TTCTTAGTTT GCTTTTTGAT TTTCATTGAG TATTACTCTT 6720

ATGGTAGTTA TTTATGGCAT AATAATATTG ATTTGGGAGT TATAGCGAAA ATTTTAGGTT 6780

CTATAATATT TGTAGTGGGT AAACCACTAT AGATATTATG GAGCCTATTT ATTGTAGAAA 6840

AAAGTCCCAT ATGA 6854

(2) INFORMATION FOR SEQ ID NO: 201:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3895 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 201:

TCCTTGCTAA GTTTATACTC AATGAAAATC AAAGAACAAA CTAGGAAGCT AGCCACAGGT 60 

TGCTCAAAGC ACCGCTTTGA GGTTGCAGAT AAAACTGACA CGGTTTGAAG AGATTTTCGA 120 

AGAGTATTAA TTTACATAAA TAGCCAGTGT TTGATAGGGT TTGAGTAGAA TTTTCTCAGA 180 
CACTTCTGCA TCTTCATAGT TTGATATCAA AATCTGTCCA TTTTGGTAGA CTGCTGGCAA 240

GTCGATTTcA CTTCTTTAGC ATAAAAGTTA TTGAGCACTA GTAACTTTTG ATCCTCAAAC 300 

TGGCGTTCAA AAGCGTAGAC TTGTTTGCTA TCTTCAAAGG CTGGTTTGTA ACTTCCTTCT 360 

GAAATGATTG GCATTTCCTT ACGCATCGAA TCAAGTCTTG ATAGAAGGTA AAAATCGGAC 420 

CCTGGATTTC ATTTTCTACA TTGATGTATT TATAGGATTT ACCAGCTTTC AACCAAGGAG 480 

TGCCTGTTGA AAATCCTGCA TTTTCCGAAG CATCCCACTG CATGGGAATG CGTGAATTAT 540 

CACGCGACTT AGCTTGAATA ATCTGGAAGG CTTCTTGCTG ACTCTTTCCT TCTTCTAAGA 600 

GCATCTGATA GGCATTAAGC GATTCGACAT CCACATAATC AGCCATAGAA TCATAGTCTG 660 

GGTCAATCAT CCCGATTTCC TCACCCATGT AGATATAAGG TGTCCCACGT GACAGGTGAA 720 

TGCTGGCTGC TAGCATGGTG GCTCCTTCCT TGCGGAAGTT TTGAATATCG ACAAAACGGT 780 

TCAAGGCACG TGGTTGATCG TGATTATTCC AAAAGAGGGC ACTCCAACCG TCTTTATCAC 840 

TCATTTCCTT ACCCCAACTA TGGTAAAGAC TCTTCAACTC TTCAAAATCA AAGGGAGCCA 900 

AGGTCCACTT TTGTCCATCC TTATAGTCCA CCTTGAGGTG ATGAAAATTA AAGGTCATGG 960 

ATAATTCCTG ACGATCAGGC GACGAATAGA GGACACAGTT TTCCATGGTG GTAGAAGACA 1020

TTTCCCCAAC TGTCATAAAG CTATCGTCGG ATCCAAAAGT GGCTTGGTTC ATCATACGCA 1080 

AATAGTTATG AACGATGGGT TTGTCTGTAT AAGCTGGCTT CCCTTCATTT TCAGGACAGT 1140 

CCACTGAAAC CTCGTCCTTA CCGATCAAAT TGATCACATC AAATCGGAAA CCTTTGACAC 1200 

CCTTGTCGCG CCAGAAATTA ACAACCTTGA AAAGCTCCTT ACGGACATTG GAATTGCGCC 1260

AGTTAAGGTC AGCCTGGGTC TCATCAAATA GGTGAAGATA GTATTTCCCA GTATCCCCGA 1320 

AAGGCGTCCA TGCAGAACCA CC7VAACTTAG ACTGCCAATC TGTTGGTTGG TCTTGGATGA 1380 

AGAAAAAGTC TTGATAATAC TTATCACCAG CTAGGGCTTT CTGAAACCAT TCATGCTCTG 1440 

TCGAACAATG ATTAAGTACC ATGTCCAGCA TAAAGTCAAT CTTGTGCTCT TTACCGACAC 1500 

ACACCATTTT CTCAAAATCA GCCATATCAC CAAAAAGAGG ATCCACTGCC ATATAATCTG 1560 

AAATATCGTA ACCATTATCC CGTTGAGGGC TTGGATAGAA TGGATTGAGC CAGACCATAT 1620 

CCACACCTAG TTTGGCTAAA TAGGGAATTT TTTCGATAAT CCCACGGAAA TCCCCAATAC 1680 

CGTTTTCAGT GGTGTCTTTG TAAGATTTTG GATAGATTTG ATAGACTACT TTTCCTTTAT 1740 

CAAGTGTCAT CTGTTTCTCC TTTTCTGATA AAAGGGAGGA AGCAGTCTTC CGTCCCTATT 1800 

TGTGCTATTT CAATTATACT CAATGAAAAT CAAAGAACAA ACTAGGAAGC TAGCCACAGG 1860

TTGCTCAAAA CACTATTTTG AGGTTGCAGA TAGAGCTGAC GTGGTTTGAA GAGATTTTCG 1920 

AAGAGTATTA GATTCGTGTA GCGACCATGA GAGATGCTCC AGCTTGGATC GTTGTCGGAT 1980 
AAGTTCCGGG AATAGTCGCT GTATAAGCAT CTTGGTTGGT GATGATAACA GGAGTTTCTG 2040

TCACCAGACC TGCAGCCTTA ATGACATCCA TATCAAAACG AATCAGTTGC TGACCAACTG 2100 

TAACGTGATC TCCTTGGACT ACAAGACTTT CAAAACCTTT GCCATCAAGA CCTACTGTAT 2160

CCATACCGAT GTGGATGAGC AATTCAACTC CCTCGTCAGA GACAATGCCG ATGGCATGCT 2220 

TGGTAGGGAA AAGAACCGTC ACTGTCCCAT TAACTGGAGA GGTCAACTCA CCTTGGCTTG 2280

GTTCAATGAC TAGACCTTGC CCCATGACAC CTGATGCAAA AATAGGATCC GTCGCTTGAC 2340 

TCAATTCTTT CACTTGGCCA GTTAGTGGGC TGATAATTTC TACCGAAGTA AGTTCTACTG 2400 

GTTCATGGTT CACAAATTCT GCTTCTTCTT GAGCAACGAA TTCTGCCTGC AAGTTCGTAT 2460 

CGCCCTCTGT TTTTGTAAAG AGACCAGCCT TGCGGAAGAA GAAAGTCAAG AGCATTGGAA 2520

CAACAATCGC AACTAGCATA GTTCCTGCAA ATGGCAGCAT GTATTGAGGT TGAATAGAGA 2580 

GAATACCTGG CAAACCACCG ATACCAATAG AAGCCGCAGT TACATTAAAA GTAACGGATA 2640 

ACATGCCTGC AAGGGCTGAA CCAGTCATCC CAGCAACAAA TGGATAAATA TATTTTACGT 2700 

TAACCCCAAA AAGAGCTGGT TCTGTAACAC CGAGATAGGC TGAAATGGTT GCAGGAAGTG 2760 

AAACCTGAGC CTCACGCTCA TCATGGCGAT GCATGAAATA ATAGGCAAAC ACGGCTGAGC 2820 

CTTGAGCAAT ATTAGAAAGA GCAATCATTG GCCATAGGGC AGTGCCACCA GCATCCGCAA 2880 

TCAATTGTGT ATCAATGGCA TTGGTCATAT GGTGCAGACC TGTGATGACA AATGttAGCGT 2940

AGAGGGCGCC AAAAATTGCA CCGAAGAGCC ATTTAACTGG ACCAGTTAAA CCTGCCAAGA 3000 

CAACTGATGA AAGTCCTTGT CCAATTGTCC AACCGATTGG TCCCAAAACA GTATGAGCCA 3060 

AAATCAAGGC TGGAATCAAT GACAAGAAAG GTACAAAAAT CATAGAAATG ACTTCTGGGA 3120 

TATGCTTGTG CCAGAAGATT TCAAGATAAG ACAGACTCAA ACCTGCAAGC AAGGCTGGGA 3180 

TAACTTGGGC TTGGTAACCG ATACGATTAA CAGTAAAATA GCCAAAATTC CAAACCCAGT 3240 

TTGCCGCGAT ATCAGCTGCT GGCGTTGAAG CAACCGCATA GGCATTGAGC AACTGAGGCG 3300 

ATACCAAACA GATTCCGAGA ACAATTCCCA AAATTTGGCT GGTTCCCATC TTACGAGAAA 3360 

CAGACCAAGT AATCCCTACT GGTAAGAACT GGAAGATAGC TTCACCAGGC AACCAGAGGA 3420 

AGTGATTGAC ACCTGCCCAA AACTGAGAGG ATTCTGTGAT GGTCTTGCCA TCCAACATCG 3480 

ACCAATGGAC ACCTTCCAAG ACATTACGGA AACCGAGGAT CAATCCTCCG ACTATCAAGG 3540 

CTGGAATAAT CGGAGTAAAA ATCTCCGCCA GAGTGGTCAT AACACCTTGG ACCACGTTTT 3600 

GATTACTCTT AGCTGCAGAC TTGGCTGCTT CTTTGGAAAC ACCCTCAATA CCTGAAACGG 3660

CTGTAAAATC ATTATAAAAG ATGGGCACGT CATTTCCAAT GATTACCTGA AATTGACCTG 3720 
CATTTGTAAA GGTTCCTTTA ACAGCTGGAA TTGACTCGAT AGCTTTAACA TTAGCCTTCT 3780
TATCATCTCC TAAAACAAAC CGCATCCGTG TCGCACAGTG AGTTACGGCA GTCACATTTT 3840 
CTTTGCCTCC GATTGCCTGA AGCAGATCTT TGGCTTCTTG TTCAAATTTT CCCGG 3895 
(2) INFORMATION FOR SEQ ID NO: 202: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3936 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 202: 



AGGATCGCCG CTCCAGCTAC TAAGTCTCGT GCAGTGCCGA TTTATCAAAC AACATTTTTT 60

GTTTTTGATG ACACGTAGGA AGGTGCCGAT CTGTTTGCCT TGAGGAAACC AGGGAACATT 120

TATACTCGTA TCACCAATCC TACAACAGCT GCCCTTGAAG GTGGTGTTGA AGCGCTAgcA 180

Aij i oAU 1 i A i ACGATTTTGG CGATTGCCCA TGCTGGTGAC 240

CATGTAGTGG CTGCTTCGAC TATTTACGGT GGAACCTTCA ATCTTTTGAA AGAACCCCTT 300

CCTCGTTATG GTATCACAAC AACCTTTTTC GATATTGATA ATTTGGAGGA AGTAGAAGCA 360

GCTATCAAAG ACAATACCAA GCTTGTCTTG ATTGAAACCT TGGGTAACCC CTTGATTAAT 420

at-tccaoacc TGGAAAAACT GGCAGAGATT GCTCATAAAC ATCAAATCCC ACTTGTGTCA 480

GACAATACTT TTGCAACACC TTATTTGATT AACGTCTTCT CTCATGGCGT TGACATTGCC 540

ATTCACTCTG TGACTAAGTT TATCGGTGGG CATGGTACAA CTATTGGAGG AATAATTGTC 600

GATAGTGGTC GTTTTGACTG GACGGCTTCA GGGAAATTCC CTCAATTTGT TGACGAGGGT 660

CC/iAGCTGCC ACAATTTGAG CTATACTCGT GATGTGGGTG CAGCAGCCTT TATTATAGCT 720

GTTCGAGTTC AATTGCTTCG TGATACAGGT GCAGCCTTGT CACCATTCAA TGCTTTCCTC 780

TTGCTACAAA GACTTGAAAC CTCTTCACTT CGTGTGGAAC GCCATGTACA AAATGCTGAG 840

ACAATTGTTG ATTTTCTTGT CAACCATCCT AAGGTAGAAA AGGTAAATTA TCCAAAACTT 900

GCAGATAGTC CTTATCATGC CTTGGCTGAG AAATACTTGC CAAAAGGTGT CGGTTCAATC 960

TTTACCTTCC ACGTCAAAGG TGGCGAGGAA GAAGCACGCA AGGTCATTGA TAATTTAGAA 1020

ATCTTTTCTG ACCTTGCAAA CGCGGCAGAT GCTAAATCGC TTGTTGTCCA TCCAGCAACA 1080

ACCACTCACG GTCAATTGTC AGAAAAAGAC CTAGAAGCAG CAGGTGTCAC ACCAAACTAA 1140

ATTCGTTTGT CAATCGGTCT TGAAAATGTA GAAGATTTGA TTGAAGACTT GCGCTTGGCC 1200

TTGGAAAAAA TTTAAAGTAA AAGAAGATAA ACAGTGGGCT TCGACTCACT GTTTTTGATT 1260

TTCCCTCAGG CATGATATAA TGGTTAC Af^ A t\\a IK^l Ata AAA GAGGAACGAT ATGAACGAAA 1320

TCAAATGTCC CAACTGTGGG GAAGTCTTTA CAGTAAATGA GAGTCAGTAT GCCGAACTCT 1380

TGTCCCAAGT GAGTTTGATA AGGAACTACA CGATAGGATG AAGCAGGAAC 1440

TGGCCTTGGC GCCATGAATG AGCAACAGAC TAAACTGGCT CAGAAGGATC 1500

AAGAAATTGC AGTCAGATCC AAAACTTTGA TACAGAAAAA GAATTGGCCA 1560

AGCCATGAGG CTCTCTTGGC TAAGGACAAG GAAGTACAGC 1620

ACCTTGCGTT TGGAGCATGA AAATCAACTA CAAAAGACCC 1680

apa&ab&r^&a AI^AAAAA(jAA CGGGATCAGG TTAAAAACCA ACTACTTTTG CAGGAAAAGG 1740

ATCTTTGGCT TCTGTTAAGC AAAACTACGA AGCCCAGCTC AAGGCAGCTA 1800

CGAGTTTTAT AAGAATTTTA AGGCTCAACA ATCTACAAAA GCGATTGGGG 1860

ACAGTATGCA GAGAGTGAGT TTAACAAGGT TCGTAGTTTC GCCTTTCCAA 1920

ATGCTTACTT TGAGAAGGAT AACAAGGTCT CTTCGCGTGG GTCTAAAGGG GACTTTATCT 1980

TGATGAAAAT GGAGTTGAAA TCATTTCTAT CATGTTTGAG ATGAAAAACG 2040

wula AL.(j \j AACAGAGAAG AAGCACAAGA ATGCAGATTT TTACAAGGAA TTGGACAAGG 2100

AL.l,0 J, ^.I^LiLaA GAaGAACTGT GAGTATGCCG TTTTGGTGAC CATGCTTGAG GCTGATAATG 2160

ACTACTTTAA CACAGGGATT GTTGACGTCA GTCACGAGTA TGAAAAAATG TATGTTGTTC 2220

V^i^^l^AAi i L. 1 1 1 Ai (-(_AA TTGATTGGTC TCTTACGTAA TGCGGCGCTA AATTCCCTAA 2280

GGAGTTGGCC TTGGTTCGCG AGCAAAATAT TGACATTACG CATTTTGAGG 2340

TGCCTTTAAG CTAGCTTTTG CTAAGAACTA TAATTCAGCT TCGACTAACT 2400

TTGGAAAAGC TATTGATGAA ATCGACAAGG CCATCAAACG CATGGAAGAG GTTAAGAAAT 2460

TCCTGACCAC ATCTGAAAAC CAACTCCGTT TAGCTAACAA CAAATTGGAA GATGTCTCTG 2520

TTAAAAAATT GACCCGGAAA AATCCAACAA TGAAAGCXSAA GTTCGAAGCA CTGAAGGGGG 2580

AGTAGAAAGC AAAAATGAAC GGTATTATTA ACTTAAAAAA GGAAGCAGGA ATGACCTCGC 2640

ATGATGCGGT TTTTAAACTG CGTAAGATTT TGGGAACCAA GAAAATTGGT CATGGTGGAA 2700

CCTTGGATCC GGATGTGGTG GGTGTTTTGC CGATTGCGGT TGGCAAGGCG ACACGCATGG 2760

TCGAGTTTAT GCAGGACGAG GGTAAGATCT ATGAGGGGGA AATCACTCTG GGCTATTCCA 2820

CGAAGACTGA GGATGCTAGT GGGGAAGTGG TCGCAGAAAC CCCTGTTTTG TCTCTCTTGG 2880

ATGAAAAGCT TGTTGATGAA GCGATTGCTA GCTTGACTGG GCCTATTACT CAGATTCCCC 2940

CTATGTATTC GGCAGTTAAG GTTAATGGTC GCAAGCTCTA TGAGTATGCG CGTGCTGGTC 3000



WO 98/18931 PCT/US97/19588

AGGAAGTGGA GCGTCCAGAA CGTCAGGTGA CCATTTATCA ATTTGAGCGA ACAAGTCCGA 3060 

TTTCTTATGA TGGCCAACTT GCCCGATTCA CTTTTCGTGT AAAATGCAGT AAAGGGACGT 3120 

ACATCCGTAC TTTGTCAGTT GATTTGGGTG AAAAGCTTGG TTATGCGGCT CATATGTCCC 3180 

ATTTGACTCG TACTAGTGCT GCTGGCTTAC AATTAGAAGA CGCTCTTGCC TTGGAGGAAA 3240 

TTGCTGAAAA AGTAGAGGCT GGGCAATTAG ATTTTCTCCA TCCTTTAGAG ATTGGGACAG 3300 

GTGACCTTGT CAAAGTTTTC CTAAGTCCAG AAGAGGCTAC AGAAGTTCGC TTTGGTCGTT 3360 

TTATTGAGCT AGACCAAACG GACAAAGAAC TGGCTGCCTT TGAAGATGAT AAATTGTTAG 3420 

CCATTCTAGA AAAACGGGGC AATCTCTATA AGCCAAGGAA GGTTTTTAGC TAGATCGTTT 3480 

AGGAATAAAA ATCGGGTGAT AGATAACAAT TGCTTGATAA AACCCCATAC TAATAGTAGA 3540

ATGGTTTTGG GAATTATAAT ATTCCAATTG TTGCGAGTTG TAGGTACTCA AATAATCTAT 3600 

ATAGAAATTT AGAGGTGTGA AATGAAGCAA TTTAAAATTC TTTCAGATAA ATATTTAGAG 3660 

TCCATTACAG GTTCTGATGG GAACTTAGGC CCAGGATTTG GTGTGAT7UVT TCCATGATGC 3720 

GAAATGAGTT TCGAGAAAGG GTGGAGCAAC TTCTTCAACA AAAAGAAATA AATGAAAATA 3780 

GTGAGTTGAG TCACCTGTTT CGTCTTGCTA TACAAAATTT AGACAGAAAT GAAAAATACC 3840

AATCGGTCAT GGCCAATTTG AGTCAAGGGT TGTCACTTTA CCTCATGACG CATCATTACC 3900

AGGCACCTAA GTCTGTCATT GATTTTGGTT TATGGA 3936
(2) INFORMATION FOR SEQ ID NO: 203:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3230 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 203:

CATCCAGCAA CTGCTCCTCT GAGCGTTTCA AAATTGATGT AATTTTTCTA GTTTTTTCTA 60 

ATAAATGTGC CATTTTTCAC CTCGAATTTA ATCGCTATCA TTATAACATA AAAACGTCTC 120 

TTTTTCAATA ATTATCTGAA AATTCCTTAT TGACTTGCAT TGACTTACAA TTTAATTAAA 180 

AACCAGAATA TTTTTAATTA AATTGTTCCT TTTCTATTGA CAAGTTGCCT ATTTTTGTGT 240 

ATCATAATAT TATAAAAGAT AATATAATAA TTTTATTTGT CTTTTCACAT TCGGTCTCCT 300 

TATATAAAAA AGCGATTCAT TTTGAACCGC TTTTTCTTAT TTATCGCCTT TGTTACGAAT 360 

AACAAAGCCT GTTTGCTTTT CGCTTAAAGT ATTGCGTGGT TTTTTATTAT CCTTACGGTA 420 

ACGTTTTTCC TTATCAAAAC GATCGTTGCC ACGACTTCCT TTTTTGAACT CATCACGGCG 480 
ACCATTGCCA CGGCGATCAC GCTCTCGACG GTCGTCCCCA CGACGGCCTC CACGACCTCC 540

CTTAGCTTTA CCACCGAAAC CATTACCTGA TGGTTTAAAC GGTAGTGGtT TTTCACGTGC 600 

AATCTCCACT TCTGGAAGGC TATCTGGGTC TTGGACTGTC AGACTCAAGA TATACATTGC 660 

CAATTCTTCT GGAGTAAACT CAGCAGCCAA TTTGCGAGCA TCCTTACCAA ATTTCTCAAA 720 

GTTGGCACGA ATGGTTTCAT CTGCAAAATC ACGTTCGATT TTCTTGAGAG CTACCTGTTT 780 

TTTTGATTGG AAGGATTCTT CTACACTTGC AGGTTTGAGA CCTTTCATGC GTTTCTTAGT 840 

CAAGTTTTCA ATGATTTGAA GGTAACCCAT TTCGTTTGGA GCAACAAAAG TAATAGATTG 900 

ACCTGACTTA CCAGCACGAC CTGTACGACC GATACGGTGA ACATAACTCT CAGGATCTTG 960 

TGGAATATCG TAGTTGTAGA CATGGGTCAC ACCTGAAATA TCCAAACCAC GCGCTGCAAC 1020 

GTCTGTCGCA ACCAAAACAT CAAGATTGCC ATTTTTAAAG TCACGAAGGA CACGAAGACG 1080 

TTTGTTTTGG TCTAGGTCGC CATGAATTCC TTCTGCACGG AAGCCACGAA TTTTCAAACC 1140 

ACGAGTCAAT TCATCCACAC GGCGTTTGGT ACGACCAAAT ACAATAGCGA GTTCTGGTTG 1200 

TGCCACATCC ATGAGACGAG TCATGGTGTC AAATTTTTCT TGTTCCTTAA CACGGATATA 1260 

GTACTGGTCA ACCAATTCTG TTGTCAATTC CTTAGCCGCA ATCTTGACAT GTTCAGGGGC 1320 

TTTCATAAAC TGAACACCGA TACGTTTGAT GGCATCTGGC ATAGTTGCTG AGAAAAGCAA 1380 

AGTTTGACGG TTCTCAGGTA CACGGGAAAT AATGGCTTCG ATGTCTTCAA GGAAGrcCA.T 1440 

GTTAAGCATT TCATCCGCTT CGTCAAGGAT AAGGGTTTCA ATGTCTTGTA ATTTCAAGGC 1500 

CTTGCGTTTA ATCAAGTCCA AGAGGCGACC TGGAGTTCCC ACCACAATAT GGGCACCAGA 1560 

TTTAAGAGCC TTAATTTGTT TTTCAATGCT TGATCCGCCA TATACTGAAC GGACTTTGAC 1620 

TCCCTTACTA CGACCAAAGC GGAAGAGTTC TTCTTGACTT TGGACAGCTA GTTCACGAGT 1680 

TGGAGCGATG ACCAAGGCTT GGATAGTCGC TTCTTCTGTA CGGATTTTTT CAAGGGTAGG 1740 

CAAGCCAAAG GCTGCAGTTT TTCCTGTACC AGTCTGAGCT TGACCGATAA CATCCTTGCC 1800 

TTCAAGGGCC AAAGGAATAG TTTGTTCTTG GATAGGACTA GCTTCTACAA AACCAGCTTT 1860 

TTCAATTTCT GCTAGCAAAT CAGCAGACAA GTTTAATTCA TTAAATTTCA CGTTATTCTT 1920 

CTTTCTAAAG GTGGTGCGAA GCCACCCTAT AGGGCTTAGT TTATACTTTT CTTTTTATGA 1980 

CGTATTTTCA TATAACTAGA TATAAAATCG TGTTGCTTCT TTTCCACAAA AGAAAAGTAC 2040 

TGTTTTCTTT GCAACCTATC TAGTATAACA CAAGACCAGA GCAAAAGATA GCCCCATTTC 2100

TACAGAAAAT CATGTAAGCG CTTTTTGACT TTCTTTTTTG ATTGAACGAC CTAGATAATA 2160 

AGACAAAGCC AAGGCGATAC TGTATAAAAT GAGAAAAACG AACAAGGTTT GTGTGTACGA 2220 
ATGAGCCATT TTATAAGTCT CTGCTAATAA AATAGGTCCC GCTAAACCAG CCATTGCCCA 2280

AGCTGTTAAA ATATAACCAT GCAGAGCGGC CAATTCCTTG GTTCCAAAAA TATCACTGAG 2340 

ATAAGCTGGA ATCAAAGAAA AACCAGCTCC ATAGCAAGTC ATCAAAATAG ACATAGCAAC 2400 

TACAAATAAA ACGGAATCTG TAAAGAGCCA AAGTGAGAGA GAAAAGAAAA GATTGACAAG 2460 

CAGTAATATA CTAAAGGTTA GAGGGCGACC GATATAGTCA GACAAACTCG CCCAGAGCAA 2520 

GCGACCAAAT CCATTGAAAA TCCCCAAAAC ACCCACCATT ACTGCTGCAT GACTTGTAGA 2580 

CAAGCCAGCC ATCTCCTGTG CCATTGGCGA TGCCGCTGAA ATTAAGCCTA AACCACAAGC 2640 

TATGTTGATA AAGAAAATAA TCCAAAGCAT ATAAAACCGA TTGCTTTTTA GAGCCTGATT 2700 

TGCAGCCATT CCTTGCGTCA AAGAGGCTGT TTTTTCTTTC CCTGAAGAAG ATAAAATTGC 2760 

AAGCTCTTGC TCATTTGGAC GCTTAATGAA TTGTGAAGCT AGGAGCATGA TAATAAAGTA 2820 

ACTTGCTCCT AAAATATAAA AAGTTTCTAC AAGCCCTACC CCTGCGATGA GGTGTTGCGC 2880

TATGGGACTA GTCAATAAAG AAGCAAAACC AAACCCCATA ATCGCTAAAC CTGTTGCGAG 2940 

ACCACGTTTA TCAGGAAACC ATTTTATAAT CGTCGACACA GGGGTAATAT AGCCTGCTCC 3000 

CAAACCAAGC CCACCTAAAA TGCCATAAGC GAGATACAAC AACCACAGCT CTGACGGTCT 3060 

ATTGCAAATC CTGTTAAGAT ATTTCCACCT GCGTATAGAA AAGCAGATAG ACTTCCCATG 3120 

ACTTTCGGAC CAAATTTTTC TACCAAACGC CCCATAAATG CAGCCGATAA GCCCAAACAA 3180 

AAGATTGCTA GACTAAAGGC GAAGGCAACA GAAGCCTGAT CCCATCCCGT 3230 



(2) INFORMATION FOR SEQ ID NO: 204:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5096 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 204:

CCTATGAAGA CTGTCCCAAC TGGGTGTCCT TCTAGGCTAT CTGGTCCTGC CACTCCAGTC 60 

AAACTAATTC CAAAATCAGA CTGGGTCTTG CTTCGTGCCT GCTCAGCCAT CTTCTGAGCT 120 

GTAAATTCAG ACACCACACC ATGTTCTTCC AAATTCTTGG CAGGAATATC CAACATCCTT 180 

GATTTTTCCT CCAAGCTATA GGTCACAAAA CCACCCTTAA ATATACTTGA AACTCCAGAA 240 

AAATTCGCCA CGGTAGCTTG GAAAAGACCT GCCGTCAAAC TCTCTGCAGC CGCGATGGTT 300 

TTCCCTTGCC TTTTCAGTTC TTCTACCACA ATGCTGGCTA AACTAGTTTC TTCCCCATAA 360 

CCATAGCAAA AGTCTCGTAA AGAAATTCCT TCGAAAGTCT GGCAGTCCAA GATTTGATTT 420 
TCCAAGATAT CCAGCGCTTG ATTCGCCTCT TCTTGACTGC TAGCCTTTGT TGACAGACGT 480

AGAGTGACTT CTCCTGTCTT GGCATAAGGG GCCAAGGTAG GATCGATCTG ATTATCAATT 540 

AAATCAGCCA AAATCGTAAC CAACTGGCTC TCGCCAATCC CAAAGAAACG AAGAACTCGG 600 

GAATACAGCT TGCTCCCTGT CATCAACTTG GGTAGAAGTT GGTTTAAGAC CATGGGTTTC 660 

AATTCACTTG GCGGACCTGG AAGGACGACA TAGGTCACTC CGTCTACTTC TAATTTTCCT 720 

CCAACAGCCA GTCCTGTTTC GTTTGGCAGT GGAATCGCTC CTTCTACAAT TTGAGCTTGT 780 

CTTTCGTTAT TCGGTGTTCG GGCATAGTCT GGTCGCAGGG TAAAAAAGAT ATCCAACTTC 840 

TCCTGAGCCT GAGGATCAAA GACTAATGCT TTCCCTAAAA ATTTAGCTAG GGTTTGTTTG 900 

GTTAGGTCGT CCTCAGTTGG CCCCAAACCG CCTGTCAAAA TCACCAGACT GCTACGTTGA 960 

CTGGCAATCT CAAGCAAAGA CAAGAGACGA ACTTCATTGT CTCCTACAGC CGTCTGAAAA 1020 

TATACATCTA CCCCAATCTC AGCTAGTTTT TCCGACAAAA ACTGGGCATT GGTGTTGACA 1080 

ATCTGCCCTG TCAAAATCTC TGTTCCAACA GCAATGATTT CTGCTTTCAT GTTTCCTCCT 1140 

ACCTATCTAT TCGTATTTTT TTGAAAAAAT CGCAGGAATT TTCCTACGAT TGATTTTTTT 1200 

ATTTGTATCA AAAGTTAATT ATCTTCATCA CCAACAGGTG CTCTGCCAAA TAAATCTTCA 1260 

AATAAAACCG CATTGGTTTC AAGCTGAGTA ACTTCTTCTT GTCCCAAAGA ACGTCGGAGT 1320 

AGATTTTGCA TTTCCAACAT ATGTGCTCTC GAAACAATCT GGTAAGAAAC ACCTTGAAGT 1380 

ATCTCTCCTT CACCCTGCAA CTGCTGAGTT TCAATGGTTT TAAATGAATC TTTATAGCCT 1440 

AGCAAGTTAG GGATACTTTT TGCAGACAAA TCAATATTGG TCTGCATATT GTCACTCAAA 1500 

GCTTTTAGAA TCTCTTGATA ATGACCAATG CTATTTAAAC TGAGAGCTTT TTCCATGACT 1560 

TTTTGAATAA CTTCACGTTG ACGTTTTTGA CGACCATAAT CCCCCTCAGG ATCTTGGTAA 1620 

CGCATTCGTG CATAGACTAG GGCTTCTTCT CCCCCAATAT GTTGCTCCCC AACACCGATA 1680 

GAAATAGTAT TAAATTCTTC TTGGTCACTG ATAGAAATTG GGAAACCTAG GATATTATTG 1740 

ACTGTAATAC CTCCTACTGC ATCCACTAGT TTTTGCAATC CTCTCATATT GACCATCACA 1800 

TAGCGATCAA TATGGATATT CATCATTTTT TGAATGGTTT CTATAGCAAG CTCTGCTCCA 1860 

CCATCTGCAT ATGCTGAGTT CAGTTTCGCT TCATGAGCCT GACCATTCCC TGATTCAATG 1920 

CGCGTCAGAA TATCCCGCTC TAAACTCATC ATTGTTGTTT TTTTCGTTTT AGGATTCACT 1980 

GTCATCAAGA TCATGCTATC ACTTCTACCG ACCCAAGTTT CAGTTCGTTC AACATTTCCG 2040 

GTGTCCACTC CCATTAACAG AATGGTTAGA GGTTCAGTCG CTTCAATAAC CTTGGTTTCT 2100 

TCACCGATTT TTTTATAGGT TTTAGCTAAG GTTTCTGTCC CTTGTTGATA AATAGTATAA 2160 
GCAAAAACAC CTACTCCTAC TACAGTTACA GAAAGTAAAG CTAGCACCAT TCCAATAATT 2220

TTTTTAACCA TATTTCTACT AACCTATCAG TTTACCCATC AAGTAAACAT CGATAAATTT 2280 

CCCTTCTTCT ATATATGCCC CACGCTCTTG GCTACCTTCA ATGACAAAGC CATGCTTTTG 2340 

ATAAAGATGG ACTGCTGCTT GATTACGAGT TTGGACAGTC AGTTGGAGAC GACGCAGAAT 2400 

GCCACTTGCT TGTGCCCACT CTATCGCTTC TTCTAGCAAC AAACTTCCCA AGCCATTATT 2460 

CCAATATCTT TTTCCAATCA CAATGAAGAG ATCTCCAATA TGACGGACTC TCTTACGCTG 2520 

ATCAGCTGTA ATATTTACAA TACCAGCAAT TTTGCCATTT AAGAATGCAA GTAAGGTTAT 2580 

CTGATTGTCC GAACTAGCTT GCTTGTTGAG GAATATTTCC ATCTCCTCAC TAGTCAAGAG 2640 

AATACCATCT CCGTCTAGGC TGGTAAAGTC TGTCTCCAAA CTCACACGAT TTAAAAAGGC 2700 

CACTAATTCA GCTGCATCTT TGGGCTCTGC TTCCCTAATG AGCAATTCAT ACTCCATATT 2760 

GAAGCTCCTC TAACAATTTC TCAGCACGCA AACCCTTTGC CTGAAAATTT AAACGGCGTC 2820 

CATCTGCTTC TTTTAGAATT TCCAATTCTA AATAAGCATC TGGCAAGGCA TCTCCTAAGA 2880 

GATTTCCCCA CTCAATAACA GTCACGCCGC CACCAAAGAT AAACTCATCC AAGTCGATAG 2940 

AATCAGCATC TCCTTCAATA CGATAAACAT CTAGGTGATA AAGTGGAAGT CGACCTTCAT 3000 

ACTCTCTCAC GATAGTATAG GTGGGACTTT TAATCATTTG AGAAATCTGT AATCCTTTTG 3060 

CAAGTCCTTT AGTAAAGGTC GTTTTACCTG CACCCAGTTC TCCAGTTAAG ATTAAAACAT 3120

CATTCT-TTGC TAATAGAtUG CCCAAACGCT CCCCTAAGGC TTGCAACTCT TCTTCATTTT 3180 

TTGTGTACAT ACTCTTATTA TACCAAAAAC TTTTCTTTTG TGTCTATTTT CCTACTAAAC 3240

TTATCATCAT AACATCCATA A7VAAACAGGC TTTCTCTAAA AGAAAATGAG CGTAACAATG 3300 

ACCAATACAA GATCTCGGAA AATATGACCA TAAAAGGAAA CTTCCTTCTT AACCGAATTT 3360 

GGGACAAGAT AGGCTGCAAA AAACAAGCCC AGTCCAATAT AAATCAGAAG TGAGACAATG 3420 

GTCATTGGAT TTCTTAAGAA AAGAAGTGTT GCTAAAATAG TCACCAACAC TGTCTTTTTT 3480 

CTGTCCAGCA TAGCAAGAAA ATCGCGCACG TATTTTTTCA AGGGTAAAAA AATCAGCAAA 3540 

TC?TAGCCCAA ATAGGAAAAA GAAGGATGGC AATAAAAAGT CAACTAATTC TTGCTGCAGC 3600 

GTATTTTTGA TGAACAAGTT ATCTGACAAA ACAAGAACAG CTCCTAACAA ATTAATTAAG 3660 

AGTAACATAC TGTAAAAAAG CTTCACCGAC TTCTTACTGG CTAGGACACT ATGGACTTCT 3720 

TGCTTACGGG TATAAAGATA ATTTACTCCA GCACAGATTC CTGAAACGAA AACCATGCTT 3780 

CCGATGAAAA AAGCTGTACT TTGTTTAAAG GACAAGATGC ATTCCTTCCA TAGGAAACAG 3840 

CTACTCAAAC TGATTTGAAT TAAAGCTAAC AAAAATAAGA TTCTCATTGA TTTCATCTTC 3900 

TCTCTCCCTT CCTACCAATC ATTATACTAG GAGAAAAGAG AGAACTGTTT CTAATCTTCT 3960 
CAAATGTCTC TTTAAGACGC TAAACAAACA CTAGAGACTA ATAPTPAATf: AAAAl ^AAAVj 4020

ATCAAACTAG GTAGCTAGCC ACAGGTTGCT CAAAACAGTG TTTTfiAflATT ppa PATH par* 4080

CTGACGTGAT TTGAAGAGAT TTTCGAAGAA TATAAATTTG AAATPaTnua a ai'PPPfppB a AAi 1 ^AA 4140

ACGGGTGGTT GTTTTGTCTC GCACCTCACG RAfipnanApn Ort O V, U/IU A V. Ij a PTP 1 P a f^T CACATAATTA 4200

TAAGGCTGAT AGTATTAATC TAACTATCAR r* ^ Tmp a ririTT A 1 i. 1 AAL^a 1 J TCAGAAAAAC 4260

TATAATGTCA AG A TT A A OT* A AAr'A^lTATPT a PTTrr'TT'/^ a AATAATTTTC TATCTTCATC 4320

AACATTAAAG GATTGTTATA & & PTPTPTTT' riAL. 1 1_ i L. 1 1 ^ t-TTCTATATA ATAATTTTTG 4380

ACTTGTTCTC TGTCTAGAAA 1 J. ^a a^i^ an^K a^ L. AAvj AAl AAG TAGAGGAGCC 4440

AATTGGTAGC TTGTCTGTCT X X\3A. I. i n^tWo A(.3 I X CAA i TTTCAAGAGC TTCTTGGATG 4500

GCTTCATTAT Tf5 AT HPT a f^r* 1 Ala i VjALiLtj 1 AGTTGTTIACG AACTCTGATG 4560


TAGCCAAATA AAAACTCTTG ATGGTCCAAA TTTTTTGTCT GATACAACTC TATTAAATGA 4620

GAGTAGTTTG CCTCATATTC TTGTTCACGA CCCACTAAGG AATAGAAATT AGATAGAGTA 4680

TTCAACGCCT TTAAATAAAT CAGAGTATTT GAAGAGACTT TTAATAATAT ATTTTCCAAT 4740

GACGAAATTG CCTCACACTT ACTGTCATAT TGATAGAAGT CAATTATAGA TTTAATCCAT 4800

TCAAGGTAAG TTCGGTCTTC TAATGTTAGA AAAGTGCTTC GTTCTACTTC TATTTTATAA 4860

AGATATTCTA AATCGTCATA ATTTCTGTCA TCTAATAGGC GAGCAGATAG ATGTTTGAAA 4920

TTAGAGAGGT TAGACTTAAC TTCGATTTGT TCATTGAAAA AGTAATCCAA AGGGACTTCA 4980

AGTCGTTGAG AGAGTTTGAA TAACAAGTCT GCGGAGGGAA TAAAATGACC TCTTTCAATT 5040

TTACTAATCT GGCTTTGTTC ACAAATTCCT TCTGCAAGAG TTTGTTGGGA GAGTCT 5096



(2) INFORMATION FOR SEQ ID NO: 205:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2395 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 205:

ACAAGATAAA AATAAAGGAT TACAATGGGG AATATAAAGT AAACCGGTAA ACCTAAAAAG 60 

AAAGGAGAAA AGATGAAAAT TGTACTTGTA GGGCATGGAC ATTTTGCTAC AGGGATTTAT 120 

AGTTCTTTAC AATTGATTGC AGGTAATCAA GAAAATGTGG AGGCGATTGA CTTTGTGGAA 180 

GGAATGTCAG CAGATGAACT CAAGCAAAAA ATCTTACTTG CAATTTCAAA TGAAGAAGAA 240 
GTTTTAATCC TAAGTGATCT CTTGGGAGGA TCGCCATTCA AGGTTTCTTC TACCATAATG 300
GGAGAAAATC CAGCCAAGAC AATGAATGTT CTCTCGGGTT TGAACTTAGC CATGTTAATG 360

GAAGCAGTCT TTGCTAGAAT GGCTCATAGC TTTGATGAGG TTGTTAATAA ATCAGTAGTG 420 
GCGGCCCAGG GCGGAGTCGT AAATGGTAAA GAATTGTTTT CAACGGATGC AGAGGAAGAG 480 
GAAGAAGATT TCGAATCGGG TATTTAAAGG GTAAAAGAAT GATAAAAAAG GTTACGATTG 540 
AAAAAATAAA ATCGCCTGAG CGCTTCTTAG AAGTACCACT TCTGACGAAA GAAGAAGTCG 600 
GCCAGGCAAT CGATAAGGTT ATTCGGCAGT TAGAACTCAA CCTTGACTAT TTCAAGGAAG 660 
ATTTCCCGAC GCCAGCTACC TTTGATAATG TCTATCCAAT CATGGATAAC ACGGAATGGA 720 
CCAATGGTTT CTGGACAGGA GAACTGTGGT TGGCTTATGA ATACAGTCAA CAGGATGCAT 780 
TTAAAAACAT CGCTCATAAA AATGTTCTTT CTTTCCTGGA TCGTGTCAAT AAGAGAGTAG 840

AATTGGATCA CCATGATCTC GGCTTCTTGT ACACACCGTC TTGTATGGCT GAATATAAGA 900 
TA7VATGGAGA TGGAGAGGCT AGAGAAGCAA CCTTGAAAGC TGCAGATAAG TTGATTGAAC 960 

GCTATCAAGA AAAAGGTGGT TTTATTCAAG CTTGGGGAGA CTTGGGCAAG AAAGAGCATT 1020

ACCGTTTGAT TATCGACTGC TTGCTCAATA TCCAACTCTT ATTCTTTGCT TATCAAGAAA 1080 

CAGGCGATCA AAAATACTAC GATATTGCAG AAAGCCATTT CTATGCTTCA GCTAATAATG 1140 

TAATCCGTGA TGACGCTTCG TCCTTCCACA CCTTCTATTT TGATCCTGAG ACAGGTCAAC 1200 

CCTTTAAAGG TGTAACGAGA CAAGGGTATA GTGATGATTC ATGCTGGGCA CGTGGTCAAT 1260 

CATGGGGAGT CTATGGTATT CCTTTGACTT ATCGTCACTT AAAAGACGAG tCCTGCTTTG 1320 

ACTTGTTTAA GGGTGTGACC AATTATTTCT TGAATCGTCT GCCAAAAGAT CATGTGTCCT 1380 

ATTGGGATTT GATTTTTAAT GATGGTAGTG ATCAATCACG AGATTCTTCA GCAACAGCTA 1440 

TCGCCGTCTG TGGGATTCAT GAAATGCTAA AACATCTCCC AGAGGTGGAT GCTGACAAAG 1500 

ATATTTATAA ACATGCTATG CATGCCATGC TTCGTTCCTT GATCGAACAT TATGCAAATG 1560 

ATCAATTTAC CCCTGGTGGG ACAAGTCTCC TCCACGGTGT GTACTCATGG CATTCAGGTA 1620 

AAGGAGTGGA TGAAGGCAAT ATCTGGGGTG ACTACTATTA CCTAGAAGCC CTTATCCGTT 1680 

TCTACAAAGA CTGGAACCTA TATTGGTAGG AGGAGAAATA TGACAATGCC AAATATTATT 1740 

ATGACCCGTA TCGATGAACG GTTGATTCAT GGACAAGGAC AACTTTGGGT AAAATACCTA 1800 

GGTTGTAATA CGGTCATTGT TGCCAATGAC GAAGTAAGCA CGGACAAGAT GCAACAAACT 1860 

CTGATGAAAA CAGTTGTGCC AGACTCAGTT GCCATGCGTT TCTTCCCTTT GCAAAAGGTG 1920 

ATTGATATCA TTCACAAGGC TAATCCTGCT CAAACGATCT TTATCGTTGT AAAGGATGTG 1980 

AAGGACGCTT T/ACCTTGGT AGAAGGTGGT GTCACTATCA AAGAAATCAA TATTGGGAAC 2040 




ATTCACAATG CCCCTGGTAA AGAGCAAGTG ACACGCTCCA TCTTCCTGGG TGAAGAGGAC 2100 

AAGGCGGCCC TCAAGGAATT GAGCCAAACT CATCAAGTAA CATTTAATAC GAAAACAACT 2160 

CCAACAGGAA ATGATGGAGC TGTTCAAGTC AACATTATGG ACTATATTTA ACAGAGGAGA 2220 

TCGTTATGTC GATTAATGTA TTTCAAGCGA TTTTAATTGG ATTATGGACA GCTTTCTGTT 2280 

TTAGTGGAAT GCTGTTAGGA ATTTACACCA ATAGATGTAT TGTTCTGTCA TTTGGTGTCG 2340 

GAATTATTCT AGGTGATCTG TCATGCTCTT GCAATGGGAG CCAATGGTGA ATTGG 2395 



(2) INFORMATION FOR SEQ ID NO: 206: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3342 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 206: 

CCTTCTTTAG AGGTTAATTT TGCAAAATCG TCGATTGTTA TATAAGGATT ATTATAGAGA 60 

CTGTTCGCAA AGAATCTCTG ATATGTTTTT GAATCTTTTG AATACAAAAC TATCTCTCTA 120 

ATAGCATTGC CATCTGTTCC ATCAATTGGT AAACATACCG TAACTAGAAA AAGAATTATA 180 

TTCAAAATAA AAAATTCTGA TGCGTACGGC ACAAATCCCA AAAGTfiCTAA TATTGCGACA 240

ATTAGGTTAG CTCCACCTCC CCCAAAGAAG TAGAACACCA AATTCCTATC ACTATTTTTT 300 

TCATTAGTAA TGTTTCTATT ACTCATTTGA CAATAACCGA ATGCTAATAA CACTGGAAAT 360

TTGAAATATA TTTTTTTTCT GAAATAGAAG AAAAAGGGAG TAGCAAGCAT CTCTAGTTTA 420 

TAAGATAAAC ATCTTCCCAC TAAAAAATGA CCTAGTTCAT GTAATGTAAT TGATATTAAC 480 

GAAATTAAAA TCAATCGAAA ATAATAGATT AATGAATCAT TTGGAAAAAT TATCAATAAT 540 

AGGAACAATA ACGGAATCAA ACATAAATAT ATGACAGAGT TATTTAATAT TTTCAACATA 600 

ATACCATTCC TCTAAACTAT TAGCTTCAAA AAGGCGTTTT TTCTCCCAAT ACATCTTCTC 660 

AAAATGTTCG GAATCATAAT TTTCTAAAAT TAATTTTAtG TCTGGTAAGC TCTTTCTTGA 720 

TAATCCGTTG TTTTGTACTT AATTTTCCCT TCAAGTACAT CTTCAATTTT ATAAGTTGCC 780 

TCCATCAACT GAGCCTCTGC AATATCTTTG AGTGAATTGG TAATTGAAAC TTGGTGTAAT 840 

ATCTGTCCts CCATATATGA AAATATATCT CTAAGATATT CTGACACATT ATCAGAGCCG 900 

TTACTCTCAG CAACATCTAA TGTTACAACA AACTTTCCAG CTAATCGAAA AAGATGGCTC 960 

CACCCCCCAA TCCTTTCAAT AAAGTTTTTT GTGTCCACAG ATACGTTTTG TAAATATACA 1020 
GGAGAAGAGA TAATTATAAT ATCAGACTCT AATAACTCTT TTTTTATAAC ACCTCCATCA 1080

TCAGCATTAC TTTGCCTATC AATTCCTTTC TTAAACAACT CTTCTGAATC AGAATTAGAT 1140 

ATTTCTAGCT CTGAATTGAA AGGTGTCCTG AAAGATATAT CAACATTATT TCTACTAGAA 1200 

ATGATACTTG AAAGTCTCTT AGTATACTCT AAAGTCTTAG AGTTATGATT TCGCACTCCT 1260 

GCATATATAA ATATTTTATT CATTTTAATT CATCCTCTCA ATTTGAATTT AGTAGATTTT 1320

TCAAGATAGT ATGGTACAAA AACAGACTTT TGTTGACTCA CATTATTACA TATGTTTTGT 1380 

ATTAAACCAA AATCAATACT ATTTTTGGAG TAATTTTGAT TTTAGTTTAA AATCATTTCT 1440 

ATAACAGTAG CATATACCTC AAGCCGTTTA GCAATTAGAA TAGAACTTTT CTTTATTATA 1500 

TTATTATCTC AACGAAAAGC TACACTATTA AAAATATTTT ATAGAATTAC ATATTAAACT 1560 

AGTCAATCTT GGTATTTTTA TATTGCTTAA TGAGTGGACA CCTCTATTTT AGAAACAAAA 1620 

CTATAAATTA AGCTAGATTT CAAGTAATGA GGGGATAACT ATCTTTTTGT CATTCTGATT 1680 

CAGTGCGATA TACCTTAAAA AAGTATAAGC AATACCAGTC ACACCTGTAT ACAAAGAAAA 1740 

ATCTGGGAAA TTGCTTGTTT GGACGATACG ATACTCTCCT TCTTTTGATT TATTCATTAC 1800 

AACACTACAC AATAAAGACT CCAATTCCAT ACTAGTATCC ATTTCTTTCA TGTAGTCGAT 1860 

GTAAAAATTT ATTATGGCCA TACTTCCATG GCAAAATGTA TCATTATCTA AACTAGCTAC 1920 

TATTCCCTCT GGAACACTTT GGGGATGATT AACTAATGTC CCAAATTCTC CACTACACCA 1980 

CTTCAAAGAA TGAATTTTGA TTTTCTCCCT AGGAACTAGT TGTAAAATTA ATTCTTTATA 2040 

TTTTTTAAGT CTTGTCACTT TATAAATATT TTTTAATGTA AAAATTACAC CTGATAGTCC 2100 

ATGGCCAAAA CTATATCCAA AATTACTATT ATCTCTCTCG CTTACATCAT TATATAGCGT 2160 

ATCACCTAAA CTTAATACTA GCCTTAGAAC ACGTTCCTTC TCTATTCCTC TCCTATAATA 2220 

TCTTACCAGT GTATTAATTA AAGGTAGAAG ACCATTAATA TAGTCAGACT TGTTTGAAAC 2280 

ACTTGCAAAA TCAGTCTTTT CAAGCTCAGT TAAAACACTC TTTATATAAT TTAAGCATGC 2340

GAGAGTATTT GTATCGTAAT CCTCTATAAT GGATAGAACA ATGAAATATC CTATATCCCC 2400 

AGTTAAACCA AATGTGGTCT TAGATAAAGA AACAGATGGC GGAATTGCAG ATAACATTTT 2460

ATTGTACAGT TGAGTATATG ATGATTTATC TTTCAATAAT TTTACATAGT ACATAAACAG 2520 

TAATATTCCA GCTCTACCCC TATACATATC ATTmCCCGTT TGTTCAAGAC ACCATTTAGA 2580

ACCTTTAAAA TTAACAGGTA TACTCCAAAT TGGATATTCG TCATAAATAT TATTAATAAC 2640

CAAAGAGTCT GCAATATTTT CTACTTCATT ATGCAGAATA GTAACTAAAC TTTCATTTGG 2700 

GAGTTTTTTT CTATTAGATA AGTTTAATTT ATATCCTTTT TTTCGCTGAT CAAAGCTTGG 2760 

AAAATAAATT TCAATGATAT CAAGTTGCTT TTCTAAATTT TCCAAATTAT TATTAGGTAA 2820
ATATTTCATA AAATAGTCAT ATCCAGAAAA TTGATGTAGG GAAATAAAAT GATTTCCAAA 2880

ATCATCGTAG ATTTCATTGA TATTTGTATC TGTATAAAAA ATCGGAATAT CTAATAACCT 2940 

CATTTGTTCA CATTCGCTTG CTACAATACC TTGATTAG7A AACTTATTGC TCCAGAGATT 3000 

TTCCAATGCT TTTTCTCTAT CTAACATTTC TTCATAAAAA TCAGGATGAT ATAAAAAAGA 3060 

TAGTACTGAA GCATAGCTAT TTGTGTCTCT AAAAAGTACC CTTGTCTTTA AACCATACAA 3120 

GTTTGCTTTT AATAGCATTT TAAATTCTTC TGTTTTATTT AACTCTTCAA ATATCAGATA 3180 

AAAATCCCTA AAACCTTTTT TGAAATCTTT TATATACTTA TCAAATTCTA TATCACCATC 3240 

CCGAACAGGC AGGTTTTTCC CACCTTCAAA ATCAATTTTC CCAATATCAA ACTTTACCTT 3300 

ATCAGTATTT AAATTAATTA AAACTTGACC AGGGATCCTC TA 3342 
(2) INFORMATION FOR SEQ ID NO: 207: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3454 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 207: 



GAGAAAAGAA TGTTAAAGAA AAATGATATT GTAGAAGTTG 60

GAAGGGGCAG GAGTTGCCAA GGTAGATGGT TTGGTCTTTT TTGTAGAGAA TGCTTTACCG 120

AGTGAAAAAA TTCTCATGCG TGTCCTCAAG GTCAATAAAA AGATTGGCTT TGGAAAAGTT 180

GAAAAATACC TTGTCCAGTC ACCACACCGT AATCAAGATC TAGATTTGGC TTACCTGCGT 240

TCAGGAATCG CGGATTTAGG ACACCTTTCT TATCCAGAAC AGCTCAAGTT TAAAACCAAG 300

CAAGTCAAGG ACAGTCTCTA CAAGATTGCT GGAATTGCAG ATGTAGAAGT TGCTGAAACG 360

CTTGGTATGG AACATCCAGT CAAGTATCGC AATAAGGCGC AGGTGCCCGT TCGTCGAGTG 420

AATGGTGTCT TGGAAACAGG ATTTTTCCGT AAGAATTCGC ATAACCTCAT GCCCCTTGAA 480

GATTTCTTTA TCCAGGATCC TGTCATTGAC CAAGTCGTAG TAGCTCTTCG AGACCTGCTC 540

CGTCGTTTTG ATTTAAAACC TTATGACGAA AAGGAACAGT CTGGATTGAT TCGGAATCTT 600

GTGGTGCGTC GTGGTCACTA TTCAGGACAA ATCATGGTCG TTTTGGTGAC AACTCGTCCA 660

AAAGTTTTTC GTGTTGACCA ATTGATTGAA CAAGTTATCA AGCAGTTCCC AGAGATTGTG 720

TCTGTCATGC AAAATATCAA CGACCAGAAT ACCAATGCGA TTTTTGGTAA GGAGTGGCGC 780

ACTCTTTATG GTCAAGACTA TATTACGGAC CAGATGTTGG GAAATGACTT CCAAATCGCT 840
GGCCCAGCCT TTTACCAAGT CAATACTGAA ATGGCGGAGA AACTCTATCA AACAGCCATT 900

GACTTTGCAG AGTTAAAAAA AGATGATGTG ATTATTGATG CCTATTCTGG TATTGGAACC 960 

ATTGGTTTAT CAGTCGCCAA GCATGTCAAA GAAGTCTACG GTGTTGAACT GATTCCAGAA 1020 

GCAGTAGAGA ATAGCCAGAA GAATGCTTCT TTGAACAAGA TTACTAATGC CCACTATGTC 1080 

TGTGACACGG CTGAAAATGC CATGAAGAAA TGGCTCAAGG AAGGTATTCA ACCAACCGTT 1140 

ATCTTGGTTG ATCCTCCACG CAAGGGCTTG ACAGAAAGCT TTATCAAAGC AAGCGCCCAA 1200 

ACAGGAGCCG ATCGCATCGC CTATATCTCC TGCAATGTCG CAACCATGGC GCGTGATATT 1260

AAACTATACC AAGAGTTGGG ATATGAATTG AAGAAAGTCC AGCCGGTGGA TCTATTTCCT 1320 

CAAACGCATC ACGTCGAGAC GGTAGCACTT TTGTCCAAAC TCGATGTCGA TAAGCACATA 1380 

AGTGTTGAAA TTGAGCTGGA TGAGATGGAT TTGACAAGTG CGGAGAGCAA AGCAACATAT 1440 

GCTCAAATCA AAGAATATGT TTGGAATAAA TTTGAATTAA AAGTTTCGAC ATTATATATT 1500 

GCACAGATAA AAAAGAAATG TGGAATAGAA TTACGAGAAC ATTACAACAA GTCTAAAAAG 1560 

GATAAACAAA TTATTCCACA GTGTACACCT GAAAAAGAAG AAGCCATCAT GGATGCTTTG 1620 

AGACACTTCA AAATGATTTA ATAGAAAAGA ATGACAGTAT ATGACTTTCT GCATTTATTA 1680 

CATTCCTACT TGGTATAGGA ACAGCTATTA TTCCTTTCTT GCAAGGTATC AATTAGAAAA 1740

TAGGCTCAAT ATAAAGATTG ATAGGATCAT TTTTATATTT AAAGGAGCGT TGAAATGATT 1800 

GATAAAGGCA ACAAAAAATT TTAGGATAAA TTTGCTAAGT TGTATGCCTC TTTTATGAAA 1860 

AAAGATAAAG AGGTTTATGA TAAAGTTTGT GAATATCTTA GTCCTCATTT GAATAAAGAT 1920 

ATGGAGGTGC TTGAACTTGC TTGTTGGTTT CGTGTCATAA CAGTTATAGA GGCAAATAGT 1980 

TArGTAAATA TAAGGAGTTC AAGACTTCTA CCAAAGTTTA AAACTCAAAA AATAAATAGT 2040 

TGGTGTGCTG CTTACAATAT CCATTTTAAT AATGGATATT GTAAGCAGCA CCCCcAtGAA 2100 

TTTAAAGATT CTTTAAAGAG TCTTATTTTG TGATGAAAAT TTAATATGTA AATCTCAGAC 2160 

GATAGAAATT AAAAACTCTA TCGTCTTTTT TATACTCAAA ATTAGGAGGT AAAAATGGTA 2220 

AGGATAAGAG GTCCCACTTA AAACAATTTA TGGCAAAATA AGGACGGAAT AACACAACAA 2280 

ATTCTCTAAA ACAAATCACT AAATCAATGT AAGATTGAAT GAAATCAATA TTTATGCTAT 2340 

AATTAAATAA ATTTAATGAA GAAAAAAAGA GGGATATTAT GGCACTTAAC TATAAACCAT 2400 

TATGGATACA GTTAGCAAAA AAAGGACTAA AGAAAACAGA TGTAATAGCT ATGGCAGGAC 2460 

TTACAACAAA TGTTATGGCA CAAATGGGAA AGGATAAACC AATTACATTT AAGAATTTAG 2520 

AAAGAATATG TAAGGCTTTA TCTTGCACTC CTAATGATAT TATTAGTTTT GAAGATAATT 2580 

TTAGTGACGA GGAATAGAAA ATGACTTTAA GGACAGAAGA TCAAGTTAGG GATTATGCAA 2640 



GAGAAGTATA GGCTTTAATG AAGTTGAAGA AAACATCAAT CAAGGTACTG GTCAAATAAC 2700 

TACTTTTAAT CAATTAGGCT TCAAGGGATA TTCAAATAAG CCAGATGGTT GGTATTTACC 2760 

TAAAAATATG AATGATGTAG CAATAATCCT TGAAACAAAA TCAGAAGAAA GAGATATTAG 2820 

CAAACAAATT TTTATTGATG AGTTAATGAA AAATATAGAC ATAATTTAAC TAAAAATAAA 2880 

AACTAGATCC TTTTTTGAAA AAATTATATT ATTAAATTTG TAACTGTATC TATTGACAAT 2940 

GATAATTATT ATCGATACAA TAGACTTGAA ATATGTTTAA GGAGTTTTTA TGAAAaCAAA 3000 

TTTTTTCTAA TmGCTATTTT AGCTATGTGT ATAGTTTTTA GCGCTTGTTC TTCTAATTCT 3060 

GTTAAAAATG AAGAAAATAC TTCTA7AGAG CATGCGCCTG ATAAAATAGT TTTAGATCAT 3120 

GCTTTCGGTC AAACTATATT AGATAAAAAA CCTGAAAGAG TTGCAACTAT TGCTTGGGGA 3180 

AATCATGATG TAGCATTAGC TTTAGGAATA GTTCCTGTTG GATTTTCAAA AGCAAATTAC 3240 

GGTGTAAGTG CTGATAAAGG AGTTTTACCA TGGACAGAAG AAAAAATCAA AGAACTAAAT 3300 

GGTAAAGCTA ACCTATTTGA CGATTTGGAT GGACTTAACT TTGAAGCAAT ATCAAATTCT 3360 

AAACCAGATG TTATCTTAGC AGGTTATTCT GGTATAACTA AAGAAGATTA TGACACTCTA 3420 

TCAAAAATTG CTCCTGTAGC AGCATACAAA TCTG 3454 
(2) INFORMATION FOR SEQ ID NO: 208: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3752 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 208: 
CGGGAGTATA CTTAATATAA TTATAGTCTA AAAATGACTA TCAGAAAAGA GGTAAATTTA 60 

GATGAATAAG AAAAAAATGA TTTTAACAAG TCTAGCCAGC GTCGCTATCT TAGGGGCTGG 120 

TTTTGTTACG TCTCAGCCTA CTTTTGTAAG AGCAGAAGAA TCTCCACAAG TTGTCGAAAA 180 

ATCTTCATTA GAGAAGAAAT ATGAGGAAGC AAAAGCAAAA GCTGATACTG CCAAGAAAGA 240 

TTACGAAACG GCTAAAAAGA AAGCAGAAGA CGCTCAGAAA AAGTATGAAG ATGATCAGAA 300 

GAGAACTGAG GAGAAAGCTC GAAAAGAAGC AGAAGCATCT CAAAAATTGA ATGATGTGGC 360

GCTTGTTGTT CAAAATGCAT ATAAAGAGTA CCGAGAAGTT CAAAATCAAC GTAGTAAATA 420 

TAAATCTGAC GCTGAATATC AGAAAAAATT AACAGAGGTC GACTCTAAAA TAGAGAAGGC 480

TAGGAAAGAG CAACAGGACT TGCAAAATAA ATTTAATGAA GTAAGAGCAG TTGTAGTTCC 540 
TGAACCAAAT GCGTTGGCTG AGACTAAGAA AAAAGCAGAA GAAGCTAAAG CAGAAGAAAA 600

AGTAGCTAAG AGAAAATATG ATTATGCAAC TCTAAAGGTA GCACTAGCGA AGAAAGAAGT 660 

AGAGGCTAAG GAACTTGAAA TTGAAAAACT TCAATATGAA ATTTCTACTT TGGAACAAGA 720 

AGTTGCTACT GCTCAACATC AAGTAGATAA TTTGAAAAAA CTTCTTGCTG GTGCGGATCC 780 

TGATGATGGC ACAGAAGTTA TAGAAGCTAA ATTAAAAAAA GGAGAAGCTG AGCTAAACGC 840 

TAAACAAGCT GAGTTAGCAA AAAAACAAAC AGAACTTGAA AAACTTCTTG ACAGCCTTGA 900 

TCCTGAAGGT AAGACTCAGG ATGAATTAGA TAAAGAAGCA GAAGAAGCTG AGTTGGATAA 960 

AAAAGCTGAT GAACTTGAAA ATAAAGTTGC TGATTTAGAA AAAGAAATTA GTAACCTTGA 1020 

AATATTACTT GGAGGGGCTG ATCCTGAAGA TGATACTGCT GCTCTTCAAA ATAAATTAGC 1080 

TGCTAAAAAA GCTGAGTTAG CAAAAAAACA AACAGAACTT GAAAAACTTC TTGACAGCCT 1140 

TGATCCTGAA GGTAAGACTC AGGATGAATT AGATAAAGAA GCAGAAGAAG CTGAGTTGGA 1200 

TAAAAAAGCT GATGAACTTC AAAATAAAGT TGCTGATTTA GAAAAAGAAA TTAGTAACCT 1260 

TGAAATATTA CTTGGAGGGG CTGATTCTGA AGATGATACT GCTGCTCTTC AAAATAAATT 1320 

AGCTACTAAA AAAGCTGAAT TGGAAAAAAC TCAAAAAGAA TTAGATGCAG CTCTTAATGA 1380 

GTTAGGCCCT GATGGAGATG AAGAAGAAAC TCCAGCGCCG GCTCCTCAAC CAGAGCAACC 1440

AGCTCCTGCA CCAAAACCAG AGCAACCAGC TCCAGCTCCA AAACCAGAGC AACCAGCTCC 1500 

•ItJCACCAAAA CCAGAGCAAC CAGCTCCAGC TCCAAAACCA GAGCAACCAG CTCCAGCTCC 1560 

AAAACCAGAG CAACCAGCTA AGCCGGAGAA ACCAGCTGAA GAGCCTACTC AACCAGAAAA 1620 

ACCAGCCACT CCAAAAACAG GCTGGAAACA AGAAAACGGT ATGTGGTATT TCTACAATAC 1680 

TGATGGTTCA ATGGCAATAG GTTGGCTCCA AAACAACGGT TCATGGTACT AGCTAAACGC 1740 

TAACGGCGCT ATGGCAACAG GTTGGGTGAA AGATGGAGAT ACCTGGTACT ATCTTGAAGC 1800 

ATCAGGTGCT ATGAAAGCAA GCCAATGGTT CAAAGTATCA GATAAATGGT ACTATGTCAA 1860 

CAGCAATGGC GCTATGGCGA CAGGCTGGCT CCAATACAAT GGCTCATGGT ACTACCTCAA 1920 

CGCTAATGGT GATATGGCGA CAGGATGGCT CCAATACAAC GGTTCATGGT ATTACCTCAA 1980 

CGCTAATGGT GATATGGCGA CAGGATGGGC TAAAGTCAAC GGTTCATGGT ACTACCTAAA 2040 

CGCTAACGGT GCTATGGCTA CAGGTTGGGC TAAAGTCAAC GGTTCATGGT ACTACCTAAA 2100 

CGCTAACGGT TCAATGGCAA CAGGTTGGGT GAAAGATGGA GATACCTGGT ACTATCTTGA 2160 

AGCATCAGGT GCTATGAAAG CAAGCCAATG GTTCAAAGTA TCAGATAAAT GGTACTATGT 2220 

CAATGGCTTA GGTGCCCTTG CAGTCAACAC AACTGTAGAT GGCTATAAAG TCAATGCCAA 2280 

TGGTGAATGG GTTTAAGCCG ATTAAATTAA ATCATGTTAA GAACATTTGA CATTTTAATT 2340 
TTGAAACAAA GATAAGGTTC GATTGAATAG ATTTATGTTC GTATTCTTTA GGTACCTATC 2400

TTATGATTTC AGGAAATGTC ATTAAAAAAA CGACTCATTT TCTCTAACCT GAAAAATAGA 2460 

TTAGAGAAAA TGGGTTGTTT TATCTATTAT AGTTATTTGA ATGAAGmTAA GAAGAAGGTA 2520 

TACTCACATC ATTCACATAA TCTGTATATT GACTATAAGT TTTAAAAAAC AATTTTTAAG 2580 

CTCTTCCTTG TCTTCTCTAA CCAAGCGTGT TATAATGAAT ACTGCTCAAG CGACCTTCAA 2640 

TCGTGAAGCA CACACGACCT TCAATCGTGA ATAAACGAAT AGATGGGAGA CTTACCATGA 2700 

GTGATAACTC TAAAACACGT GTTGTCGTGG GGATGAGTGG TGGTGTTGAT TCGTCGGTGA 2760 

CGGCTCTTTT GCTCAAGGAG CAGGGCTACG ATGTGATCGG TATCTTCATG AAGAACTGGG 2820 

ATGACACAGA TGAAAACGGC GTCTGTACGG CGACCGAAGA TTACAAGGAT GTGGTTGCGG 2880 

TGGCAGACCA GATTGGCATT CCCTACTACT CTGTCAATTT TGAAAAAGAG TACTGGGACC 2940 

GCGTTTTTGA GTATTTCCTA GCGGAATACC GTGCAGGGCG CACGCCAAAT CCGGACGTTA 3000 

TGTGCAACAA GGAAATCAAG TTCAAGGCCT TTTTGGACTA TGCCATAACC TTGGGGGCAG 3060 

ACTATGTAGC GACTGGGCAT TATGCTCGAG TGGCGCGTGA TGAGGATGGT ACCGTTCACA 3120 

TGCTTCGTGG CGTGGACAAT GGCAAGGATC AGACCTATTT CCTCAGCCAA CTTTCGCAAG 3180 

AACAACTTCA AAAAACCATG TTCCCACTAG GACATTTGGA AAAGCCTGAA GTACGCAGAC 3240 
TAGCAGAAGA AGCAGGCCTT TCGACTGCTA AGAAGAATiCA CTCGACAGGG ATT-T<^CTtTA 3300

TCX5GAGAAAA GAACTTTAAA AACTTTCTCA GCAACTACCT GCCAGCTCAG CCTGGTCGCA 3360 

TGATGACTGT GGATGGTCGC GATATGGGCG AGCATGCAGG TCTTATGTAC TATACAATCG 3420 

GTCAGCGTGG CGGACTCGGT ATCGGTGGGC AACACGGCGG TGACAATGCC CCTTGGTTCG 3480 

TTGTCGGAAA AGATCTAAGC AAGAATATTC TCTATGTAGG ACAAGGATTC TACCATGATT 3540 

CGCTCATGTC AACTAGCCTA GAAGCCAGTC AAGTCCACTT TACTCGTGAA ATGCCAGAAG 3600 

AGTTTACGCT AGAATGTACG GCTAAATTCC GTTACCGTCA GCCTGACTCT AAGGTGACCG 3660 

TTCATGTCAA AGGAGAAAAG ACAGAGGTCA TCTTTGCGGA ACCACAACGC GCGATTACAC 3720 

CAGGACAGGC AGTTGTCTTT TACGATGGCG GG 3752 
(2) INFORMATION FOR SEQ ID NO: 209:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3580 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 209:

TATTTATATT TTTTTATCTC TGGCATACTT TGATACCTTT TTAGACTTAA AGTCTTTAAT 60 

AGTGCCTTTC CACCTCTTTT TATCTATAAA GATTCTCCTA CATCATAATT CATTTTTTTA 120 

TTTAAACCTT TCTGTCTTAG TTTGTCTTTA TCTTCTTCAT ACCATTTTAA GATTGTCACA 180 

TAGTGGTTTT GATAGGTCTT ACCACTGCTT TCCATGTATC TGGATAGTTT ATTTATCATT 240 

ATATCTGTGT GTGAGTTTAA TTTTTCTTTT AGATTTTTAT ATTCTTCTTT GCTTAACCTT 300 

ACATTTTTGA ATTCTCCATA AAAAATGGGG GTGGACTTTT TATCTATCTC TCCCTCTCTC 360 

TCTTTATCTA TCTCTATATC TTTCCATGTA ATTCCAATCT GGAGTACCTC TACTGTCTAT 420 

CGGTAATTTA ATTTTGATAT CTGGCAATAC TGTGCTAGAT ATTTGATCTT TATATTCAGT 480 

ATTTTTTAAA GCTTGCCTAA TAATTGAAGT TAAATAGAAT GCTACTTCTT TATTCAATTC 540 

TTTATTTTTT AATTTTAAAC AATGAATTTT CATATCTAGG CTTGCTTTAT ATTTATGATA 600 

AAAGACTGCT CCTAAAAATG AAACAGATAT AAAATTTTCA AAAACTCTAT AATTTTTATC 660 

ATCTATATCT TCGTAGTAAC CTAAGATACC ATTGTCAATA TTTGTAGCAC TAATTCTAGG 720 

AGTTTTTCCA TCGAGTAAAT ATCTTTTTGG AATAGATGAG CCTGTTGGTA CTTAACTCGA 780 

TTTCCCCTTT TTTTCGGTAA TAAATATTTC TTTTTATTTT GTTGTCTGAT ATTTTTCCTA 840 

CCTGTCCTTT GTAGGATGAG TATTTTCTAG ATTTTCyTGA ATAACTTTTT ACTTGAAGTT 900 

TTAGCTTTTG AACTAGTCGT TGTACTTTCT TTTTGTTTAT TATCAGTCCT GATCTTTTTA 960

ATATTGCTGT TATTCTCTAT ATCCTATTTT TCATTCATGA TATTCTTTTA CTAATTTTAT 1020 

CTTAAATTCT GTGCTGTATT TGCCATTAAA AAACTGACCT CCTTTAGTTA GTTTTTTGGC 1080 

CTAACTTTTG AGGGTCAGTT CAAAATTTGC GACTTTTAAA TGAATTCCAA TATTCAATTA 1140 

TTAAGAGTTA ACATGGTGCT TGCCAATAGG AATCATTAGA GGCGAATTGG AAATAGGGTC 1200 

ACGTATAATT TTTGCTTCAA GATTAAAGAT ATCTTTAACT AGTTTATCAT TTAGTATATC 1260 

TTCAGGCTTT CCCTCTGCAA CAAGTTTACC TTCTTTAATT GCAAATAGGT AATCAGCGTA 1320 

TCTTGCTGTT AGATTTATAT CGTGCAAAAT CATGCAAATG GTTGTCTTAT ATTTTTGGTT 1380

TAGATCAGTC AAGAGGTCTA ATAGTTCTAT TTGATATGAG ATATCCAAGT AAGTAGTTGG 1440 

CTCATCTAAA AGTAGGATAC TTGTATCTTG GGCTAGGGCT AGAGCTATCC ATACTCTTTG 1500 

CCTTTGACCC CCAGAAAGTT CTTCAACTAG GTTATTTGCT AGATCTTCAA CATTGGCCTT 1560 

AACCATTGAT CTGTTTATTA TTTCAAGGTC ATCTTTTCCA AGACTCTTAA AAGGCTTTCT 1620 

GTAGGGGAAA CGACCACGGC TTACTVAGATC AGCTACTGTT ATTGATTCAG GGATTATTGG 1680 

AGATTGAGGT AATATAGCTA TGTGTTTTGC TAAATCTTTT TCTTTATAAG AATTAATTGA 1740 
TTTATTATCA AGCAATACTT CTCCCTCTAA TGGCTTTATA AGTCGAGACA AGGTTTTAAT 1800

GAGTGTTGAT TTCCCACAAC CATTTGACCC AATAATAACT GATATTTTTT CTTCAGGTAT 1860 

TTTTATATTT ATATTTTCCA AGATTATTTT TTCATCATAA CCGCAGGTAA GATTATTTGA 1920 

CCACAGACCT TTCATTATAT ATTCCTCCTG TTCATTTTTA TTAGTAAGTA TATTAAGTAT 1980 

GGTGAACCTA ACAAGCCAGT TACAACACCT ACTGGATATC TAGCTGGTAA AATATTTTGA 2040 

GAGAATATGT CTGATAACAA AACTAGTAAA ATTCCAACCA ATCCAGCTAA TATTGGGCTT 2100 

CTTTTCTTGC CAATATTTAA GGCTATGGGA CCAGCTAAAA AAGATATACA AGCTATTGGT 2160 

CCTGTAATTG AAGTAGAAAA AGCAGTTAAA GATACAGCGC AAAAAATTAA AACAAGCCTT 2220 

GAAAGCTCGG GATTTGCTCC AAGTCCGATT GCTATTTCTT CACCAAGTTC AATAATTTCT 2280 

AGTCTTTTAT TAAAAAATAA AACTAATATA GTAGCAATAA TACTTACTAT TAGAACAAGA 2340 

GGTATGTCAT CTAACTTTGT AAAAGATAAA GAGCCACTGA GCCATCTCAT AACTTCTTGT 2400 

AATTCATATC TTGCTACTTT CAACAATAAA AATGAGGTGC CTGCTCTTGT GACAGCTTGA 2460

AAACCAATAC CTAATATTAT CAGTCTTGCT GCTGAAAAAC CATCTTTTTT AGCTAGTAAA 2520 

AATAATATTA AAGATGATGT TAGTCCACAA GTTATTGAAA TAATTCCAGT AGTTAAACTA 2580 

TTTGTTTTTA ATACCAATAT GCAAAAGACC GCTGCAATAG ATGAAGAACT TGTGACACCG 2640 

ATTATATCAG GACTTGCAAG AGGATTTCTT AACATAGT'TT GAAAGATi^JLTi TCCTGCCAAT 2700 

GCAAAAGACC AGCCAGCTAT AATTCCTGCT AATAATTTTG GTAATCTAAT TTCCATAATC 2760 

GAAAAACTAG CTCCAGGAAC AGTTTCACTA TTTAAGACTT TAATCAAAGT TGAAAAAGAA 2820 

TAACTTTCAT CTCCGATAAG TAAAATGAAA AATGATAGAC TGATTATTAT TAATAAAAAT 2880 

AGTGAGGAAA ATAGTGTTAT TCTATTTTTT CTTTTTTGAA TACCTATAAT TAAATTTTGC 2940 

ATTAGTTATT AACCCCTCTA TTTTTCATAG TTACATAAAT AAGTACTGGA CCCCCGATTA 3000 

TTGCAGTAAT TATCCCTACT TCAATTTCAC CTGGTTTACC TAACATACGG CCGATTATAT 3060 

CACATATAAG CAAGAGCTCT GCACCTATAA AAGATGAAGA AATGGTCATT GTGCGTATAT 3120 

CTTTGCTTAT AAATAAGCCA CAAAAGTGAG GAACTATAAG ACCTACGAAG CCAATAGGTC 3180 

CACCAATTGC AGTAATACTT GAACATAAAA GCACACTTGC AATTATTGCA AGTGATCTTA 3240 

TCCTATTAAC ATTAACTCCA AGACCAACAG CCATTTCATC ACCCATAGcT AAAGCGTTTA 3300 

AATCTGATGA AATAAATATA GCTATCAAGT GACCTAAAAT TATAAAAGGT AGTAGTGTAG 3360 

ATATAGAAGA TAATGTAGCT GCTCCAAGGC TACCTATTTG CCAAAATCTA AATTTGTCTA 3420 

AGACGTTATT ATTCGGTAAA ATTAAAAAAC TTACAAAACT GCTTAAAGCC ATACTAACAC 3480



AAGTTCCTGA TAAGGCAAGT TTTATAGGGG TAAGGCCTGC TTTTCCGTTA CAGCAATCGC 3540 

GTATACAAAA ATTGCACTTA CTAAGCCACC AATGATTGCG 3580 
(2) INFORMATION FOR SEQ ID NO: 210: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11378 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 210: 

CCAAATTGCT CCACAATTAT TATGGAGTCG TCGTTTGGCA GATGGGCGTG ATATGTGTGC 60 

TCAAGAATGG TTGACAGGCA AGATATTGAC CCCCTATGAT ATGAATCGTA AGCAAATCGT 120 

CAATATTTTA ACCCGTCTTC ATCGCTCACG TCCGTTGATG ACACAATTGA GTCGTTTGGG 180 

CTATGCCATG GAAACACCTG TAGATTTACT ACAGTCTTGG CAGGAAACGG CTCCAGATGC 240 

TTTGCGTAAA AATCATTTTA TCAGTGAAGT GATGGCTGAT TTACGTCAGA CTATTCCAGG 300 

ATTTAGAGAG GACCATGCGA CCATTGTCCA TGGAGATGTA CGACATAGTA ATTGGATTGA 360 

GACAGATAGT GGCTTGATTT ATTTAGTAGA TTGGGATTCG GTTCGCTTGA CCGATCGCAT 420 

GTTTGATGTG GCCCATATGC TCTGCCATTA TATTTCAGAA CATCAGTGGA AGGAATGGTT 480 

GACCTACTAC GGTTACAAGT ACAATCAAAC GGTATTAAGT AAATTGTATT GGTATGGTCA 540 

ATTGTCTTAT TTGAGTCAGA TTTCCAAGTA TTATATGAAC CAAGATTTAG AAAATGTCAA 600 

TCGGGAGATT CATGGTTTGC GTCATTTCCG AGACAAGTAT GGAAAGAGAA GATGAGAGTT 660 

AGAAATCGTA AAGGGGCAAC AGAATTACTA GAGGCAAATC CCCAGTATGT GGTCCTCAAT 720 

CCCTTGGAAG CCAAGGCAAA ATGGCGGGAC TTGTTTGGCA ATGATAATCC CATTCATGTG 780 

GAAGTTGGAA GTGGAAAGGG TGCCTTTGTT TCAGGTATGG CCAAGCAAAA CCCTGACATC 840

AACTATATCG GGATTGATAT TCAAAAGTCT GTTTTGAGCT ACGCTTTGGA CAAGGTGCTT 900 

GAAGTTGGAG TGCCTAACAT CAAGCTCTTG TGGGTAGATG GTTCTGACTT AACTGACTAC 960

TTTGAAGACG GTGAGATTGA TCGCTTGTAT CTGAACTTTT CAGATCCATG GCCGAAAAAA 1020 

CGCCATGAAA AGCGTCGTTT GACCTACAAG ACCTTCTTGG ATACCTTCAA ACGTATCTTG 1080 

CCTGAAAATG GAGAAATTCA TTTCAAGACG GATAACCGTG GCTTGTTTGA GTACAGTTTA 1140

GTGAGCTTTT CTCAATATGG CATGAAACTC AATGGTGTCT GGTTAGATTT GCATGCCAGT 1200 

GATTTTGAAG GCAATGTCAT GACAGAATAC GAGCAAAAAT TCTCAAACAA GGGGCAAGTT 1260 

ATCTACCGAG TTGAGGCAGA ATTTTAAGAG ATAACCTAAA ATTAGGCTGT ACAAGTGCTT 1320 
TTGCTTTACA TAAGTTGGCA AACGTGCTAT ACTGATAGTA AGAATATGAA AAGTGAGGCG 1380

GGGAAATATC TTCGCCTCTT GCTTATGAGG AGGTGGACGC AATCGCAACA ATCGTAGAAT 1440 

TAGTCAGAGA AGTTGTAGAA CCTGTCATAG AAGCTCCTTT TGAACTCGTG GATATCGAGT 1500 

ATGGAAAGAT TGGCAGTGAC ATGATTCTCA GTATTTTTGT AGATAAACCC GAAGAATTAC 1560 

CTTGAACGAC ACGGCAGACT TGACAGAAAT TATCAGTCCT GTCCTAGACA CCATCAAGCC 1620 

AGATCCCTTC CCAGAACAAT ATTTCCTAGA AATTACCAGT CCAGGTTTGG AACGTCCTTT 1680 

GAAAACCAAG GATGCCGTCG CTGGAGCGGT TGGAAAATAC ATCCATGTCG GGCTCTACCA 1740 

AGCCATCGAT AAGCAAAAGG TCTTTGAAGG AACCTTGTTG GCCTTCGAAG AGGACGAGTT 1800 

GACTATGGAA TATATGGACA AGACGCGTAA GAAAACCGTC CAAATTCCAT ACAGTTTAGT 1860 

ATCAAAAGCA CGTTTAGCAG TTAAATTATA GAAAAAGAAA GGATAGCTTT TGAGGATTCA 1920 

AAAGTGAAGA AAACATGAGT AAAGAAATGC TAGAGGCCTT CCGCATTTTG GAAGAAGACA 1980 

AGGGAATCAA AAAAGAAGAT ATCATCGACG CAGTAGTAGA GTCGCTTCGT TCCGCTTATC 2040 

GCAGACGCTA TGGTCAGTCA GACAGCGTAG CTATTGACTT CAACGAAAAA ACAGGTGACT 2100 

TTACAGTTTA TACTGTCCGT GAAGTTGTTG ATGAAGTATT TGATAGCCGT TTGGAAATCA 2160 

GCTTGAAAGA TGCTCTTGCC ATTAATTCAG CTTATGAACT TGGAGACAAA ATCAAGTTTG 2220 

AAGAAGCACC AGCTGAGTTT GGTCGTGTAG CAGrcCAATC TGCCAA^.C«-A ACCATCATGG 2280

AAAAAATGCG CAAgCAAACA CGTGCCATCA CTTACAATAC TTACAAAGAA CATGAGCAAG 2340 

AAATCATGTC TGGTACAGTA GAACGCTTTG ACAACCGCTT TATCTATGTC AACCTTGGTA 2400 

GCATCGAAGC CCAATTGTCA AAACAAGACC AAATTCCTGG AGAAGTTTTT GCTTCTCATG 2460 

ATCGTATCGA AGTTTATGTT TACAAGGTTG AAGACAACCC TCGTGGTGTG AACGTCTTTG 2520 

TTAGCCGTAG TCATCCAGAA ATGATCAAAC GTTTAATGGA GCAAGAAATT CCAGAAGTTT 2580

ATGATGGAAC TGTTGAAATC ATGAGCGTGG CTCGTGAAGC AGGTGACCGT ACGAAGGTTG 2640 

CTGTTCGTAG CCACAATCCA AACGTGGATG CTATCGGTAC AATCGTTGGA CGTGGTGGTG 2700 

CTAATATCAA GAAGATTACT AGCAAATTCC ACCCAGCTCG TTACGATGCT AAAAATGACC 2760 

GCATGGTACC AATCGAAGAA AATATCGATG TTATCGAGTG GGTAGCAGAT CCAGCTGAAT 2820 

TTATCTACAA TGCCATCGCT CCTGCTGAGG TTGACCAAGT TATCTTTGAT GAAAACGACA 2880 

GCAAACGTGC CTTGGTGGTT GTTCCAGATA ACAAGCTTTC TCTTGCCATT GGTCGTCGTG 2940 

GACAAAACGT GCGCTTGGCG GCTCACTTGA CTGGTTACCG TATCGATATC AAGTCTGCTA 3000 

GCGAATTTGA AGCCATGGAA GACGCTGCTT CAGTAGAGTT GGAAGTAGAA AACGATACTG 3060 
TAGAAGAATA AAAGCTGCTA GAGGAGGGAA AGATGAAAAC AAGAAAAATC CCTTTGCGCA 3120

AGTCTGTTGT GTCTAACGAA GTGATTGATA AGCGTGATTT GCTCCGCATT GTCAAGAACA 3180 

AGGAAGGACA AGTCTTTATT GATcCTACGG GCAAGGCCAA TGGCCGCGGC GCTTATATCA 3240 

AACTAGACAA TGCAGAAGCC CTAGAGGCGA AAAAGAAGAA GGTCTTTAAC CGCAGCTTTA 3300 

GCATGGAAGT GGAAGAAAGC TTTTATGACG AGTTGATCGC TTATGTGGAT CACAAAGTGA 3360 

AAAGAAGAGA GTTGGGACTT GAATAAGCAA AAGATAAGTA ATCTCTTGGG GCTTGCTCAG 3420 

CGAGCAGGGC GCATCATATC GGGTGAAGAA TTGGTGGTCA AGGCCATTCA AGACGGCAAG 3480

GCCAAGTTGG TCTTTCTAGC TCATGATGCT GGACCCAATC TGACCAAGAA GATTCAAGAT 3540 

AAAAGTCATT ATTATCAAGT AGAAATTGTA ACCGTGTTTT CAACACTGGA ATTAAGCATA 3600 

GCAGTCGGGA AATCGAGAAA GGTTTTGGCT GTAACAGATG CTGGATTTAC AAAGAAAATG 3660 

AGGTCTCTTA TGGAATAGAA GAGGAGGACA TGATTTGTCT AAGAAAAGAT TGTACGAAAT 3720 

CGCAAAAGAA CTTGGAAAAG AAAGTAAAGA AGTTGTAGCG CGTGCAAAAG AGTTGGGCTT 3780

GGATGTGAAA AGCCACTCAT CAAGTGTGGA AGAAGCTGTC GCTGCAAAAA TTGCTGCCAG 3840 

CTTTAAGCCT GCAGCTGCTC CGAAAGTAGA AGCAAAACCT GCAGCCCCAA AAGTAAGTGC 3900 

AGAAAAGAAA GCCGAAAAAT CTGAGCCAGC TAAACCAGCT GTAGCTAAGG AAGAGGCAAA 3960 

ACCTGCAGCC CCAAAAGCAA GTGCAGAAAA GAAAGCCGAA AAGTCTGAAC CAGTAAAACC 4020 

AGCTGTAGCC AAGGAAGAGG CAAAACCAGC TGAGCCAGTC ACTCCGAAAA CAGAAAAAGT 4080 

AGCGGCTAAA CCGCAAAGTC GTAATTTCAA GGCTGAGCGT GAAGCACGTG CTAAAGAGCA 4140 

GGCAGAGCGA CGCAAGCAAA ATAAGGGCAA TAACCGTGAC CAACAACAAA ACGGAAACCG 4200 

TCAGAAAAAC GACGGCCGTA ATGGTGGAAA ACAAGGTCAA AGCAACCGCG ACAATCGTCG 4260

CTTTAATGAC CAAGCTAAGA AGCAGCAAGG TCAGCAAAAA CGTAGAAATG AGCGCCGTCA 4320 

GCAAGAGGAT AAACGTTCAA ATCAAGCGGC TCCACGTATT GACTTTAAAG CCCGTGCAGC 4380 

AGCCCTAAAA GCAGAGCAAA ATGCAGAGTA CGCTCGTTCA AGTGAGGAAC GCTTCAAGCA 4440 

GTATCAGGCT GCTAAAGAAG CCTTGGCTCA AGCTAACAAA CGCAAGGAAC CAGAGGAAAT 4500

CTTTGAAGAA GCGGCTAAGT TAGCTGAACA AGCACAGCAA GTTCAAGCAG TGGTTGAAGT 4560 

CGTCCCTGAG AAAAAAGAAC CTGCAGTGGA TACACGTCGT AAAAAACAAG CTCGACCAGA 4620 

CAAAAATCGT GACGATTATG ATCATGAAGA AGATGGTCCT AGAAAACAAC AAAAGAATCG 4680 

AAGTAGTCAA AATCAAGTGA GAAATCAAAA GAATAGTAAC TGGAATAACA ACAAAAAGAA 4740 

CAAAAAAGGC AATAACAAGA ACAACCGTAA TCAGACTCCA AAACCTGTTA CGGAGCGTAA 4800 

ATTCCATGAA TTGCCAACAG AATTTGAATA TACAGATGGT ATGACCGTTG CGGAAATCGC 4860
AAAACGTATC AAACGTGAAC CAGCTGAAAT TGTTAAGAAA CTTTTCATGA TGGGTGTCAT 4920

GGCCACACAA AACCAATCCT TGGATGGGGA AACAATTGAA CTCCTCATGG TGGATTACGG 4980 

TATCGAAGCC AAACAAAAGG TTGAAGTGGA TAATGCTGAC ATCGAACGTT TCTTTGTCGA 5040 

AGATGGTTAT CTCAATGAAG ATGAATTGGT TGAGCGTCCA CCAGTTGTTA CTATCATGGG 5100 

ACACGTTGAC CACGGTAAAA CAACCCTTTT GGATACTCTT CGTAACTCAC GTGTTGCGAC 5160 

AGGTGAAGCA GGTGGTATTA CTCAGCATAT CGGTGCCTAC CAAATCGTGG AAAATGGTAA 5220 

GAAGATTACC TTCCTTGATA CACCAGGACA CGCGGCCTTT ACATCAATGC GTGCGCGTGG 5280 

TGCTTCTGTT ACCGATATTA CGATCTTGGT CGTAGCGGCA GATGACGGGG TTATGCCTCA 5340 

GACTATTGAA GCCATCAACC ACTCAAAAGC AGCTAACGTT CCAATCATCG TAGCTATTAA 5400 

CAAGATTGAT AAACCAGGTG CTAACCCAGA ACGCGTTATC GGTGAATTGG CAGAGCATGG 5460 

TGTGATGTCA ACTGCTTGGG GTGGAGATTC TGAATTTGTT GAAATTTCGG CTAAATTCAA 5520 

CCAAAATATC GAAGAATTGT TGGAAACAGT CCTTCTTGTG GCTGAAATCC AAGAAOTCAA 5580 

AGCAGACCCA ACAGTTCGTG CGATCGGTAC GGTTATCGAA GCGCGCTTGG ATAAAGGAAA 5640 

AGGTGCGGTC GCAACCCTTC TTGTACAACA AGGTACCTTG AATGTTCAAG ACCCAATCGT 5700 

TGTCGGAAAT ACcTTCGGTC GTGTCCGTGC TATGACCAAC GACCTTGGTC GTCGTGTTAA 5760 

AGTTGCTGGA CCATCAACAC CAGTCTCTAT CACAGGTTTG ?J^CCAAGCAC CGATGGCGGG 5820

TGACCACTTT GCCGTTTACG AGGATGAAAA ATCTGCGCGT GCAGCAGGTG AAGAGCGTGC 5880 

CAAACGTGCC CTCATGAAAC AACGTCAAGC TACCCAACGT GTTAGCCTTG AAAACCTCTT 5940 

TGATACCCTT AAAGCTGGGG AACTCAAATC TGTTAATGTT ATCATC7VAGG CTGATGTACA 6000 

AGGTTCTGTT GAAGCCCTTT CTGCCTCACT TCAAAAGATT GACGTGGAAG GTGTCAAAGT 6060 

GACTATCGTC CACTCAGCGG TCGGTGCTAT CAACGAATCA GACGTGACCC TTGCCGAAGC 6120 

TTCAAATGCC TTTATCGTTG GTTTCAACGT ACGCCCTACA CCACAAGCTC GTCAACAAGC 6180 

AGAAGCTGAC GATGTGGAAA TCCGTCTTCA CAGCATTATC TACAAGGTTA TCGAAGAGAT 6240 

GGAAGAAGCT ATGAAAGGGA TGCTTGATCC AGAATTTGAA GAAAAAGTTA TTGGTGAAGC 6300 

GGTTATCCGT GAAACCTTCA AGGTGTCTAA AGTGGGAACT ATCGGTGGAT TTATGGTTAT 6360 

CAACGGTAAG GTTGCCCGTG ACTCTAAAGT CCGTGTTATC CGTGATGGTG TCGTTATCTA 6420 

TGATGGTGAA CTCGCAAGCT TGAAACACTA TAAAGACGAC GTGAAAGAAG TGACAAACGG 6480 

TCGTGAAGGT GGATTGATGA TCGACGGCTA CAATGATATT AAGATGGATG ATGTGATTGA 6540 

GGCGTATGTC ATGGAAGAAA TCAAGAGATA AGATTTTTTG CTCCTTTCTT AGGTGGTGAG 6600 



GGACGCAAGC AAACCGATGG TTTCATTGCT TATTTTTGAG CCTAGGGTCT CAAAAATCCC 6660 

CTGTGATGGG ACTGATAAAT CAGTTCCATC ACTTTCACCA CGGCGAAAGA AGCAGATGAC 6720 

TTCAAATTGA ACTTCGTTTC AATTTAAACT GAAAATCAAG AAGTTTAAAA TAGCTAGGTC 6780 

TGCTGGCCTA GCTTTTGGTT CAAAGTAGAG AAAGGAATAT CATGGCAAAT CATTTCCGTA 6840

CAGATCGTGT GGGCATGGAA ATCAAGCGTG AAGTCAATGA GATTTTGCAA AAGAAAGTCC 6900 

GTGATCCACG TGTCCAAGGT GTGACCATCA TAGATGTTCA GATGCTGGGT GACTTGTCTG 6960 

TTGCCAAGGT TTATTACACC ATTTTGAGTA ACCTTGCTTC GGATAACCAA AAAGCCCAAA 7020 

TCGGGCTTGA AAAAGCAACT GGTACCATCA AACGTGAACT TGGTCGCAAT TTGAAATTGT 7080 

ACAAAATCCC AGATTTGACC TTCGTCAAAG ACGAQTCCAT CGAGTATGGA AACAAGATTG 7140 

ACGAGATGCT ACGCAATCTG GATAAGAACT AAAGAAGAGG GGTTGCCCCT CTTTTTTGGT 7200 

GGAGGAAAAT AGGTTGAATT TGAAATGGAA AAATATTCTT TTATAATAGA TTGAAACTAG 7260 

AATAGTACGC CTCTACTTCT AAAATATTGT TAGAAATCGA TTTGACTGTC CTGATCGATT 7320 

TGTCCTGTTC TTGTTTCATT TTAATATAAA AAAGGGATTC TGTATTTTTT AATGTTATCT 7380 

AATTAGAAAA TGCTTTTTTT GTAGGAAATA TAATATGATA AGGTGCAAAA AAGAAATAAG 7440 

GAGTTTGTAT ATGGCTGAAC AAGACTTAGC TATGCAAGTA TTGCAACAAG TGGTGAAACT 7500 

ACCTGTTGTT AAGGTTGATC GTTCGAAATT TTTAGTGGAT AAGTTTTCCA AAGAATTGGA 7560 

TCCAAAAGAT ATTCCTACCT TATTGGAACA AGGTCCAACG ACTCTTCTAT CTCAAGAAAT 7620 

ATTAGATCGT GTAGCTAATG CTTGTATTCG GGACAATGTA TTATTAGCGA GTGGGACTTC 7680 

TGTTTTGGCA GGATTACCTG GAGGGCTTGC TATGGCAATT ACCATTCCAG CTGATGTGGC 7740 

TCAATTTTAT GCTTTCTCTC TGAAATTGGC TCAAGAATTA GGTTATATTT ATGGTTATGA 7800 

GGATCTTTGG GCTTCACGAG AGGAGTTGAG TGAAGATGCT CAAAATACCC TCTTGCTTTA 7860 

TCTAGGCGTA ATGTTAGGGG TGAATGGAAC CGCTGCTTTG CTACGTGTTG GTAGTATAAC 7920 

AATTGCCAAA CAGGTAATGA AAATAGTGCC TAATAAAGCT TTAACAAAGA CGCTTTGGTA 7980 

CCCTATTTTG AAAAAAGTCT TAAAAATATT TGGTGTGAAT CTTACCAAGG GAGGGTTGGC 8040 

CAAAGGAATG GGGAAATTTA TTCCTATCTT GGGTGGTATC ATTTCAGGTG GTTTAACCTT 8100 

TGCAACTATG AAACCAATGG GGGAAAGCTT GCAGAAAGAA TTATCCAAGC TAGTCAACTA 8160 

TAGTGAAGTT CAATATCAAG AAGATGTTGA AACAATCCGA AAAGAGGCTG AAATCATCAA 8220 

AGGAGAGTAA TATGAATCCT ATCAAAGCTT TTGCTAAAAT TTATGGTAAT TACTTTTTGA 8280 

CCGTGCAAGG TGTAAAAGTG ATGAAAACGA TAAAGAAAGC TGACCATGTC GTTGTTGGTC 8340 

TGGGGAAACT TTTTATTGCC GACAAGTTAA TGGATACGGC TCGGTGGCTC ATTAAGCCAG 8400

AGGAGAGAGA ATGAAATTTT TTTGGTCTTC TTGCTATTCT TTTTATCAAA CCGATTATTG 8460

GGATTGTGAA ATTCTTTTGG ATGATCATCT CTTTTGCAGT CCAATTGCTG TTTTACAAGA 8520 

TAGTGTTTAA GATATTGGAT TGGCTCTTTA AACTTATCTA GATGGTAATC CAAGTTGCAG 8580 

AGAACTAGCA GGAACTCCAC TGCTAGTTTT TTATTCTCTT TCCATATGGT ATAATATAAG 8640

CAGTAAAATC ATTTTATACT CTTCGAAAAT CTCTTCAAAC CACGTCAGCT TCACCTTGCA 8700 

GTATATATGT TACTGACTTC GTCAGTTCTA TCCACAACCT CAAAACGGTG TTTTGAGCTG 8760 

ACTTCGTCAG TTCTATCTAC AACCTCAAAA CACTGTTTTG AGCAACCTGC GGCTAGCTTC 8820 

CTAGTTTGCT CTTTGATTTT CATTGAGTAT TAGAACATAC AATGGAGGTC GTCATGGACA 8880 

ATATCATCGA TGTGTCAATT CCTGTTGCAG AAGTGGTGGA CAAGCATCCA GAAGTCTTGG 8940 

AAATTCTAGT GGAGTTGGGT TTTAAACCCC TTGCCAATCC CTTAATGCGC AATACAGTTG 9000 

GTCGTAAAGT ATCACTTAAA CAGGGTTCTA AGCTAGCAGG AACTCCTATG GACAAGATTG 9060 

TACGCACACT GGAAGCGAAT GGCTACGAAG TGATTGGATT AGACTAATGA CAGATGAACG 9120 

GATTCATATC CTACGGGATA TTTTGTTAGA ATTGCACAAT GGCGCCTCTC CTGAGTCGGT 9180 

TCAAGATCGC TTTGATGCGA CCTTTACGGG CGTGTCAGCC ATCGAGATTT CCCTTATGGA 9240 

GCACGAGCTG ATGAACTCGG ATTCGGGCGT CACTTTTGAA GATGTTATGG AACTCTGTGA 9300

TGTCCATGCC AATCTTTTTA AAAATGCTAT CAAAGGTGTC G.'^J^GTTTCAG ATACTGAGCA 9360

TCCAGGTCAC CCAGTTCGTG TCTTCAAAGA AGAAAATCTG GCTCTCCGTG CGGCCTTGAT 9420 

TCGCATTCGT AGATTGTTAG ATACCTATGA GTCTATGGAA GACGAGGAAA TGCTGGCGGA 9480 

GATGCGTAAG GGTTTGGTGC GTCAGATGGG ACTTGTGGGT CAATTTGACA TCCATTACCA 9540 

ACGTAAGGAA GAACTCTTCT TTCCTATCAT GGAGCGCTAT GGACACGATT CACCTCCCAA 9600 

AGTTATGTGG GGAGTGGATG ATCAGATTAG GGAACTCTTT CAAACAGCTC TAACGACAGC 9660 

CAAGTCACTA CCAGAAGTGT CAATTAGCAG TGTAAAGGAA GCTTTTGAAG CTTTTGCGAC 9720 

AGAGTTTGAA AGTATGATTT TCAAGGAAGA GTCCATCCTC CTCATGATTC TCCTTGAGTC 9780 

TTTTACTCAG GATGACTGGC TTCAGATTGC GGAGGAGAGC GATGCCTATG GCTATGCCAT 9840 

CATCCGTCCG TCAGAGAAAT GGGTGCCAGA ACGACAGAGC TTTATTGAGG AAAAGATTGC 9900 

AGAGGAGCCT GTACAGCTAG ATACGGCAGA AGGTCAAGTT CAACAAGTCA TAGATACGCC 9960 

AGAAGGCCAT TTTACCATTA CCTTTACCCC TAAGGAAAAG GAAGCTGTGC TGGACCGCCA 10020 

TAGTCAACAG GCTTTTGGTA ATGGCTATCT TTCAGTCGAG CAGGCCAATC TCATCCTCAA 10080 

TCATCTCCCT ATGGAGATTA CCTTTGTCAA TAAAGAAGAT ATTTTCCAGT ATTACAATGA 10140 
CAATACGCCA GCTGATGAGA TGATTTTCAA ACGGACGCCG TCCCAAGTCG GGCGCAATGT 10200

CGAACTCTGC CATCCGCCTA AGTACTTGGA CAAGGTCAAA ACTATCATGA AGGGGCTTCG 10260

TGAGGGAAGC AAAGACAAGT ATGAAATGTG GTTCAAGTCT GAGTCGCGAG GTAAGTTTGT 10320

CCACATCACC TATGCTGCAG TACACGATGA AGACGGAGAA TTCCAAGGAG TGTTGGAGTA 10380

TGTTCAGGAT ATCCAGCCCT ACCGTGAGAT TGATACGGAC TATTTTCGTG GATTAGAATA 10440

AGGAGAAAAA ATGAGTTACG AACAAGAATT TATGAAGGAA TTTGAAGCTT GGGTCAATAC 10500

CCAAATCATG ATTAACGACA TGGCGCACAA GGAAAGCCAA AAAGTTTACG AAGAAGACCA 10560

GGACGAGCGT GCCAAAGATG CCATGATTCG CTACGAGAGT CGCTTGGATG CTTATCAGTT 10620

CTTGCTTGGT AAGTTTGAAA ACTTCAAAGT AGGCAAGGGA TTCCATGATT TGCCAGAAGG 10680

CTTGTTTGGT GAGCGAAATT ATTAAACGAG AAAGATTCTT GATTTTTCAC TAAAATCTTG 10740

ATAGAATGTT TATGTTAAAT CCTTGTCAGA GCAGGGATTT TTTATTGAAA GGATTTTATC 10800

ATGTCAAAGA AACTCAATCG TAAAAAACAA TTACGAAATG GCCTCCGTCG CGCAGGTGCC 10860

TTTTCAAGTA ta AL) AL. AAAAA AAGTCGTGAA GCGTGCAGAA 10920

CAGTCAGCAA GCGCAGCTGG TAAGGCTGTT TCTAAAAAAG TTGAACAAGC AGTAGAAGCT 10980

ACCAAAGAGC AAGCTCAAAA AGTAGCTAAT TCTGTAGAAG ATTTTGCAGC AAATTTGGGT 11040

GGACTTCCAC TTGATCGTGC CAAGACTTTC TATGATGAAG GAATCAAGTC TGCTTCAGAT 11100

TTCAAAAACT GGACTGAAAA AGAACTCCTT GCCTTGAAAG GAATCGGCCC AGCTACCATC 11160

AAGAAATTGA AAGAAAATGG CATCAAGTTC AAGTAATTTT TCTTGAGCCT TGCATTTCCG 11220

AAAAAATCTT GCTACAATAG AGCCATTAGA GGTGTTTTGA ATCCCACATT TTACAGAAAG 11280

TGGCGGCGCT GAGAAGTCCA CAAATGTGTC AAAACTGGTT GCTAATGGAT GAAAAATTGA 11340

AATAAAAGTG TCTTTTTGCT TTAAAGACGA GAGTTGCG 11378



(2) INFORMATION FOR SEQ ID NO: 211: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4156 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 211: 

CCGCGAGCCA CGGCGAATTT GCTGCGGGTA TTCATCAGTC AGGATCTATG ATCTTTGGTG 60 

AACAAGAAAA GGTTCAAGTT GTGACCTTTA TGCCAAATGA AGGTCCTGAT GATCTATACG 120 

CTAAGTTTAA TAACGCTGTT GCTGCATTTG ACGCAGAAGA TGAGGTTCTA GTTTTGGCTG 180 
ACCTTTGGAG TGGTTCTCCA TTTAACCAAG CTAGTCGCGT GATGGGAGAA AATCCTGAGC 240

GTAAGTTTGC CATCATCACA GGACTTAACT TACCGATGTT GATTCAAGCC TACACAGAGC 300

GCCTCATGGA CGCTGCTGCA GGTGTAGAAA AAGTCGCTGC TAATATCATT AAAGAAGCCA 360

AAGATGGCAT CAAAGCTCTT CCAGAAGAGC TAAATCCAGT CGAAGAAGTT GCAAGCGCTG 420

CAGCTGCTCC AGTTGCCCAA ACTGCTATCC CAGAAGGAAC TGTTATCGGA GACGGTAAAT 480

TGAAAATCAA TCTTGCCCGT CTTGACACAC GTCTACTTCA CGGTCAGGTT GCAACTGCTT 540

GGACTCCAGA TTCAAAAGCA AATCGTATCA TCGTTGCTTC AGATAACGTG GCTAAAGACG 600

ACCTTCGTAA AGAATTGATT AAACAAGCAG CTCCAGGTAA TGTCAAGGCT AACGTGGTTC 660

CAATTCAAAA ACTGATTGAG ATTTCAAAAG ACCCACGTTT TGGAGAAACA CATGCCCTTA 720

TCTTGTTTGA AACACCTCAA GATGCCCTTC GTGCCATCGA AGGCGGCGTG CCAATCAAGA 780

CTCTTAATGT TGGTTCTATG GCTCACTCAA CAGGTAAAAC ATTGGTCAAT ACCGTTTTGT 840

CTATGGACAA AGAAGACGTT GCTACATTTG AAAAAATGCG TGACTTGGGT GTTGAATTTG 900

ATGTCCGTAA AGTACCAAAT GATTCTAAAA AAGATTTGTT TGACTTGATT AACAAAGCCA 960

ATGTCAAATA AGCCATTATT TATGAAAGGA TTTTAAACAT GTCTATTATT TCTATGGTTT 1020

TAGTAGTCGT TGTAGCCTTC TTTGCAGGTC TTGAAGGCAT CCTCGACCAG TTCCAATTTC 1080

ACCAACCACT TGTAGCCTGT ACCCTTATTG GGCTTGTAAr .'IGGTCACTTG GTiAGCAGGGA 1140

TTATCCTCGG TGGATCGCTT CAAATGATTG CCCTTGGTTG GTCAAATATC GGTGCTGCTA 1200

TCGCTCCTGA TGCTGCACTT GCTTCTGTCG CTGCTGCCAT TATCATGGTT CTTGGTGGTG 1260

ACTTTACCAA GACTGGTATC GGTGTTGCCC AAGCGGTTGC TATCCCTCTT GCTGTAGCTG 1320

GACTTTTCTT GACAATGATT GTTCGTACAA TTTCAGTTGG TTTGGTTCAT ACTGCAGATG 1380

CTGCCGCTAA AAAAGGTGAC TTCGGCGCTG TGGAGCGTGC GCATTTCATC GCGCTACTTT 1440

TCCAAGGACT TCGTATCGCG CTTCCTGCAG CTCTTCTCCT TATGGTACCA ACTGAAACTG 1500

TACAAAGTAT CCTTAGTGCC ATGCCAGACT GGCTCAAAGA TGGTATGGCT ATCGGTGGTG 1560

GTATGGTCGT TGCCGTTGGT TACGCCATGG TTATCAACAT GATGGCAACT CGTGAAGTAT 1620

GGCCATTCTT CGCTCTTGGT TTCGTTCTCG CTGCTGTGTC AGATATTACT CTAATCGGAT 1680

TCGGTGCTAT CGGCGTTGCT ATCGCTCTTA TCTACCTTCA CCTTTCTAAA ACTGGTGGAA 1740

ATGGTGGCGG AGGAGCCGCA ACTTCTAACG ACCCAATCGG CGATATCCTA GAAGACTACT 1800

AAGATAAGAA AGGACTGAAA ACATCATGAC TGAAAAACTT CAATTAACTA AATCAGATCG 1860

TAAAAAAGTT TGGTGGCGTT CAACCTTCTT ACAAGGGTCT TGGAACTTTG AACGGATGCA 1920

AAACTTGGGC TGGGCTTATA CACTCATTCC AGCTATCAAA AAACTCTATA CTAAAAAAGA 1980

AGATCAAATC GCTGCTCTTG AGCGTCACCT TGAGTTCTTC AACACTCATC CATACGTAGC 2040 

TGCTCCAGTC ATGGGGGTTA CTCTTGCGCT TGAAGAAGAA CGTGCTAACG GTGTGGAAAT 2100 

CGATGACGCT GCTATCCAAG GGGTTAAAAT CGGTATGATG GGACCTCTTG CTGGTATCGG 2160 

TGACCCAGTA TTCTGGTTTA CAGTACGCCC AATCCTTGGA TCTCTCGGTG CTTCACTTGC 2220 

CCTTACTGGC AATATCTTGG GGCCACTCCT CTTCTTTGTT GCATGGAACT TGATTCGTAT 2280 

GTCATTCTTG TGGTATGTTC AAGAGATTGG ATACAAGGCT GGATCAGAAA TCACTAAAGA 2340 

TATGTCTGGT GGTATCCTTC AAGATATCAC TAAAGGAGCT TCTATCCTTG GGATGTTCAT 2400 

TCTTGCTGTC CTTGTTCAAC GCTGGGTAAA TATTAAATTT GCTTTCGATG TTTCTAAAGT 2460

TCAACTAGAT GAAAAGGCTT ATATCCATTG GGATAAATTG CCAGAAGGGT CTAAAGGTAT 2520 

CCAAGAAGCA TTCGCACAAG TAGGACAAGG ATTGTCTCAA ACTCCTGAAA AAGTTACTAC 2580 

TTTCCAACAA AACTTGGATA TGTTGATTCC TGGATTATCA GGACTACTCC TTACTTTACT 2640 

TTGCATGTAC TTACTTAAGA AAAAAGTATC TCCAATCACT ATTATCCTTG CCCTCTTCGC 2700 

AGTGGGTATT GTGGCACATG TTCTTCACAT CATGTAATCA AGCAACTAAA AAGGAACCAG 2760

GTTCTAAAAT CTGATTCCTT TTTTCTATGC TTTTATTCAG CCAAGGCTCC CATTGGATCC 2820 

CATGGTGCAA GTACGATTGG TTCTGCTCCA TAGGCAGCTT GTTCTTCTGC TGTCAGCAAT 2880 

■TCC-rrACGAA CAACGATTTG GTATGTGTAT TCGTCCATCC AAGCGTCTGA GGCAACAAAG 2940 

TAACCATCTG TACCGACCTT GTCTCCCCAT GAGTTTTCAA CCTTCCACTT GGTTGATTTA 3000 

CCATTTTCGT CCAAGTCAAC ACCTGTCAAG ACCATGGCGT GGGTCATCAA GCTTTCACTA 3060

TAGTCCAAAC GTCCAGCCTT GTCTTGAGTA AGTTTAATGT CCATGCTTGA TTCAAAGTCA 3120 

TAAACATCTG TCGCAAGGAT GCCAGCTTAC GGTTGCTGAG CTGGCCGACA TCAGAACCAA 3180 

ACCAAACAGT CTCACCTGCT TGCATTTGGG CAATCGCCAA TTCTTTCAAG CGCTCCATTG 3240

GAACGTTGAT GTAGCGAACT GCACGGCTAC CAACCACATT CCCCAACATC TCAACTGTGT 3300 

AAGATTTTCC GTAAGGTTTA TCAGCAGTTG GAGCATTGAT AACAGAAACG TAGTCTTCTA 3360 

AAGGAAGATT GACATATTTC TTGTAAAACT CTTGTGGTGT GATTCCTTTT TCACTTTTGT 3420 

AGTTGTTATC TTTATCGCGA TAAGCAAAGT CAAACTTGCG TGGTGGAAGT CCTAATGACA 3480 

TAGCAAGAAA GTTAAAGATT TCTTGCAAGA GGTCTTCTTT CTTAGCTTGA ACAGTCGCTT 3540 

GATCTGCACC AGAAACAAGC AAGTCACGCA AGATTTGAGC ATCTTGACGA AGCAATTTAT 3600 

TAAGGATCGC ATTTAGCTCA CGACTGCTGC TAGATGAAAC AGACTCAGGA TAAACTGACT 3660

TAGGCACGAC ACCGTATTTT TCAAAGAGGG AAACGACCAT ATCCCATTGA CCGCCATCTT 3720 
GTTGAGGTGT TTGGAGTAAG AAGCTAACtT GCGGCTAGTC AATTCTTGGT CTGAAGTCGC 3780

AATGACTTGC TCCAAGAACC AGTTTGATTT CTCATACTTA TCCCAGAAGA AAGTGTGGGC 3840 

TTGTGACAAC TCAAAGTTCT CCAATTTGTA TTGCGAGATG AGTTTGTGGC GGAAGGTGTT 3900 

GAGAGCCGCA AACATCCAGC AACGACCAGA CGCTTTCTGG TTAGTGACCT TGTCCTTGGT 3960 

TAAATCCAAT GAGAAAACAG GTGTGTTGTC TACATGGCTT TGGCGACGTT CCAGAGCTGC 4020 

AAAAATTCCG TTGTGGCTGG CAGCATTTTC AATCGCTTGG TATTTTACAT TTGCTTCATA 4080 

GTTGGCAAAT AGTTTATCAG TAAATGATTC TTGAATCGCG TTCATAGATT CCTCCTTTTA 4140 

GTCTACAGTG TATTGG 4156 



(2) INFORMATION FOR SEQ ID NO: 212: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3902 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 212: 

AAAAACAACA AAATAAAACA AAAACAAAAA TATCGAGGTT TATTTTCAAA ACTTTCGATA 60 

TTTTTATTAA GTTATTATTT TGTTGTTTCT AGTTTACTTT TTCATGGTTA AGAGTGGTGG 120 

AGAATTATAC TCAATGAAAA TCAAAGAGCA AACTAGGAAG CTAGCCGCAG GCTGTACTTG 180 

AGTACGGCAA GGCGAAGCTG ACGTGGTTTG AATTTGATTT TCGAAGAGTA TTAGTGCAAA 240 

CCGTAGTTGT AGTCATCATC TTGCATGGCT TCAACTTCGC CAAGAAGGTA ACCATTTCCG 300 

ACTTGAGAGA AGAAGTCATG GTTGGAAGTT CCTGTTGAAA TACCGTTCAT AACGATTGGG 360 

TTGACATCTT CAGCTGAATC TGGGAAAAGT GGATCTTGTC CCATGTTCAT GAGAGCTTTA 420 

TTGGCATTGT AGCGAAGGAA GGTTTTAACC TCTTCAGTCC AACCAACACC GTCATAAAGA 480 

CTCTCTGTGT AGCCTTCTTC ATTTTCATAA AGAGTATAGA GTAGGTCGTA CATCCATTCT 540

TTGAGTTTTT CTTGCTCTTC TTCAGGTAAT TCATTGAAAC CAAGTTGGAA TTTGTAACCA 600 

ATGTAGGTTC CGTGAACAGA CTCGTCACGA ATAATCAATT TAATGATTTC TGCAACGTTG 660 

GCAAGTTTGT TGTTACCGAG ATAGTAGAGG GGAGTGAAGA AACCAGAGTA GAAGAGGAAG 720 

GTTTCGAGGA AGACGCTGGC AACTTTCTTT TCAAGTGGGC TGCCGTTTAG GTAGATTTCG 780 

TTGACT^TCT CAGCCTTCTT TTGTAGGTAA GGATTGGTAT TGGTCCATTC GAAAATTTCT 840 

TCAATCTCAG CCTTAGTATT CAAGGTAGAA AAGATTGATG AGTAAGATTT AGCGTGGACA 900 
GATTCCATAA ATTGGATGTT ATTGAAGACA GCTTCCTCAT GTGGTGTACG GATGTCTGCG 960

CGAAGGGCTT GAACCCCAGT TTCAGATTGC ATAGTGTCAA GAAGGGTTAA ACCACCAAAA 1020 

ACTTTTCCCA CCAAGTCTTT CTCTTTGTTA GATAGCTTTC TCCAGTCATC CAAGTCGTTT 1080 

GATT^GGGAA TACGTGTATC GAGCCAAAAT TGCTCCGTCA GTTTTTCCCA AGTTGATTTG 1140 

TCGATGACAT CTTCGATGGC ATTCCAGTTA ATGGCTTTGT AGTAAGTTTC CATTTAAAAT 1200 

CTCTTTCTGT GTTTAGTATT GCGAACTCAC AATTATTTCT ACTTTACCAT AATTCTATAG 1260 

GAGTATCGCA CAAAAAGTCG GAAGCCCGAC TTTTAAAATG TTACATAAAT TATGTTATGA 1320 

CATAGTAGAT TTGATTTTAT CAGTGCTGCT TAGGGAAAAA TAGTGTTTCT ATGCTAGAAA 1380 

CTAAATCACA CAGCTTTCAC ATTGGTTGGC GCCGACTTCT CCACCGTCAT CTGTAAAGGT 1440 

ACGGACGTAG TAGATAGACT TGATTCCCTT GTTAAAGGCA TAGTTACGAA GGATGGACAA 1500 

GTCACGTGTC GTTTGTTTAT TTTCCCTCTT CCATTCGTAA AGGCCTTTTG GAATGTCACT 1560 

GCGCATGAAG AGGGTGAGTG AAAGTCCTTG ATCCACGTGT TCAGTCGCAG CAGCGTAAAC 1620 

ATCGATGACT TTACGCATAT CCATATCGTA GGCAGAAGTG TAGTAAGGAA TGGTTTCTGT 1680 

AGACAAGCCA GCAGCAGGGT AATAGATTTT ACCAATTTTC TTCTCTTGGC GTTCTTCGAT 1740 

ACGTTGCGTA ATCGGGTGGA TAGAAGCAGA AACGTCGTTG ATATAGCTGA TAGAACCATT 1800 

TGGCGCTACA GCAAGGCGAT TTTGGTGGTA AAGACCATCT TCTTGAACCT TGTCGCGAAG 1860 

TTCAGCCCAA TCAGCAACAC CAGGGATAAA GACATTTTTG AAGAGTTCTT TAACACGGTC 1920 

TGATGTTGGA ACAAATTCAC CAGTTACATA CTTGTCAAAG TAACTTCCGT TAGCATAGTC 1980 

TGATTTTTCA AAGTTGTGGA AGGTAATACC ACGTTCACGT GCAATATTGT TTGACTCTAC 2040 

CAAGGTCCAG TAGTTCATAA GCATAAAGTA GATGCTTGTA AATTCAACAG ACTCAGGTGA 2100 

ACCATATTCA ATGAGTTGTT GGGCAAGGTA GCTGTGCAGT CCCATGGCAC CGAGACCAAA 2160 

GGTGTGGGCT TGGCTATTTC CATGGTCAAT CGTTGGTACA GCTACGATAT GTGAACTATC 2220 

TGTAACGAAA GTAAGGGCAC GAACCATAGC ACGGATAGAA CGACCAAAAT CAGGTGAAGT 2280 

CATCATGTTA ACCACGTTGG TTGAACCCAG GTTACATGAA ACATCTGTTC CCATTTGAAG 2340

GAATTCTTGA GCATCGTTGA TCAAGCTTGG TTCTTGAACT TGAAGAATCT CAGAACACAA 2400 

GTTACTCATG ATAATCTTTC CATCAACAGG ATTTGCACGG TTAGCCGTAT CGATGTTGAC 2460 

TACATAAGGA TAGCCAGACT CTTGTTGCAA TTTAGAGATT TCAGTTTCCA AATCCCGCGC 2520 

CTTGATTTTT GTCTTGCGAA TATTTGGATT TGCGACCAAT TCATCGTATT TTTCAGTAAT 2580 

GTCGATGTAA TTGAATGGCA CACCGTATTC TTTTTCTACA GAGTAAGGGC TGAAGAGGTA 2640 

CATTTCTTCA TTTTTACGAG CCAATTCGTA GAATTTATCA GGTACTACAA CACCAAGTGA 2700 
TAGAGTCTTG ACACGTACTT TTTCATCAGC GTTTTCTTTC TTAGTTGAAA GGAAAGCGAT 2760

GATATCTGGG TGAAAGACGT TGAGGTAGAC AACACCAGCA CCTTGACGTT GCCCCAATTG 2820 

GTTGGAGTAA GAGAAGCTGT CTTCAAAAAG CTTCATAACA GGAACGACAC CTGAAGCAGC 2880 

TCCTTCATAG CCTTTGATAG GTGCACCAGC TTCACGAAGG TTGCTGAGGG TAATTCCCAC 2940 

ACCACCACCA ATACGTGAAA GTTGAAGAGC TGAGTTGATA GAACGCCCGA TAGAGTTCAT 3000 

ATCATCCGTC ACTTGGATTA GGAAACAAGA TACCAACTCC CCACGACGAG CACGTCCAGC 3060 

ATTCAAGAAG GAAGGAGTAG CAGGTTGGTA GCGTTGGTGG ATGATTTCAT TGGCAATATC 3120 

GATTGCAACA GCTTCATTCC CATCAGCGAA ATAAAGGGCA TTGAAGAAGA CACGGTCTTC 3180 

CATATTTTCA AGATAGTATT CACCGTCATT AGTCTTTAAG GCATATTGAT TGTAAAATTT 3240 

ATAAGCTGCC ATGAATGACT TGAATTGGAA GTTTTGGTCT TTGATAAATT GAGCTAATTC 3300 

TTCCAAGAAC TCTGGACGGT ATTTCTTGAT AAAGGCTGTT TCGATGTAGT TGTGTTCAAT 3360 

GAGGTAATTG ATTTTGTCTT TGATTGAATC AAAAACCATA GTGTTTGGAA CTACATTTTC 3420 

TTTAAAGAAA GCATCCAAGG CTTCCTTGTC TTTATGAAGC ATGATTTGTC CATTAACAGG 3480 

ACGGTTAATT TCGTTATTAA GACGGAAGTA AGTCACGTCT TCAAGATGTT TTAATCCCAT 3540 

AAAATTTCCC TTATCTAATT ACAAAAGAAA GGCTTCTAAG TTAGCCCTAA AAGCAGTTTC 3600 

TTCTGGATGA TGTACTAAGA TTATGCTAAT TGTTTCAGTT TTCCTGGTTG GAA^CCTGAA 3660

AAGACTTCAG TTGGTGTTTG GATAACAGGA GCTGCGCTAA AACCGAGCTC TTTAACTTGA 3720 

TCGACGTACT CAGGTTGCTC ATCAAGATTG ATTTCACGAT AAGAGACATT ATTACTGTCC 3780 

AAGAAACGCT TGGTCATTTT ACATTGGACA CAATTGTTTT TAGAATAAAC GGTTACCATT 3840 

GTGTAACTCC TCTTCAAAAT TTAATACTAT CTTAGTATAT CAGAAAATAA AATTTTGTCG 3900 

3902 

(2) INFORMATION FOR SEQ ID NO: 213: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2456 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 213: 

TATTGAAGCT ATTGTAGACT ACAAAGATAA GGATTTGCAG TTAGTAGGCG GTGAGACTCA 60 

CTGATAACCT AAAAAGGATA GTCAATTATG CTTGTTTACT AACTATTAAC TATGCTAAAT 120 
CAATTGAGGT TGTTTACATA AAACTCTATA TCAGAGAAGC CTGATATAGA GTTTTTTCTT 180

GCTAGTTTTA GGATTTTTTT GTAAAATAGA AAAAGTGAAG AGAGGTATGA AATGAGCAAG 240 

AAAGATAAAA AAATCGAAAT TCAAGTAGCG GATGCCAAAG TTAATGTTGG TAAAGACAGT 300 

TTTGAAGGTT ATACATTGAC TATCGGTAAA AAAGTTATCG GAGAAATTGC CGAATTAGAC 360

GGACAATTTG CCATTATAAA GAATGGGAAT GTCGATAGTT TTTATAAAAA ATTGGT^AAAA 420 

GCTGTGGAAA TTTTGATTGA AAATTATAAT TTAGCAAAAT AAGTCTTGTT TTTTTGAAAT 480

TTTCATGATA TAATAGTCCA TGTTGATTGT AGGAGAGATA GCGAAGAGGC TAAACGCGGC 540 

GGACTGTAAA TCCGCCCCTT CGGGTTCGGG GGTTCGAATC CCTCTCTCTC CATTTCATTA 600 

ATGGGGTATA GCCAAGCGGT AAGGCAAGGG ACTTTGACTC CCTCATGCGT TGGTTCGAAT 660 

CCAGCTACCC CAGTTCTTAG GTAATAATCA AGATAGAAAG CAAAATATCT TAGGGTATTT 720 

TATTTTTATA ATTGAAAGAC GTGAATGATA TGAACATGTC CTTGCGGGTG CTTAGGAAAA 780 

AAATTATAAG TATGTCAAGT TTAAGAAAAA CTTGATTGTT GGAGGATTTT TTAGATGAAC 840 

GAATTTGAAG ATTTGCTAAA TAGCGTTAGT CAAGTTGAGA CTGGTGATGT TGTTAGTGCT 900 

GAAGTATTGA CAGTTGATGC GACTCAAGCT AACGTTGCAA TCTCTGGAAC TGGTGTTGAA 960

GGTGTCTTGA CTCTTCGCGA ATTGACAAAC GATCGTGATG CAGATATCAA TGACTTTGTT 1020 

AAAGTAGGAG AAGTATTGGA TGTTCTTGTA CTTCGTCAAG TAGTTGGTAA AGATACTGAT 1080 

ACAGTTACAT ACCTTGTATC TAAAAAACGC CTTGAAGCTC GCAAAGCATG GGACAAACTT 1140 

GTTGGTCGCG AAGAAGAAGT TGTTACTGTT AAAGGAACGC GTGCCGTTAA AGGTGGACTT 1200 

TCAGTAGAAT TTGAAGGTGT TCGTGGATTT ATCCCAGCTT CAATGTTGGA TACTCGTTTC 1260

GTACGTAACG CTGAGCGrTT TGTAGGTCAA GAATTTGATA CTAAAATCAA AGAAGTTAAC 1320 

GCTAAAGAAA ACCGCTTCAT CCTTTCACGT CGTGAAGTTG TTGAAGCAGC TACTGCAGCA 1380 

GCTCGCGCTG AAGTATTCGG TAAATTGGCT GTTGGTGATG TTGTAACTGG TAAAGTTGCT 1440 

CGTATCACAA GCTTCGGCGC TTTCGTCGAC CTTGGTGGTG TTGACGGATT GGTTCACTTG 1500 

ACTGAATTGT CACATGAACG TAATGTATCA CCAAAATCAG TTGTAACTGT TGGTGAAGAA 1560 

ATTGAAGTGA AAATCCTTGA TCTTAACGAA GAAGAAGGAC GTGTATCACT TTCACTTAAA 1620 

GCAACAGTAC CAGGACCATG GGATGGCGTT GAGCAAAAAT TGGCTAAAGG TGATGTAGTA 1680 

GAAGGAACAG TTAAACGTTT GACTGACTTC GGTGCATTTG TTGAAGTATT GCCAGGTATC 1740 

GATGGACTTG TTCACGTATC ACAAATTTCA CACAAACGGA TTGAAAATCC AAAAGAAGCT 1800 

CTTAAAGTTG GTCAAGAAGT TCAAGTTAAA GTTCTTGAAG TTAACGCAGA TGCAGAACGC 1860 

GTGTCACTTT CTATTAAAGC TCTTGAAGAA CGTCCAGCCC AAGAAGAAGG ACAAAAAGAA 1920 




GAAAAACGTG CTGCTCGTCC ACGTCGTCCA AGACGTCAAG AAAAGCGTGA TTTCGAACTT 1980 

CCAGAAACAC AAACAGGATT TTCAATGGCT GATTTGTTTG GTGATATCGA ACTTTAATCA 2040

AATTGAAAAT TCACAAAATC CTTTGTTTAC TAAACAAGGG ATTTTTCTGG CTCTTTGTCA 2100 

ACTGTAGTGG GTTGAAGAAA AGCTAAGCTC GAGAAAGGAC AAATTTTGTC CTTTCTTTTT 2160 

TGATATTCAG AGCGATAAAA ATCCGTTTTT TGAAGTTTTC AAAGTTCCGA AAACCAAAGG 2220 

CATTGCGCTT GATAAGTTTG ATGAGATTAT TGGTCGCTTC CAGTTTGGCG TTAGAATAGT 2280 

GTAGTTGAAG GGTGTTGACA AGCTTTTCTT TATCTTTGAG GAAGGTTTTA AAGACAGTCT 2340 

GAAAAATAGG ATGAACCTGC TTAAGATTGT CCTCAATAAG TCCGAAAAAT TTCTCCGGTT 2400 

CCTTATTCTG AAAGTGAAAC AGCAAGAGTT GATAGAGCTG ATAGTGGTGT TTCAGG 2456



(2) INFORMATION FOR SEQ ID NO: 214: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10974 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 214: 

AAATAGGATA TAGAGACATC CTTCTGATCT GCTTTTwACA AAGTCCAATT ATATGCGGAT 60 

CTATACCTCC ACAATGTCCA TTATTATmCC TAACTATAAT ATGAGCCGAA AACACTATAT 120 

CCTTAATGTC TCCATATCCA TCAGGGATAT TAATATTTAT TTTTCCACAA CTATATTGCA 180 

TTGTAACCAT CTCCTTAAAC GACGCATTAT GATATTTGAT AGAGAAATTT TTATGAATAA 240 

CTCAATAATT TTATAGTAAA TCATGCTTAT ATCTCAAAGA TACCTATTTT ATCTTGTCTC 300

GACCTTCTCC AAAGAATTGC TATAATACTA TTACAAATCC ATCTGCACTA CACTTCAAAT 360 

TTTAGCACTG TATAAAAACG TTTCAATACA CTAACTTCAA GAAAACTTCC ACTATTAATT 420 

GAAAAAATTG ATAGAGATAA ATTAAAAATC TATATTGAAA CTCATCCCGA TGCTTATTTG 480 

ACTGAAATAG CTGCTGAATT CAACTGTCCT CCAACAACTA TTCATTACGC TCTAAAGGCT 540 

ATGGGATATA GTCTAAAAAA GAGCCGTACC TACTGCGAAC AAGACCCAGA AAAAGTAAAl' 600 

CGGTTCCTTA AAGAATTGAA TCACTTAAGC TACCTGACTC CTATTTATAT TTATGAGACA 660 

GGGGTTGAGA CCTATTTTTA TCTCGAATAT GATCGAGCCT TGAGCAGGCA GTTAGTCTCT 720 

CTGGAAGAAG ATATAATTAT TTGAATTAAG ATCGAGACAA CGCACACCAG AGATTGCGAT 780 

ACTGTTATAG AAGTACTAAT GCCCTTTTTT GTTTCAATAT ACTATGGCTC CGATGACCTA 840 
TAAAGATACG ATGACGAGTG ACTTTTTCGA AGCTTGCTTC CAAAAATTCT TACTACCTAC 900

TTTAGATACA CCATCCCTTA TCATTATGGA CAATGCAAGG TTTCACAGAA TGAACATGTG 960 

TAAGGAGCAG GGCATAGACT GTTACCACTT CCTACCTATT CACCCGAGTA TAATCCCATT 1020 

GAGAAAATAT GGGCTTACAT CAAAAACATC TCAGAATAAT ATTGTCAAAT TACGATGCTT 1080 

TTCTTGAGGC ACTTTTGTCC TATTCTTGTT TCAGCCGACT ATACTCCGTT ATTGGGCAGC 1140 

TACGGAACAG TCGATGGGAC GATGGGGGGA CATAAAAAAA TCCTCCAGTT TTGTTTTTTA 1200 

TAACAGTATA CTGGAGAATT GACAATCTCG GTAGATACCT CGTTATAGCG CGGTTACTTA 1260

TTAGGCAGTT ACAAAACAAC TGTGAACAGA AAACATTCCA GAGTCAGACA AGACTTTGGA 1320 

ATGTTTTGGC TCTATAATTT CTGTAGTGGG TAATCCCACC CCAGGAATTA TAGGGTCGTT 1380 

TCTTGTAGAA AAAAAGCCCC ATATGACCTA TAATGAAAAG CGTCTAACCA ACTCATTAGA 1440

AAGGGTTCAT ATGGAACAAC TTAAGAATAC CACAGATTTG CTCGGATTGG AAGACAAAAA 1500 

TATCAAAATC TTGTCTGTTC TGAAATACCA AACCCATCTA GTCGTTCAGG C/VAAGTTGGA 1560 

TTCCCCCGCT CCTCCTTGTC CTCATTGTCA AGGGAAGATG ATCAAATACG ACTTCCAGAA 1620 

AGCCTCTAAA ATTCCGCTTC TCGACTGTCA GGGTTTACCC ACGGTACTGC ATCTCAAAAA 1680 

GCGCCGCTTT CAGTGCAAGA ATTGCCTTAA GGTGGTCGTT TCTCAAACAT CCATTGTCAA 1740 

GAAAAATTGC CAGATTTCCA ACATGGTGAG ACAAAAAATC GCTCAGCTCC TCCTTGAAAA 1800 

GCAGTCTATG ACTGAGATTG CCCACAGATT GGCGGTCTCA ACTTCCACCG TCATCCGAAA 1860 

ACTGAGGGAA TTTAAGTTTG AAACCGATTG GACCAAGTTG CCAAAAGTTA TGAGTTGGGA 1920 

TGAGTATAGC TTCAAAAAGA GCAAAATGAG CTTCATTGCC CAAGATTTTG AGTCCAAATC 1980 

CATCCTCGCA ATTTTAGACG GGCGAACTCA TGCGGTGATT CGAAACCATT TCCAACGCTA 2040

TCAGAGAGAG GTTCGGGAGC TGGTCGAGGT CATCACCATG GACATGTACA GCCCTTATTA 2100 

TCGGCTCGCT AAGCAACTCT TTCCAAAGGC GAAGATTGTT CTTGACCGCT TCCACATTGT 2160 

CCAACATCTG AGCCGAGCTA TGAACCGAGT ACGAATCCAA ATCATGAACC 7VATTTGACCG 2220 

AAAATCCTTG GAGTATCGGG CGCTCAAGCG CTTTTGGAAC CCTCGCTTTT TCGTTTCTAG 2280 

GCTCGGGCTA AATCAGTCCA CTGGACTGAT TTACTACACC AGTATAGCTT CAAGCTCTGT 2340

CAGAAACGAT TCTATCAGCC CACGTTTCGA ATGCACTTAA CCCATCGGGA AGTACGAGAT 2400 

AAGCTGCTTT CTTACTCTGA GGGATTACAG GTTCACTACG AACTCTATCA ACTCCTGCTC 2460 

TTTCATTTTC AAGAGAAGAA TGCCGACCAT TTCTTTGGAT TGATTGAGCA AGAACTGCCA 2520 

ACGGTTCATC CGCTTTTTCA AACGGTCTTT TGGACTTTTT TAAGGGATAG AGATAAGATT 2580

ATCAACGCAC TTAAGCTGCC TTATTCCAAC GCTAAACTTG AAGCGACCAA TAATTTGATT 2640

/iAuA 1 1 A I L. A AGCGCAAAGC CTTTGGTTTC CGGAACTTTA ACAATTTTAA AAAACGGATT 2700

1 i^aATGACTT TGAACATCAA AAAAGAGAGT ACGAATTTCG TACTCTCCAG ATTGCAGCTT 2760

TTCGCCTACC CACTACACTT GACAAAGAGC CACTCTTTAT TCCATGGTAT CAAAGGCAAG 2820

ACTTGGTTTG GCATTGAGGT CCCAGCCTGC GAAGTTTTCT TTGTTCCACT CGCTGACGCT 2880

GGCATAGGCA ATCATACCTG CATTGTCTCC GCAGAGTCGC AGAGGGGGGA TGATAACCTT 2940

GACATCTGTG ATTTCGGCTG CTAGGCGTTC TCTGAGACCT TTATTGGCTG CCACACCACC 3000

TGCCACAACT AGGATTTTAA CAGGATATTT CTCCAAAGGC TTCTTGGTTT TTGCCATGAG 3060

AATGTCCATA ACTGCTGCTT GGAAGGAAGC ACACAAATCT TCTGTAGACA GGCTTTCTCC 3120

CTTTTGCTCG GCATTGTGAT GAAGATTGAT AAAGGCAGAT TTCAAACCTG AGAAGGAGAA 3180

CTCCAGATTA TCTTCCTTAA TCATGGCACG GGGGAAATCA TAAATATCCT GCCCCTGATG 3240

AGCCAGCTCG TCAATCTCAC GACCTGCAGG ATAGGTCAAG CCCATGACAC GGCCGACCTT 3300

ATCATAAGCC TCACCAACCG CATCATCACG GGTTTCCCCA ACAATCTTAT AATCTCCTGC 3360

CTCCGAAACA TAAACCAACT CTGTGTGTCC GCCGCTGACC AAGAGGGCTA GCAAGGGAAA 3420

CTCCAAAGGC TCCACACTCT GAGCTGCCAT GAGGTGCCCA GCCATGTGAT TAACAGGAAT 3480

CAGTGGAAGT CCGTGAGCCC AAGCAAAGGC CTTGGCAGCT GACAAACCAA CTAGCAAGGC 3540

TCCGACCAAG CCTGGTCCGT AGGTAACCGC AACAGCTGTC ACGTCCTCTT C GOT AAT 3600

TGCTTCTGCC AATGCCTCCT CGATACAGGC TGTAATGACC TCGACATGGT GACGACTGGC 3660

TACTTCGGGC ACTACGCCAC CAAAACGTTT GTGACTCTCA ATTTGACTAG CAATGACATT 3720

GGACAAGAGC TCATCGTCGT TTTTCAAGAC GGCGACACTG GTCTCATCAC AGGATGTCTC 3780

AAATGCTAAA ATATATCTAT CCTTCATCTA TTTCTCTCTT CATGATAATG GCGTCCTCGA 3840

CTGGGTCATG GTAGTAGGCC TTTCGCTCAG CGATAACTGT CATCTTTTCT TTCTTGTAAA 3900

ATGCTTGCGC TCGTTGATTT GACTGTCTGA CTTCGAGGAA AATTTCCTTG TCTGTCGGCA 3960

ATTGAGCAAA CAAGGcTGAC GCAATCCCCT GACCCTGATA AGCTCCTTTG ACAGCGATTT 4020

GCAGGACTTC TGCTTCAAAA AGATTCTCCT GCACAGCTAG AAATCCAATC ACTTCTGCCC 4080

CATCATAAGC CAATGCATAC CAAGTCTGGT CTTGGGACAG ATCTGCTTGG ATTTGCTCCA 4140

GAGTCCAAGG ACTGACTAGG TAAACAGCTG CCATAACAGC GTAGATGGCT TGAGCTAGGT 4200

CAGGCTGTTG TTGAATTCGC TTGATTTCTA TCATAGGCGT TTAATGTAAG ACTCGCCAGA 4260

CTCGGTATGG TTCTTGAGCC AGTTTTCCTC AGCCTCGACT CGTTTGAGGT AATTCGGCAC 4320

AAAATCATGC AAGGAGTCTG CTTCCTTGTC CCAGGCCAAA AGAGCTAGAT TAGCTGCATT 4380

GGGCAATGTT TCTTTGTAAT CAGTCCTTGG CAAGTGTTTT TGAATCTGCT CAACAAAGGG 4440

GCCAACTTCT CCGACAAAGG TTACCTGACT AGTACCCTTG ACTTTTTCTA GCACCTCTTC 4500

AAAAGATAGG TGCGCTTCTG CCATGACAGG TTTGGCATTT TCATAAAATC CTGCATAAAC 4560 

ATTATTGCGA CGCGCATCCA TCAAGGGGAC AAACAAACCT TCTTGTTGAT GGGGCACCAG 4620 

AGCCAAGAGA CTCGACATAC CAACCAACTC GATGTTCAGG GTGTGAGCTA AGGTCTTAGC 4680

AGTTGCTACC GCAATTCGCA AGCCTGTATA GCTACCCGGC CCTTCAGCTA CCACGATTCG 4740 

GTCCAAATCC TTGGGTGTCC AATCCAAACT TGCCATCAAA AAATCGATGG CAGGCATAAG 4800

AGTAATACTG TGATTTTTCT TAATATTAAT CGTCGTCTCG GCAAGAACCT GCTTATCCTC 4860 

TAAAATAGCC AGAGAAAGAG CCTTGCTGGA CGTATCAAAA GCTAATACTT TCATAACACA 4920 

TTCCTATCTT TTTGTCTGCT TACTATTATA CTACAAAAGC TGGCACATGG GAATTTTCTT 4980

TGCCCCCAGA CAAGAGTGCC CTCACTTAAC TAAAAATAAT TTAAAAAAAT GCTCACTTTT 5040

CCTTTTCTTT TCCGAATATA AAAGTGAACA AGAAAAAAGG AGGAAAGTTC AATGACAAAT 5100 

TTTGACATTC TTGACAATCA ATTTTTATCC TTATCTGAAA ATGAATTATC AGATATTGAT 5160 

GGCGGTCTCG CTCCCTTGGT TATCTTTGGA GTAGCAGTAT CTTGGAAGGC TATTGCAGGT 5220 

GGAACAGCAC TTATAGGTTC TGGTTTGGCA GCTGGTTATT TTTTAGGAGG AGATTAATAT 5280 

GATGAAAGAT TTGAACAATT ATCGTGAAAT TTCTAATAAG GAATTGCAAG AAATCAAGGG 5340 

TGGCTTTGGT GTCGGTGTTG GTATCGCTTT ATTTATGGCA GGTTATACCA TTGGAAAAGA 5400 

CCTTCGTAAA AAGTTTGGTA AGTCATGCTA GATAAGAAAC ACATTTTTAG AAGGATAAAT 5460

TTTATTGTCT TCATCTCTTA CAGTTTGCTC AGCATTCTCA ATGATTTGAA CATTACTACC 5520 

ATCCCTTTAC CATTCGATTT ATCTGTTTGT ATTGTTTTAT TTTTATGCTT CAACTCTATT 5580 

TTTGATCAGA ACAATGACTC CCATAAAAAT AATAAGCTTT GAAAATTCCA TTGTCATGTC 5640

ATGTTAGAAA AATGCAAAGA CCACCTCATC TTGATAGATG GGGTGGAATT TTCGTGTCGT 5700 

AAATCTACTA TCTCTACATT CCCAAACAAA AAACCCCAGC ATAAGCAGGG CATCTAAGCA 5760 

TTTAATTCAA AGTAAAATAC AAACCAAACG ACATAGGTCA CGAGGAGGAG AAAAAGCGAG 5820 

TAGAGAGTCA CAAAGGTCAT TTTCCACAAG AACTTGGTTT GTCGTCGTTC CAGTTTGGCA 5880 

AATAGAAGAT TCCCCGCATA AACGCAAGCA ACAAAAACAA TAAAAGCTAC CAAGCGAGCT 5940 

CCGATAGCAA AAGCAAATAA GTTATACATA GGGCAACCTC CTTGACTTAA AATCTATATG 6000 

GAATTATGAC AAGCAATAAA TTTCACTTCC GTTATCAACA TAATACATTT TCTTTATTTT 6060 

TGAAAACGCT TACCAAAGAA ATCGTCCCCT AACTTTCTCG TTTCCGTCTT TTACTAATTT 6120 

TTCATTTTGT GGTATAATTG AAATAATTGT AACGAATCAA GGTCAATCTA GACACAAAAT 6180 
GGAATGAAAT CAAGCAAATA TCTGCTAAAA GTTTGGAATA AGCTGACCTG TAAATAGAAA 6240

GGAACTATAT GATTTACAAA GTTTTTTATC AAGAAACAAA AGAACGTAGC CCACGCCGTG 6300 

AAACAACACG CACGCTTTAC CTAGACATCG ATGCCAGCTC AGAACTTGAG GGCCGTATCA 6360 

CTGCTCGCCA ACTTGTCGAA GAAAATCGCC CAGAGTACAA TATCGAGTAT ATCGAACTCT 6420 

TGTCTGACAA ATTGCTCGAT TACGAAAAAG AAACTGGCGC CTTCGAAATT ACGGAGTTCT 6480 

AATATGGCCT ACACTCTTAA ACCTGAAGAA GTCGGCGTTT TTGCCATCGG TGGTCTAGGA 6540 

GAAATCGGGA AAAACACTTA CGGAATTGAA TACCAAGACG AGATTATCAT CGTCGATGCT 6600 

GGGATTAAAT TCCCAGAAGA TGACTTGCTT GGTATCGACT ATGTCATTCC TGACTACTCT 6660 

TACATCGTGG ACAATATCGA CCGCGTCAAG GCTGTTTTAA TCACACACGG ACACGAGGAC 6720 

CACATTGGTG GGATTCCGTT CCTACTCAAG CAAGCAAATG TCCCTATTTA TGCTGGACCG 6780 

CTTGCCTTGG CTTTGATCCG TGGGAAACTC GAAGAACACG GCCTCTTGCG CAACGCCAAA 6840 

CTTTACGAAA TCAACCACAA CACCGAGTTG ACCTTTAAAA ATCTCAACGC AACTTTCTTT 6900 

AGAACGACTC ACTCTATTCC AGAGCCTTTG GGGATTGTCA TTCATACTCC TCAAGGGAAA 6960 

ATCGTCTGTA CGGGTGACTT TAAGTTCGAC TTTACTCCAG TTGGAGAACC TGCGGACTTG 7020 

CATCGTATGG CTGCGCTTGG TGAAGAAGGC GTGCTCTGTC TCCTGTCTGA CTCGACAAAT 7080 

GCGGAAGTAC CAACCTTTAC CAACTCTGAA AAAGTCGTTG GTCAGTCCAT TATGAAGATT 7140 

ATCCAAGGTA TTGAAGGACG TATCATCTTT GCATCCTTTG CCTCAAATAT CTTCCGTCTC 7200 

CAGCAGGCAA CAGAAGCTGC TGTTAAGACT GGACGCAAGA TTGCGGTCTT TGGTCGTTCT 7260 

ATGGAAAAGG CCATTGTCAA CGGAATCGAT CTTGGCTACA TCAAAGCTCC TAAGGGAACC 7320 

TTTATCGAGC CAAATGAAAT CAAAGATTAT CCTGCAGGAG AAGTTCTTAT CCTCTGTACA 7380 

GGTAGTCAGG GTGAGCCTAT GGCAGCCCTC TCTCGTATCG CCAACGGAAC CCACCGTCAA 7440 

GTACAATTAC AACCAGGTGA TACCGTTATC TTCTCTTCTA GTCCCATCCC TGGAAACACT 7500 

ACTAGTGTCA ACAAGCTGAT TAACATCATT TCTGAAGCTG GTGTCGAAGT TATCCACGGT 7560 

AAAGTGAACA ATATCCATAC ATCTGGACAC GGTGGTCAGC AAGAGCAAAA ACTCATGCTC 7620

TGCTTGATTA AGCCAAAATA CTTCATGCCT GTCCACGGTG AATACCGCAT GCAAAAAGTC 7680 

CACGCTGGAC TAGCAGTGGA TACTGGTGTT GAGAAGGACA ATATCTTTAT CATGAGCAAT 7740 

GGCGATGTGC TTGCCCTTAC TGCTGACTCA GCTGGTATCG CAGGTCATTT CAACGCCCAA 7800 

GATATCTATG TCGATGGAAA TCGTATCGGT GAAATTGGCG CAGCTGTCCT CAAAGATCGT 7860 

CGCGATCTAT CTGAAGACGG TGTCGTTCTG GCAGTTGCAA CTGTTGACTT CAAATCGCAG 7920 
ATGATTCTAT CTGGTCCAGA CATCCTCAGC CGAGGCTTTG TCTACATGAG AGAGTCTGGC 7980

GACTTGATTC GCCAAAGCCA GCGTATCCTC TTCAATGCCA TTCGTATCGC ACTGAAAAAT 8040 

AAGGATGCTA GCGTGCAATC TGTCAATGGT GCCATTGTCA ACGCTATTCG CCCCTTCCTC 8100 

TATGAAAATA CCGAACGTGA ACCGATCATC ATCCCGATGA TCCTCACACC AGATGAAGAA 8160 

TAAAGCAAGA AAACAGCCCC GTCCTCGGAG CTGTTTTTCT CTATGCTTTC TTTTGAGATT 8220 

AAAACTCATA CTCAATGAAA ATCAAAGAGC AAACTAGGAA GCTAGCCGTA GGTTGCTCAA 8280 

AGCACTGCTT TGAGGTTGTA GATAGAACTG ACGAAGTCAG TAGCCATACC TACGGCAAGG 8340 

CGACGTTGAC GCGGTTTGAA GAGATTTTCG AAGAGTATCA ATAAAAATCG AAATCAGACT 8400 

AGAAGGCTAA GCGAAAGCAT AACTTGAGTT AGCTCCCATA GTTCGGGAAA CTATGGGAGG 8460

CTGGAGATGA ATCAAAGCCA AGCTTTGAAC TCATTCGTAA GAAGCCGACG ACGTATCATT 8520 

TTGATTTTTG AAGAGTTTTA GAAATACTAC GATTTTTACC TTCCAGATAC ACCATCAAAA 8580 

TAGAAATATC TGCTGGGTTT ACTCCCGAAA TACGGCTGGC TTGGCCGATG GTTTCTGGAT 8640 

TGATGAGTTT GAACTTCTGA CGGGCTTCGG TTGCGATAGA ATCAATGTCA TCCCAGTCGA 8700 

TATTGGCCGG AATGCGTTTT TCTTCCATGC GTTTCATCTT GGCAACCTGG TCCATGGCTT 8760 

TGGAAATATA GCCTTCATAC TTGATTTCTG TTTCAATCAA TTCGATAATC TTGTCATCCA 8820 

AGTCTTCTGC AGCTGGTCCG ATGAAGGCCA CCACATCTTG GTAAGAAACT TCTGGACGGC 8880 

GAAGGAATTC CTTGGCTGTC ACTGCATCGG TCAAGGGTTT GAAGCCCATC TCCTCAACCT 8940 

TGGCATTGGT TTCCTTGACT GGCTTGAGTT TGATACTGTC TAGGCGCTTC ATCTCATTAT 9000 

CAAATTGATT TTTCTTGATT TCAAAACGAG CCCAGCGTTC ATCGTCCACA AGGCCAATCT 9060 

CGCGTCCCAT CTCAGTCAAG CGCATATCAG CATTGTCATG ACGAAGAATG AGACGGTATT 9120 

CAGCACGACT GGTCAAGAGA CGGTAGGGTT CAATGGTTCC CTTGGTCACC AAGTCGTCGA 9180 

TCATCACCCC GATATAACCA TCACTGCGCT TCAAAATCAA TTCAGGCTTG CCTTGGATTT 9240 

TCAGAGCCGC ATTGATACCC GCGATAATCC CTTGGCCTGC TGCCTCTTCG TAACCTGATG 9300 

TTCCATTTGT CTGACCAGCA GTGAAGAGAC CTGAGATTTT CTTGGTTTCC AAAGTCGCAC 9360 

GCAACTGATG AGGCAAGACC ATATCATACT CAATAGCATA ACCTGTCCGC ATCATCTCTG 9420

CATTTTCCAA ACCTTTGATG GAATGCACCA AGTCACGCTG GACATCCTCA GGCAGACTGG 9480 

TTGAAAGTCC TTGCACATAG ACTTCCTCAG TATTGCGCCC TTCTGGCTCA AGGAAGAGTT 9540 

GGTGACGTTC CTTGTCCGCA AAGCGCACAA TCTTGTCTTC AATCGACGGA CAGTAACGAG 9600 

GCCCCACTCC CTTGACCACA CCTGTAAACA TAGGCGCACG GTGGAGGTTG TTTTGGATAA 9660 

TCTCATGACT GGTACCATTG GTATAGGTCA ACCAGCATGG TACTTGGTCC TTGACATAAT 9720 
CCTCATCACG TGAAGTGTAT GAGAAATGAT TAGGCACTTC GTCTCCTGGC TGAATTTCTG 9780

TCACATCGTA ATTGATAGAA GAAGCCTTGA CACGTGGAGG GGTTCCTGTC TTGAAACGAC 9840 

CGATTTCGAG ACCCAGTTCC TTGAGATTGT CAGCTAGGTT AATAGAAGCC AAGCTGTGGT 9900 

TAGGACCTGA TGAGTACTTG AGGTCTCCGA TGATAATTTC CCCACGGAGA GCAGTCCCTG 9960 

TCGTCACAAT AACAGCCTTA GCAGCATATT CTTGATGGGT GGCTGTACGC ACACCGACAA 10020 

CCTTGCCATC TTCCACCAAA ATCTCATCAA TCATGGTTTG ACGAAGGGTC AGATTTTCTT 10080 

GGTTTTCAAC CGTCTTGCGC ATCTCCTTAG AGTAAAGTTC CTTGTCAGCC TGCGCACGAA 10140 

GGGCACGGAC AGCTGGCCCC TTCCCTGTGT TTAGCATCTT CATCTGGATG TAAGTCTTGT 10200 

CAATGGTTTT GGCCATCTCG CCACCGAGGG CATCGACTTC ACGCACGACA ATCCCCTTGG 10260 

CAGAACCACC GATAGAGGGA TTACAAGGCA TGAAAGCCAG CATTTCAATA TTGATGGTCG 10320 

CAAGCAGGAC CTTACAGCCC ATACGGCTAG CGGCCAAGGA AGCCTCAACC CCAGCGTGTC 10380 

CCGCACCAAT TACAATAATA TCGTATTCTT CAGTAAAATG ATAAGTCATG TTTCTCTCCT 10440 

ATTCCTCAAG ATGAATGTGT CTTAGTTGGC CTTCCCAATC TGGTAGGGCT GTTTTTAAAA 10500 

AGACTGGAAC TAGCTGGATA TTCTGGAGCT TATCCAAGTC AATCCACTCA CAGGGCTGCC 10560 

TTTTCTCATC TTCCTGCATG GTCAACGGGG CATCTTCAAG CAAATCCACC AGATAATGAA 10620 

ACTCGATATT GTGATAGGAA ACGCCGTCCA CTTCAAAACG ATTTTCAArr ACAAAAGCTA 10680

GCTGCCCAGC TTGAGCTTTG ACACCCAGTT CTTCCTTCAC TTCACGGACT ACCGCGTCTT 10740 

CCGTGCTTTC ATTGACTTGA ATCGCACCTC CAATAGTGTA ATACTTGCCC TTGTCTTTGG 10800 

TAACTAGAAG CTTGTGATTT TGGACAATCA AGGCTGTAGC CCGAACACCA AAAACCGTAT 10860 

TGTCTACTTT TGTCCGAAAG TCTTGTTGAG TCATTCTTGT CCTTTCCCTT AAACGACACA 10920 

AAAACAGTCA AAACTACAAA GAAGTGCAGG ACAAAAAAGC CTGCAACATC CAGG 10974 
(2) INFORMATION FOR SEQ ID NO: 215: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 987 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 215: 

CCCGTTATGA TTATGGATAG CGCTTTCAAA TTTTTAAACT CCTATCCCAT CCTTTTATCT 60 

ATATAATAAG TGAAAATATA ATAACTGTCA AGTAACTGAA GTGAATTTTA TAAAAAAATT 120 
ACAAGCCAAA TTTGTAAAGT TTACACTAAG CCGCTAGgCA ATCGTCTATC AGAATATCCG 180

TTTATTTGTC AATAATCCGA GAAAATCTTG CAACGCTTAG AAGTCTATAA AAACTATCAA 240 

CATTTATATG ACTTGCGAAT AGCAATCCTG CTAAACCTTT CCACACTCTA TCTATACAAT 300 

CAAGATAAAA ACATGTGTAA GCAAATCTGC TACACTTTAC TGGAGGACGC CAAGAATAAG 360

AAAAGCTACG ATAGGCTTGC TATCTGCTAT GTCCGTATTG GGATTTGTAC AGACGATTCT 420

AAACTTATCC AAAAAGGGTT CTCCCTTCTG GAGCTGACCG AGGAAACTTC TATGCTGTCT 480

CATCTCAAAA AAGAAGTAGA GACCCATTAT CAACCAAAGA AATTATAAAA AAAGTCGAGG 540 

GAGCTCCTCG ACCTTTTCAT AGAATCGCCG AACGATTTAA CGAGAAAGTA TGACTTTTAC 600 

GTTTATCCCA ACTCAATTAT GACATTTTTT TCAAAAGTCA ATATATCTCA CTTTTTCAAC 660

GACAAGAAAG AGGCTGATAA TCTACCAACC TCTTATTCTG AACCCATCAC TCCATCACTT 720 

TTTAGCTTCA TTCGCTTTCT TAGCGACTGC AATCTGGTAT TCGACTTGGT CATTCCCCTT 780 

ACCGGTACAA CCATGAGCAA TTGTAGTCGC TCCTATCTGA TGCGCTATTT CAACCAATTT 840 

TTTAGAAATC AGAGGGCGGC TCAAGGCAGA TACCAAGAGA TACTTTTGTT CATAATAGGC 900 

ATGTGACTGA TGAGCCACTA GCACATAATC TGTAGCAAAT TCGTCCTTAA CATCAATGAC 960 

ATAAGATTCT ACTGCCCAAA CCTTAAG 987 



(2) INFORMATION FOR SEQ ID NO: 216: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2651 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 216: 

CTGGGTCTTG TTCATAGTAG GTGTGGTtCT TTTTTTCGAG TGTAGCCCAT AGCTTTGAGC 60 

GCATAGTGGA TGGTAGTTGG ATGACAGCCA AAGTCAGAAG CTATTTCAGT CAAATAAGCA 120 

TCTGGATTGT CAGTAAGATA GTTTTTAAGT CTATCTCTAT CAACTTTTCT TGGTTTTGTT 180 

CCTTTTACTT GGTGGTTTAG CTCTCCTGTT TTCTCTTTTA GCTTTAACCA GCCATAAATG 240 

GTATTACGTG AGATTTGGAA AACGTGTGAT GCTTCTGTTA TACTACCTAT TCGCTCACAA 300 

TAAGAGAGAA CTTTTTTACG AAAATCTATT GAATATGCCA TAAGAAGATT ATACCACATT 360 

GTGTACTATT TTTGGTTCAT TTTACTATAT TTTATAAGTT ATAGTGTAGC ATTCCAACTT 420 

CAAAGCACTA TAAAGTAAAT TGAAACAAGA ACAATACAAA CAATTCTCGT AAACGGATTG 480

CAACCACAAA AAAGCAAGCA TTCACAAGAA TACTTACCTA TCATGGGAGG AACAACCGTT 540 
CCTCTTTTTT ATTACTAAAA TTCAAAGAAT TCCAATGCTT TTTTCAAGAG CAAATCCGTA 600
TATTCTGGAT CTTCTTGGGC TACTTCTATT TCCCGCTGAA CTTTTTCCAA ATCATCTGTA 660 
ATCACTCCAT CTACTCCTAA GTGAAGAGAT TTGCTGATAG CTTCTGAATC ATTGACAGTC 720 
CAGACATAAA GTTTCTGATC CGTTGTCCAT AGTTTGCTTA CAAAATATTC ATCCAAGGTT 780 
GAGTACTCCA TAGTATATCC TGTCGCTCTT GTTTTAGGAA AGACAGAATT GTAGGGCATG 840 
ATGAAATAAA CTGGTAGTTC GGCATCATAC TGTCTTACTT TTTCGACAAC ATGGTAGTCT 900 
AAAGACTGGA TTTGATGTCC ATAAATCTTG AGCTTTGCAG CATAACGGGC TAAAAAGCGG 960 

TTCATCATGT CTGGACTATC TTTTTTACTG GTTTTAATTT CAATTAGTAA TTTTTGACCA 1020 

AGTTCGTTGG CTCGACTGAG ATAATCTTCA AAGCTTGAAA TTTTAGTCTG GTAGCCATTT 1080 

TCAAAAATAT CAATCCCTTT AAGCTCCTCC AAGTTTAAGT CTTGAGGACT TTTATTGATA 1140 

CCTGCTAGAT TTTTCAAGTT AGCATCATGC ATCATGACAA ACTGCCCATC TTTTGTTTCC 1200 

TGCACGTCCG TCTCCACCAA GTCTGGTTTG AGTTGTGCTG TAGTTTCCAA GGACTCTACT 1260

GTATTTTGAA TCCCATTTGC ATTGGAAACC CCTCGGTGAG AAATAAGTTG AGGTAGATGA 1320 

ACCATGGGAG CCTCCAGATA AATATAACCT TCTAAGGCAA AGAAAAGACT GGCACAAGTC 1380 

ATGACACCCC ATCGCACGAT GTGATCTTTT TCTCTCCTAG GAAGCATATC CAGCTCCTTT 1440 

CCTGTCAAAA ATGAAACAAA TTTAACCAAA AAATAARTrA GAGCCATATA ATAGAGATTT 1500 

TTAATCACGA CAAAATTCAA AATACCAAGA ATCAGAGACT CTCTCTGAGT GATATCATCT 1560 

ACCAAAGTTT GAGCCAATAA TAAAGGAATC AAAGGAAGAT ArAATAATAA ATGTGCTTTG 1620 

AGCAAGATGT AAAATAAATT CCAAGCATAA AAAGTAACTC TCTTCTTGGT TTTCTCCAAG 1680 

CTAAACATCA CTGCTTCTCG AACAGTCAGC TGATCATATA CAATCTTCGG AAGGGCAAAC 1740 

ATCAATCTGA CAGAGACATA GAGAAAGATA AGAGATAGAA GTAGGATGCT CAGCCACCAC 1800 

ATCCAATATC TATCTTCTAA ATAAGCTTGG ATAAACTCTG GAATGACGAT TTTATTAAGA 1860 

TAATAAATCT TCAGCATTTT CCGTATAAAA GGAAACAGCA TAGCTATATA GAAAAAGATA 1920 

AACAAGGCTT TAGCGCAAGT TAGCTTTTTC ATAAATCCAA AACTTTCATG GAAAACCTTG 1980 

CGGATATACT CAATTAGCCT TCGCTTTTCA TTATAGAGGA GATGACGAGC ACCAATAAAG 2040 

AGGAGTCCTA TTTGAAAATA AGCAACCAGA AGGTTAATTA CAATCAAGGC TAAAAAAGCT 2100 

AGACTAATCA ATGGAGAATG AGTAAGGATG GCTAAGACAT TGTTATAGGA AATAAAAAGA 2160 

TAACCTGTCT GATCTAATAA GAAGCTAGCC AACCATGAAT TGAATGGTAC CCACAAATAC 2220 

TCCACTATCA TAAAAATCAA GAAAAATAGA AAGAGGATTT TATCAAGATC GAGGTAAATC 2280

TGTTTAAGAC CCAATTTTTT AGGTTTTTCA GGTTTCATAG GCACTCCTAG TCAAATAATT 2340

GAGACAAGTC CAAGCCACCA AAAGGATTGT TTGATAAGCT ACTTTCTGTC TCTAACAATT 2400 

CCCTAGCTTG ATCCGACTCT AAGAAGGATT CGTAAACACG CGCCGTCATC CGAGCATCCT 2460 

CTAAACTATT ATGAGACTGA CCTTGAAATC CAAGAAATGA GGCAACAGTT TGCAATTTGA 2520 

GATTGGCAAT ACCATGTAAA TCTGAACTCC GACGTTCAAA AGCTTCATCA TACAAATCCA 2580 

CCTTGTACTG TTGGCTATAG TCTAAACCAT GCTCTGCTAA AATAGGTAAA TCACTTTTAG 2640 

CAGCATTGTA G 2651 
(2) INFORMATION FOR SEQ ID NO: 217: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5638 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 217: 

CGTTATAATA AACTTGTGAA AAAATTAACA AAGGATATCG TTCCTTGAAA GCTATGGAGG 60 

AAAATATGGC TGATAAAAAA ACTGTGACAC CAGAGGAAAA GAAACTCGTT GCTGAAAAAC 120 

ACGTAGATGA GTTGGTTCAA AAAGCTCTAG TTGCCCTTGA AGAAATGCGT AAATTGGATC 180 

AaGaacaaGT TGACTACATC GTTGCCAAAG CATCAGTAGC AGCTTTGGAT GCCCACGGAG 240 

AATTGGCTTT ACATGCCTTT GAAGAAACAG GACGTGGTGT ATTTGAAGAC AAAGCAACTA 300 

AGAACTTGTT TGCCTGTGAA CACGTAGTAA ACAACATGCG CCACACTAAG ACAGTTGGCG 360 

TTATCGAAGA AGACGATGTA ACAGGATTGA CTCTTATTGC TGAACCAGTT GGTGTTGTTT 420 

GTGGTATTAC TCCAACAACA AACCCAACAT CAACAGCAAT CTTCAAATCA TTGATTTCAT 480 

TGAAGACACG TAACCCAATC GTCTTTGCCT TCCATCCATC AGCACAAGAA TCATCTGCTC 540 

ATGCAGCTCG TATCGTCCGC GATGCAGCTA TCGCAGCTGG TGCTCCTGAA AACTGTGTGC 600 

AATGGATTAC TCAACCATCT ATGGAAGCAA CAAGTGCCCT TATGAACCAC GAAGGTGTTG 660 

CGACAATCCT TGCAACAGGT GGTAATGCCA TGGTTAAGGC GGCTTATTCA TGTGGTAAAC 720 

CAGCTCTTGG GGTAGGTGCC GGAAACGTTC CAGCTTATGT TGAAAAATCA GCAAACATTC 780 

GTCAAGCAGC ACACGATATC GTCATGTCTA AATCATTTGA TAACGGTATG GTCTGTGCAT 840 

CTGAACAAGC AGTTATCATT GATAAAGAAA TTTACGATGA ATTTGTAGCA GAGTTCAAAT 900 

CTTACCACAC TTACTTTGTA AACAAAAAAG AAAAAGCTCT TCTTGAAGAG TTCTGCTTCG 960

GCGTCAAAGC AAACAGCAAA AACTGTGCTG GTGCAAAATT GAACGCTGAC ATCGTTGGTA 1020 
AACCAGCAAC TTGGATTGCA GAACAAGCAG GATTTACAGT TCCAGAAGGA ACAAACATTC 1080

TTGCTGCAGA ATGTAAAGAA GTTGGCGAAA ATGAGCCATT GACTCGTGAA AAATTGTCAC 1140 

CAGTTATTGC AGTTTTGAAA TCTGAAAGCC GTGAAGATGG TATTACTAAG GCTCGTCAAA 1200 

TGGTTGAATT TAACGGTCTT GGACACTCAG CAGCTATCCA CACAGCTGAC GAAGAATTGA 1260 

CTAAAGAATT TGGTAAAGCT GTTAAAGCTA TTCGTGTTAT CTGTAACTCA CCTTCTACTT 1320 

TTGGTGGTAT CGGGGACGTT TACAATGCCT TCTTGCCATC ATTGACACTT GGATGTGGTT 1380 

CTTACGGACG CAACTCAGTT GGGGATAACG TTAGTGCCAT TAACCTCTTG AATATCAAAA 1440 

AAGTCGGAAG ACGGAGAAAT AACATGCAAT GGATGAAACT TCCTTCAAAA ACATACTTTG 1500 

AACGTGATTC AATTCAATAC CTTCAAAAAT GTCGTGACGT TGAACGTGTC ATGATCGTTA 1560 

CTGACCATGC CATGGTAGAG CTTGGTTTCC TTGATCGTAT CATCGAACAA CTGGACCTTC 1620 

GTCGCAATAA GGTTGTTTAC CAAATCTTTG CGGATGTAGA ACCGGATCCA GATATCACAA 1680 

CTGTAAACCG TGGTACTGAG ATTATGCGTG CCTTCAAACC AGATACCATC ATCGCACTCG 1740 

GTGGTGGGTC TCCAATGGAT GCTGCCAAAG TAATGTGGCT CTTCTACGAG CAACCAGAAG 1800 

TGGACTTCCG TGACCTTGTC CAAAAATTCA TGGATATCCG TAAACGTGCC TTCAAGTTCC 1860 

CATTGCTTGG TAAGAAGACT AAATTCATCG CGATTCCAAC TACATCTGGT ACAGGATCTG 1920 

AAGTAACACC ATTTGCCGTT ATCTCTGATA AAGCAAACAA CCGTA-^ATAC CCAATCGCTG 1980

ACTACTCATT GACACCAACT GTGGCAATCG TAGATCCTGC TTTGGTATTG ACAGTTCCAG 2040 

GATTTGTTGC TGCTGATACT GGTATGGACG TATTGACTCA CGCGACAGAA GCATACGTAT 2100 

CACAAATGGC TAGTGACTAC ACTGATGGTT TAGCACTTCA AGCCATTAAA TTGGTCTTTG 2160 

AAAATCTCGA AAGCTCAGTT AAGAATGCAG ACTTCCACTC ACGTGAGAAA ATGCATAACG 2220 

CTTCAACAAT CGCTGGTATG GCCTTTGCCA ATGCCTTCCT AGGTATTTCT CACTCAATGG 2280 

CCCATAAGAT TGGTGCGCAA TTCCACACAA TCCACGGTCG TACAAATGCT ATCTTGCTTC 2340 

CATACGTTAT CCGTTACAAC GGTACACGTC CAGCTAAGAC AGCAACATGG CCTAAGTACA 2400 

ACTACTACCG TGCAGATGAA AAATACCAAG ATATCGCACG CATGCTTGGA CTTCCAGCTT 2460 

CTACTCCAGA AGAAGGGGTT GiUVTCTTACG CAAAAGCTGT CTACGAACTC GGTGAACGTA 2520 

TTGGGATCCA AATGAATTTT AGAGACCAAG GAATTGACGA AAAAGAATGG AAAGAACATT 2580 

CTCGTAAATT AGCCTTCCTG GCTTATGAAG ACCAATGTTC ACCAGCTAAC CCACGTCTTC 2640 

CAATGGTAGA CCATATGCAA GAAATCATCG AAGATGCATA CTATGGCTAC AAAGAAAGAC 2700 

CAGGACGCCG TAAATAATTG TTTATCAGTC TAGAAGCAAG ACAAAAACTC AATTTGAGGG 2760 

AAAGATCCAG TAATTTTTCT ATGATAAAAG GCATCCTATC AAGGTTTTTG AACACCTGAT 2820

AGGATGCCTT TTTATGATAT TGAGGCCTTT TTGCCCTTTT TGAAAAACTA GAATAGAAAC 2880

AAAATATATA ATAGATTGAA ACTAGAATAG TACATATCTG CTTCTAAAAC ATTGTTAGAA 2940

TTCGATTTGA CTGTCCTGAT CGATTTGTCC TGTTCTTATT TCATTTTGAT ATATAAAAAA 3000

TATAGTATAG TAGACTGAAT CTAAAATAGT ACGAAACAAT TGCTAAAACA TTTATAGAAA 3060

TTAATTTTAC TTTTCTGATA GAGTTGTTCA CATCTTATTT CAATTCACTA TAGTTTAATT 3120

TAAGAGTAGT ATTTACTAAG GCCCAATTAA AATCAAAGAG CAAACTAGAA AACGAGTGCC 3180

ATTCAGCTCA AAACACTGAT TTGAGATTGC AGATAAGACT AGCCCCCTCA TTAACAGATT 3240

TACGATAAAA CGATGACAAG GTGTGTTGCT TTTTGATTTC TAAAGAGTAT AATGATAGAT 3300

CTCTATAAAA TAAGTGCGAA GGAAATGAGC TTTTATAGTC CTTTCGTTTT AAAATACTAT 3360

CTCAGATATT CTTATATCGA CAAGAAGTTT TTGAGTCATT CCCTCATCAT ACATATTAAA 3420

TAAATAGTGG CTCATTCAAT TTTTCACTAG AATAATAAGC TAGTATAGTA AACTGAAATA 3480

AGATATAAAC AAATAAATTG GAGCTTAACA TCCATTTCCA GCAATTTTTT AGAAACTACA 3540

GTGGACTATT CTAGATTCAA CATATTATAA AAACTAGAGT AAAAGAAAAG GATTGGATCT 3600

TGTGTAATGC AGGATCCAAT CCTTTCAATC ATTTTGTCCA ACTTTTGGAG GTTCCTACAA 3660

TGTAGTCGTC ATTAATAAAG ACAGATGGGA ATGACAGTGT TCCTATTTAT TTTGATAGAG 3720

ATCGATGAA'i' TCTTTAGATA GCAACTGAAT AATCTCTGTT GAAGCCATTT GGTCTTCTGC 3780

ATGCATAAAT AGCAAGGAGA ATCCTATTTT TTCTCCAGTA GCTTCTTTTT GTATGAGATT 3840

AGAGTGAATC TTGTGCGCTT CTACTAAGGA GTCTTCCGCT TCTTCAACTT TAATTTTCGC 3900

TTCTTTTAAA TTTCCTGCCT TAGCTAGTTG GATGGCTTCA ATAAAGGATG ATTTGGCTGC 3960

TCCACTATTG GCAATGAGCT G7VAAACAGAT ATATTCCATT TCTTCTGTCA TCTTATTTCT 4020

CCTATCCATG CAAGTGCTTG TTCCAGAACT TTTGCTCCAT TCATCATTCC GTAATCCCGC 4080

ATATCAATGG TATCTACAGG GATATTTCCT GCAATTTCTT TCACAGCAAG TAACTCATAA 4140

CGAATTTGTG GCCCAATTAG AATGACATCT GCTTCATGGA TATTCTTTTT AGCTTCTGTC 4200

ATTGATTTTG CTTGGATAGA GATTTCAATC CCACGTTCAG TCGCACTTTG TTGCATTTTT 4260


TTAACAAGCA 


TACTTGTCGA 


CATTCCCGCA 


TTACATACTA 


ATAAAATTTG 


TTTCATAATC 


4320 


TTAACCTTCC 


ATTTCTTGTT 


CAACAACTTT 


GTCATTAACT 


TTGATAAATG 


GAATGTATAG 


4380 


AAGAACTCCA 


AGTGCAAAGA 


TGATGAATTG 


AACTAGAACT 


GCTCTCACGT 


CCCCTGCTGT 


4440 


TGCTAACCAT 


GCATTTAAGA 


ATACTGGTGT 


AGTCCAAGGA 


ACTTGTATAA 


ATGCAGGACT 


4500 


CATGAATTCT 


GTAACTGTTG 


CTAAGTAGCT 


GATTTIAAATA 


CCAAGGACTG 


GAACTGTGAT 


4560 
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Ov. 1 AA i. (jAAA 


TGTTATAAAC 


GATTGGGTAA 


CCGAATAATA 


CTGGTTCATT 


4620 


o A 1 A i 1 u AAu 


A rAC C AGGTC 


CAAAAGATAA 


TTTAGCCACG 


TTTTTAGAGA 


CAGCATTGCG 


4680 


ACTCACTAAG 


AATGTTGCTA 


TTAATAAACA 


TAATGTAGAT 


CCACTACCAC 


CCATTAAAGC 


4740 


GAATGTTTGT 


ATTTGTGATA 


GGTTGATGAT 


GTGTGGAATG 


GCTTGTCCAT 


TATTTGCTGC 


4800 


AGTGATGTTT 


TCAGTAATGT 


TAATTAATAG 


TAATGGTTCT 


AGGATGGCAC 


TGTAAATAAC 


4860 


TGCTTGGTGA 


ATACCAAATA 


GCCATAACAT 


ATTTCCTAAA 


GAGTAAATAA 


TAATGACCCC 


4920 


GATTAAGCTT 


GTACCAATAT 


GACGAATTGG 


TTCTTGAATA 


AAGATTGTAA 


TGATTGAGAT 


4980 


TAAGTTCATT 


CCAGTTATAT 


TGAATAATAA 


TGCTGAAACA 


ACCCCAAATA 


AGGAGATGAC 


5040 


GGTCATGACT 


GGAAGTAATA 


CGCTAAATGA 


TCTACTAACA 


GCTGGTGGAA 


TATTTTCACC 


5100 


AAGGTTCATT 


TGTAAAGCTT 


TAACGTTTGA 


TAATTCAATG 


AATAATTCTG 


TTGCAATAAT 


5160 


CGtACGATAA 


CCCCGGCGAA 


CATTGCGCCT 


GTACCTGTGT 


TGTTGAATGA 


AAGAACACCT 


5220 


GAAATGTTTA 


CCGCATCTTT 


TGCTCCGTCA 


GGAACTACAG 


AAACTGTATT 


TGGCATCATC 


5280 


ACAATTAAAG 


AAACTAATGA 


TAGCATTGAT 


GCTGCTAACG 


GGTTTTCGAA 


ATCTCTGTTT 


5340 


TTAGCTAAGA 


AATAACCAAC 


CATTACAGCA 


ATAATCATAC 


CTGAAATACT 


TAAAGTACCG 


5400 


TTTGCAATTG 


TTATTCCCCA 


ATATTGGAAT 


CTTGTTAATG 


TATCCCCTTG 


GAAAATCCAC 


5460 


TTAAATACCG 


TGTTGTTCAA 


AAGAACGATT 


AAACCTGCCA 


AAATATATAA 


TGGCATTACT 


5520 


GTTACGT^TG 


CATCTCTTAG 


GGTTTTTAAA 


TGAATTTGGT 


TCCCTAGTTT 


ACCAGCAAAG 


5580 


GATGGCAAAA 


AAATTTTTTT 


GGGGGGGGGG 


GTTATTAAAC 


CCCCCTTTTT 


AAAAAAAA 


5638 



(2) INFORMATION FOR SEQ ID NO: 218: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4745 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 218: 

CCGGAAGCTG TTGCCCTTGG AACTCCAAAT GAAGAAACAG CCTTTGTCTT GAACTATTTT 60 

GGTGTGGAAG CACCACGTGT TATCACTTCT GCCAAAGCAG AGGGGGCAGA GCAAGTTATC 120 

TTGACTGACC ACAATGAATT CCAACAATCT GTATCAGATA TCGCTGAAGT AGAAGTTTAC 180 

GGTGTTGTAG ACCACCACCG TGTGGCTAAC TTTGAAACTG CAAGCCCACT TTACATGCGT 240 

TTGGAGCCAG TTGGATCAGC GTCTTCAATC GTTTACCGTA TGTTCAAAGA ACATGGTGTA 300

GCTGTGCCTA AAGAGATTGC AGGTTTGATG CTTTCAGGTT TGATTTCAGA TACCCTTCTT 360

TTGAAATCAC CAACAACACA CCCAACAGAT AAAATCATTG CTCCTGAATT GGCTGAATTG 420 

GCTGGTGTGA ACTTGGAAGA ATATGGTTTG GCAATGTTGA AAGCTGGTAC CAACTTGGCT 480 

AGCAAATCTG CTGAAGAATT GATTGATATC GATGCTAAGA CTTTTGAACT CAACGGAAAT 540 

AATGTCCGTG TTGCCCAAGT GAACACAGTT GACATCGCTG AAGTTTTGGA ACGCCAAGCA 600 

GAAATTGAAG CTGCAATGCA AGCTGCCAAC GAATCAAACG GCTACTCTGA CTTTGTCTTG 660 

ATGATTACAG ATATCGTCAA CTCAAACTCA GAAATCTTGG CTCTTGGTGC CAATATGGAC 720 

AAGGTCGAAG CGGCTTTCAA CTTCAAACTT GAAAACAATC ATGCCTTCCT TGCTGGTGCC 780 

GTTTCACGTA AGAAACAAGT GGTACCTCAA TTGACTGAAA GCTTTAATGC GTAAGATTTT 840 

GGGTGTCAGC TCAAAATCGG AAAGTCTAGT TTGCCTTATA TCGCAAGGAG TTTCGGCTCC 900 

TTTTTTCTAG GAGTGAAGTA TGTTAGAAAA TGGCGATTTG ATTTTTGTGA GAGATGGGTC 960 

AGACATGGGA CAGGCCATCC AGACTTCCAC AGGTAACTAT AGCCATGTTG CCATTTATTT 1020 

GGATGGGATG ATTTATCATG CTAGTGGACA GGCTGGTGTT GTCTGTCAAG AACCGGCAGA 1080 

CTTCTTTGAG TCCAATCATT TATACGACCT CTATGTTTAC CCAGAAATGG ATATCCAGTC 1140 

GGTGAAGGAA AGAGCTTGCA AACATCTTGG AGCACCCTAC AATGCTTCTT TCTATCCAGA 1200 

TGCAGCTGGT TTTTACTGCT CCCAGTATAT AGCAGAAATC CTACCTATTT TTGAAACTAT 1260 

TCCTATGAAA TTTGGAGwTG GGGAGCAGGA GATTAGTGAT TTTTGGAGGG AGTATTACAT 1320 

AGAACTAGGT CTGCCTGTTC CTCTGAACCA AGCTGGTACC AATCCTAGTC AGTTGGCAGC 1380 

ATCGCCTCTG TTACAATGTA AAGAAAGGAA TCTTCATGAT TCAGATTTTT AATCCATCTC 1440 

GTTTGACGAG ACAGCCATTT TTGGAGAATT GATCCGCTAT CTGGATCAGT ATGAGGATGT 1500 

GATTCTACGG GAAATTAAGG CTCAATTTCC AGATGTTGCA GTTGATAAAC TCATGGAAGA 1560 

GTATATAAAG GCAGGCTTGA TTCTACGTGA AAATAAGCGC TATTACCTCA ATTTTCCTAC 1620 

GCTTGAATCA CTTGATAGTC TTGAACTGGA TCAAGAGATT TTTGTCAGAG AAGCTAGTCC 1680 

GGTCTATCAA GCCTTGTTGG AGCAGAGTTT TGAGACGGAA TTGCGCAATC AAATCAATGC 1740 

AGCTATTTTA GTTGAAAAGA CGGACTTTGC GCGCATTAAA ATGACCCTGT CCAATTATTT 1800 

TTACAAGGTC AAACAGCAGT ATCCTTTGAC AGAAAAACAG CAGGAGCTCT ATGACATTTT 1860 

AGGAGATGTT AATCCTGAGT ATGCCCTCAA GTATATGACG GCTTTTTTGT TGAAATTTCT 1920 

CAAAAAAGAC CAGCTTATGC AGAAATGCCG TGATATCTTT GTGGACAGTT AGGTTGTCTT 1980 

AGGCTATATT GTGCAAAATG AAGATGGAAA GTATGAGTTG GCTATCGATT TTGATAAGGA 2040 

GAGGTTAACT TTCTACTTAG CGTGATTTCT TGTTTCTGAG TACATTGTTT GACTTTCCTT 2100

AGTATTCGGT ATAAACTATA TGTAACCGGT AACACATATC GGAATAAACT AAAGGAGACA 2160

ATCATATGTC ACTTGAAAAC AAATTGGAAC AAGCAACAGG CGCTGTCAAA GAAGGTTTTG 2220

GTAAAGTTAC TGGAGACAGC AAGACAGAAC TTGAAGGAGC TGTTGAAAAA ACAGTTGCTA 2280 

AGGCAAAAGA CGTTGTAGAA GACGCAAAAG GTGCTGTAGA AGGTGCCGTT GAAGGTTTGA 2340

AAAACGTTTT TACTAAAGAA TAGGAAAAAA TCAAGGGTTT CATTTTCCCT TGATTTTTTC 2400 

TATTCTTATA AATAATTTTC TGCGACGGCT GTATCTCCTG GGTAGGATTC TTTCTTGCCC 2460 

TGGATGATTT GGTAACAATC GGCTCCCTTA CCCGCAATAA TAACTGCATC TAATTCGTGA 2520 

TTTGTGATAG CCATTGCCGC CTTGATGGCT TCTTGGCGAT CCGCAATCTT TTCAACAGGA 2580 

TGATTGATGT AGCTACTAAT TTCATCTGCA ATCGCCATTG GGTCTTCATA GTTAGGGTCA 2640 

TCAGCAGTCA GAAAGACTTG AATCTCAGGG TGTTGATTGA GGAGGAGGCC AAAGTCCTTA 2700 

CGACGACTTT CTCCCTTGTT TCCTGTTGAT CCCAGAACCA GAGCAATCTT TCCGGTTTGA 2760 

TGAGTTTCAA CCACATTGAT GAGTTTTTTC AGACTATCCC CATTGTGGGC ATAGTCGATG 2820 

AAGACCTTGG CTCCATTTTT CTGAGTGAGG ACTTCCATAC GACCAGGAAC GCGGGTTGCA 2880 

GCGATGCCTT TTTTGATGTC CTCAAGACTT GCTCCGAGAC GGAGACAAGC AAGTCCAGCA 2940 

GCAACTGCAT TTTCTTGGTT GAAGTTGCCA ATGAGTTGAA TATCATAATC TCCAGCGAGT 3000 

TTACCCGTAG CTGAAAAGCT AAAGGCTTTG CAATTCTCGA TTTGGTTATC AAATTGGCTA 3060 

CCATAGAAAT CATGGTCTTG ATCTTCAACC TGTTCTTTCA AGACTGAGAA GTGGTCCATG 3120 

TCACTGTTAA TGATGACTGC TCGGCTCTTT TCCATCAAGA GACGCTTGTG GTAGAAATAG 3180 

TCTTCAAAGC TAGGGTGTTC AATCGGGCCG ATATGGTCTG GGCTGATATT TAGGAAAACT 3240 

CCCACATCAA AGGTTAGACC ATAGACACGT TTGACCAGAT AGGCTTGACT GGAGACTTCC 3300

ATGATGAGGT GGGTACGGTC ATTTTGCACA GCCTGATTCA TCATGTCAAA GAGGTCAATA 3360 

CTCTCAGGGG TTGTCAACGC TGACTTAAAG AAAGTCTCGC CATCAAGAGT TGTGTTCATG 3420 

GTCGACAACA TAGCAGGTCT ATGCCCTTGA GATAAGATGT TATAGGCGAA ATAGGCTGCT 3480 

GTTGTCTTAC CCTTAGTACC AGTAAAGGCA AGGAGTTTGA GTTTTTCCTG TGGATTACCA 3540

TAGAACTCCA TGGCAATCAA ACTCATGGCT TTCTTTATAT CGTTCACAAT GATGACAGGG 3600 

ATACCGACTT CGTAGTCCTT TTCAGCTACA TACCAAGCTA ATCCTTGTGT TATAGCAGAA 3660 

AGAAGGTATT CTTTTTTAAA GGCAGCGCCT TTTGCGAAAA AAAGAGTGTC TTCTGTTACT 3720 

TTTCGGCTGT CGTAGCTGAT GCTATCAAAA ATAACTTTGC TGTAGTTGTA GTGGTAATGA 3780 

CCTTGGTCAA TAATTTCGCG AAAAAGGCCA TCTTTCTTTA AAATATCTAA TACGGTTTCA 3840

ATCTTAATCA TACTTTCTAT TGTAAACCGA AAGTCGTAAA TTTACAAGTA ACAAGGAAAA 3900

GTTTATAATG GAAGATAAGG AGTTTTTCCT AGTTATCAAA ATTGAATGAG GAATCTATGT 3960 

CGCACGAAAA CAATCACCAG CAGGCCCAGA TGTTACGGGG GACTGCTTGG CTAACGGCTA 4020 

GTAACTTTAT CAGTCGCCTA CTCGGGGCTG TTTACATTAT CCCTTGGTAC ATCTGGATGG 4080 

GGGCTTATGC AGCTAAGGCA AATGGTCTCT TTACCATGGG TTACAATATC TATGCTTGGT 4140 

TCTTGTTGGT TTCAACAGCG GGGATTCCAG TTGCGGTGGC CAAGCAAGTT GCCAAGTATA 4200 

ATACCATGCG AGAAGAAGAG CATAGCTTTG CCCTGATTCG GAGCTTCTTA GGCTTTATGA 4260

CAGGACTAGG CCTGGTTTTT GCTTTAGTCT TGTATGTCTT TGCTCCTTGG CTAGCAGACT 4320

TGTCTGGCGT GGGCAAAGAC TTGATCCCAA TCATGCAAAG CTTGGCTTGG GGAGTCTTGA 4380

TTTTCCCGTC TATGAGTGTT ATCCGAGGAT TTTTCCAAGG GATGAATAAC CTCAAACCCT 4440 

ATGCCATGAG CCAAATTGCT GAGCAGGTCA TTCGTGTTAT CTGGATGCTC CTAGCAACCT 4500 

TTATCATTAT GAAGCTCGGT TCAGGAGATT ATCTAGCAGC CGTTACCCAA TCAACCTTTG 4560 

CTGCCTTTGT CGGTATGGTA GCCAGTTTTG CAGTCTTGAT TTATTTCCTT GCCCAAGAAG 4620

GTTCACTCAA AAGAATCTTT GAAACAGGAG ATAAGATTAA CAGTAAGCGT CTCTTGGTTG 4680

ATACCATTAA GGAAGCCATT CCTTTTATCC TGACAGGGTC TGCCATCCAG CTCTTCCAGA 4740 

TTTTG 4745 



(2) INFORMATION FOR SEQ ID NO: 219: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1900 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 219: 

CCTGATTGAC CTTATAATAA GGAACAAAAC ACAATGCACT ACCTTTTCAA CAAAAGAGTT 60 

GCTGCTTGAT TAAAACCATC ACACCAGTTA TACCATTTTG CTTCATACCC ATCTTGAGCT 120 

AGGATACGAT CTTCTAAATC AAAAACAGAG TAAATCTTTC TTTCCTCGCA AGCTTGCGCA 180 

TAGAGATGAT ATAGTTCATC ACCACCATCT CTATCCCACT CAGCAGAAAT CGTATCCCGA 240 

CCTGCCAATA AAGCCTGATA AGCCCTGTGA TGCCCATCTG TAATCAGCAA ACAATCTCCA 300 

AAGGCAAGAA TACTGATTGG ATCGACTTGG ATTGTTTCTG CCGACTGGTA AAGCATCTGA 360 

ATATCTTGCA ACTTCTTTTC TGATAAATAT AGTTGAGTCA GATGAAGATC TGCTATATTG 420 

ACTTTCATTT CTTTCTCCTC AAGGGAATTC GATACTCACT TCTGTTTGCC TTTAAATCGC 480

CATTGG7VAGC GGAgCTTGTC ATAAAAGGGA AACTCGATAA ACAGGACTCC CAAGCCCACA 540

CAGAGACTGG CAAGGACGTC TGATGGGTAA TGAACTCCCA GATAGACTCT TGATACCAGC 600

ACACTGACTA GGTAGAGGCC AAGGACGATT TGTACGATTT TTCTCCAGAC CTGATCTTTA 660

ATCCGCTGAC TAAGAATAAC AATCAAAGTC CCTACCATCA GCGTTACAGC TAGAGAATGC 720

CCACTTGGGA AGGAAAATCC CTTCTCCTCC ACCAGATGTA AAATAGCTGG TCGTGGGCGC 780

TGGTAGATAT TTTTAAAGGT CACGATTAAA AGACCTGCCA AAGCCAGATT TCCCAGCATG 840

AAGAAACTTT CTATCTTCCA TCGCTTACGA TAAAAGACAA AAGCTGTAAT GACAACCCAA 900

GTGATAATCA CTGGGATATC AATCAGACGT GTGAGGGCTC GAAAAAGAAT AGTCAAATAA 960

TCTGGTAAGT CTCCTCGAAT GGCAGTCTGA ATCGATTGGT CAAAATTGAC CAACATTTCA 1020

GGGTAAAATT TGACCATGTA GCCAAGAATA ACGAAAAGTA AAAGGGCAAA ACTGCCCTTC 1080

ATTAAAAATG TTTGTTTATC TCTCATAATG TTTTAAGGTT GGTTTCAAGA GAACATACAA 1140

CAACCAGAAT GAAACGGAAA AGATAACACC TTCAATCAAG TTAAAAGGTA ATACCATGGT 1200

CATTAGGTAG TTGGAAAGTC CCAAAATTTT TCCAATATCA AAGTTAGCAA ACTTAGCGTA 1260

CAAAGGAACA GCATAAACAT AGTTGAGAAC CAACATGGCC AAGGTTAAAC CAATAGTTCC 1320

AGCTAGAGAG CCTAGTAGGA AACGAAGGGT TGTCCGTTCC TTTTTCCAAA TCAAAGCAAA 1380

TACGATGACA AAAACTCCCA AAGCTACGAT ATTCATCGGC iia_a CQ \ATGT AAGTATTC AC 1440

TCCTTGGCTG TTAAGAAGCA ATTTCAAGAG TGAGCGAAGC AAGAGCACTC CTAGAGmCsC 1500

AGGCAAATCC ATGACCACCA GACCCACAAG GACTGGCAAG ATACTAAATT CGATCTTGAG 1560

GAAAGATGCC GCTGGTAAAA GCGGAAAGTC AAAGTACATC AGCACAAATG AGATGGCTGA 1620

TAGAATTGCA ATGGTCGAAA GTCGACGTGT GTTTGTCATA ACAGGTTCCT CCAATTTTCT 1680

ATAAAATCAG AAGAAGTTGG AAAGGATTCC TCTATCTATT CTCACTTTTT ATATCCCAAA 1740

AGTTCCCTCT TACTCTATTA AAGAAAAACA AAGCAAGTGG TTACAATCCG GCTATAAATC 1800

TATCAAAACA GACAAGGCTA TTCTTTCGTC TTCTCCCATC CAGACTATAC TGTCGGTTGT 1860

GGAATCTCAC CACATCACGT TGCGCTCACG GACTTCTTTA 1900

(2) INFORMATION FOR SEQ ID NO: 220:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4692 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 220:



GGTTTTCCAG CAGGAGCTTC TCCTTTATCA GAATGACCAT CCCATCTGCT CACGATAGAT 60

GAATAATGAT ATTTTTTACC ATGATAGTAA TTTGAAAAAG CCTAACCACC TCCTGAACCT 120

TCTCCATATG TCCATACTCC TCCATCTGGA TATTATACAG CAGCTGATGC AGCTCCCAAT 180

AATGTAAAAC TTGAAATAAG AGCTAGAGCA AGTAATCTAT GTTTTTTCGT TTTCATTTTA 240

TTTTTTCTTT CAAAAAAAGC ACACCTTGAG CAACAATGCA ACAAAATAAA TCCTCCTCTC 300

TCTTTTATTG AAACCGCTTT CTTATGTGAT AAGAATAACT TTTTTATtAT TTGTTGTCAA 360

GGAAAAAATC GAATTTTTTA GATATTTTAC TATATTACCT CTGTGAATAA TATTATATAG 420

TAGTTTTATT TCAAAATAAT ATGCAACCAG TACTAACCAA ATATAAAATA GATGCCATTA 480

ACGAATTTTA TTCAAGTTTT TCCCATTCAT ACTATACAAG TAAAAGAGAT GGTGTTAACT 540

AAAAAGCAAT TCAAACTATT GTAAAATTCC TAGCAAAAAG AGAGCCGAAA CTCTCTTTTT 600

TATCTTCTTT TACTTTTTTT GACTGGCATG AGTGTGATGT CTCTAACACT AAAGTAAGCT 660

AGGATCAACA TGGCTATTGC TAGGAATATT TCTGTTGGTA ATTGAAAAAT TTTCAGAAAA 720

GATAGAACCA ATAAAATCAA GAGTGCCACT AAAATACATA CCATAGCGAC GATATTGACA 780

GTCCCTTTAA TGCTTTCTGG TGTCGCAAAT ACATAGAGTA GGAGCAGTAA AATTCCTAGG 840

ACTAAATAGA CCATCTTTCT CTCTTTCTAG CTCTTATTCA GCTGATTTTT TCTTCTTGTT 900

AGCxT'TCu'CA CGCTCTGCTT TGTTAAGGAT TTGTTTACGC AAACGGATAG ACTCAGGCGT 960

TACTTCCATG TACTCATCGT CGTTCAAGAA CTCAAGAGAC TCTTCAAGTG TCAAGATACG 1020

AGGCGTCTTG ATAACAGCTG TTTGGTCCTT AGTAGCTGAA CGAACGTTGG TCATTTGTTT 1080

TGCCTTCGTG ATGTTAACTG TCAAGTCATT TTCACGAGAG TTTTCACCGA TGATCATTCC 1140

TTCATAAACC TCAGTACCTG GGTTGACAAA GATCGTACCA CGTTCTTCGA TAGACATGAT 1200

TGAGTAAGTT GTAGCCTTAC CAGCATCGAT AGAAACAAGG GCACCACGGT GACGTCCACC 1260

AATTTCCCCT GGAATCAATG GCAAGTATTG GTCGAAGGTA TGGTTCATGA TACCGTAACC 1320

ACGAGTCATT GATAAGAACT CAGTTGAGTA TCCAATCAAA CCACGCGCTG GAACAAGGAA 1380

GACCAAACGA GTTTGACCAT TACCAGTTGA AATCATATCC AACATTTCAC CTTTACGTTC 1440

AGAAAGGCTT TGGATAACAG ACCCTTGGTA TTCTTCTGGA GTGTCGATTT GTACACGTTC 1500

AAATGGTTCA CATTTAATAC CGTCGATTTC TTTTACGATA ACTTCTGGAC GAGATACTTG 1560

AAGTTCATAG CCCTCACGAC GCATTGTTTC GATAAGGATT GACAAGTGCA ATTCTCCACG 1620

TCCTGAAACA GTCCATTTAT CTGGTGAATC AGTTGGGTCA ACACGAAGGG AAACGTCTGT 1680

TTGCAATTCT GCCTGCAAGC GTTCTTCCAC CTTACGAGAA GTTACCCATT TACCTTCTTT 1740

ACCAGCAAAT GGTGAGTTGT TGACCAAGAA AGTCATTTGA AGAGTTGGCT CATCGATGTG 1800

TAGGATTGGA AGAGCTTCTA CTGCATCTGT CGGAGTGATG GTTTCACCGA CAAAGATGTC 1860 

TTCCATACCT GAAACGGCAA TCAAGTCACC CGCTTTGGCT TCTTGGATTT CACGACGTTC 1920 

CAAACCAAAG AAACCGAAGA GTTTTGTAAC ACGGAAGTTT TTAGTTGTAC CGTCAAGTTT 1980 

AGAAAGGGTA ACTTGGTCCC CAACCTTAAC TGTACCACGG AAGACACGAC CGATACCGAT 2040 

ACGTCCAACG AAGTCATTGT AGTCCAAAAG TGACACTTGG AACTGCAAAG GCTCATCTGA 2100 

GTTATCTACT GGAGCTGGGA TATGGTCGAT AATCGTGTCA AAGATTGGTG CCATAGTCGC 2160 

TTCTTGGTCA GCTGGATCAT CTGACAATGA AGAAGTTCCG TTGATCGCTG AAGCATAAAC 2220 

CACTGGGAAA TCAAGCTGGT CGTCATCTGC ACCAAGCTCG ATGAAAAGTT CCAAGACTTC 2280 

ATCCACTACT TCTGCTGGAC GAGCTGATGG CTTATCGATT TTGTTAACAA CCACGATTGG 2340 

GACAAGGTCT TGTTCCAAGG CTTTTTTCAA TACGAAACGA GTTTGTGGCA TGGTTCCTTC 2400 

ATAGGCATCT ACGACCAAGA CAACACCGTC AACCATTTTC ATGATACGCT CAACTTCTCC 2460 

ACCAAAGTCC GCGTGTCCTG GTGTGTCCAT AATGTTGATA CGAGTTCCGT TGTAAGCAAC 2520 

GGCAGTATTT TTAGCAAGGA TGGTAATTCC ACGCTCTTTT TCGATATCGT TTGAGTCCAT 2580 

AGCACGCTCT GCCAATTCAG TCCGTGCATC AAGCGTTTCT GATTGTTTCA ATAATTCGTC 2640 

AACCAGGGTT GTTTTACCGT GGTCAACGTG GGCGATAATC GCAATGTTAC GGATATCTTC 2700

TCTTAATTTT GTCATGATTT CCTCTATAAT ATTCAAAATT TATTTTCTAA CTGAACGATT 2760

ATACCATAAT TTCAAATAAA TAACATAACT CAAGCAAGTG TAAATGTTTT CACTCTGCTT 2820 

TTCTTTTCAC GTCAAGCCTT TTCAAAGCGA GCGACTTATG ATAAGATAGG CACAGTATGC 2880 

GTTTAGATAA TTTATTAGCT CAAGAAAAAA TCAGCCGAAA GGCCATGAAG CAAGCACTCC 2940 

TCAGAGGGGA AATTCTAGTC GATGGTTGCC CAGCCCGCTC CCTAGCTCAA AATATCGATA 3000 

CAGGACTACA AGAACTCCTT TTTCAGGATC GAATCATTCA AGGCTATGAA CACACCTATC 3060 

TTATGCTTCA TAAACCTGCT GGTGCCGTTA CAGCCAACAA AGACAAGGAA CTTCCGACCG 3120 

TCATGGACCT GCTTCCATCT AACATCCAGT CTGACAAGCT CTATGCCGTT GGCCGACTGG 3180 

ACCGAGATAC AACGGGACTC CTCCTCTTGA CCGATAACGG TCCCTTGGGC TTTCAGCTCC 3240 

TCCATCCCCA ATATCATGTC GATAAGACTT ACCAAGTTGA GGTTAATGGA CTTCTAACAC 3300 

CTGACCATAT CCAAACCTTT CAAAAAGGAA TTGTCTTTTT AGATGACACT GTCTGTAAAC 3360

CCGCAAAACT AGAGATTCTA TCTGCAAGTC sCTCCCTCAG TCAAGCCTCT ATCACCATTT 3420 

CAGAAGGAAA ATTTCATCAA ATCAAGAAAA TGTTCCTCTC GGTTGGTGTT AAGGTGACTA 3480

GCCTCAAAAG AATCCAATTT GGGGACTTCA CATTGAACCC AGATTTAGCA GAAGGTAACT 3540

ACCGCCCTTT GAACCAAAAA GAGTTACAAA TCATTAAAAA CTATTTAGAG ATGAGTCGAT 3600 

AAAACAAAAA AAGCTTTAAA ACTAAAGCTT TTTTCTTTTA TTTACCGAAA AATTAAGGCG 3660 

ATTGCTACAA TCCAGTTAAC TACAGAAATC ACAATTCCTA AGATATTAAG AATCTTTTCT 3720 

ATTTTATAGT CTAATTGTGA CTCTTTTTGG TATGAAATAG CCAAGACCAA TCCTATGATA 3780 

CCCAAAATCA GGCCTACAAT TGGAAATAAC AAACCAAGAA TAATCGACAA GATACCCACA 3840

AAAAGTGGAT TTTTCTTCTT TTCTTTTATG TTCTAAGAAC TCCTTAAATT TTATACAAAT 3900 

TAATTATACT ATAAAACAAT AGCTTCATCC TATCATTCGA CTAATTTGGA AATAAGGTTA 3960 

GCTAGTCTTC ACTTTCCCTT TCCAAGAATC CAAGCCATAA GAAAGGATAT AAATCTCAGA 4020 

AAAACCTTGT TTTTTCAAGT AAAGAGCTGC ATTTGTAACT CGTTGCGCAC GTTGGTTTTC 4080 

GTAGAGAAGG ACAGGTTTAT CTTTACGAAG GGCTGCAAGA CTAGTTTTCA ACTGACTTGA 4140 

AGGAATATTG CGTGCACCAA GGATATGTTT TCTGTGGAAT TCTGCTGGGT CGCGCAAATC 4200 

AATCAATTGA CCCGTACGAA TCAAGGCTTC AAACTCCTCA TTGTCCACAA TTTTAGCCGC 4260

ACGGCGAATA CGAAGATAGT TAAAGCCCAT CCACGCCAAC ATTGCTAGTA TAAGTGCCCA 4320 

CAAAATCCAA GTAACCATTA GTTCTTTTCT CCATTTTTCT CAATATAATC CAATTCTACC 4380 

TTGTGCTCTC TGCGAAGAAC TGCTTCTGCC TCTAGATAGT CTAATTTATC CATCAACCCT 4440

GCATCGTAAA TCCGAGATAG TTCCAACTTC ATCAGTTCAA TATCATATAA GCGTTTTCCC 4500

ATGTAAACAA TAATACCAAA TCGTTTGAGG AATTGCTGCA CATCATAGAA TGTTTTCATA 4560 

AGACTCATTC TAGCAAAATT TTGTGTTTTT TTCAAGAAGA GACTCACACA ATGCTCCTTA 4620 

TTTTCCTATC TTCTTTAGCG ATTCTAAGGC AAGTATGGTA CAATAAAAAC ATGGGGATTC 4680

AACAATTACA TT 4692 
(2) INFORMATION FOR SEQ ID NO: 221:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 706 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 221: 

GCTAAAAAGC TGATAATCTT CGACTCCTGT ATATGATGTG TCTTTTCATG TAAGACACGC 60 

GCCGCCAGAA TCATGGCAAG AGCTGCAAGA CTGGCAAGTA AGAAGCCGAT AAGATAGGCA 120 

AAAAGATAAG TGAATTTGAC AAAGAAAGTC AAAAGAACTA GGAAACCAAA GCCTCCTCCA 180

AAAACTACCA AAGTCTTTCG TAAATCCCAG ATTTTATCCA ACTGCTTGAC GAGGGAAGTC 240

GTCTGACGAA CGCCTACAAT AGTTGCTAAC ATACTTCCTA AAAAGAATGG ATAGACATGA 300 

GTTAAACTGG AGAAATAAAC AGAGGAATAA GAGGTCACTA GAAAACTACC AATAAACATG 360 

GAGAAGAAAC TGATCAAGAA GGCAACAGCA GATAAGAGAA AGACCATCCC CTTCAACTGA 420 

CCATTTGATT TAGCTTGTTT GGATAAGAAC CAAACTGCCA ATCCCCAAAG AATATAGTAG 480 

TGAACCTCAA CTGCCAAACT CCAATTATGA ACAAACAAAT GAGGAATGAA CTGAGATTCA 540 

TAACTCCCAC CTGTTAGGAG TTCATAGAAG TTGGTCATAA AGCCTAAGAC GCCCGCAATC 600 

TGGCCACCAA TTCCAGCAAC ATAGTCTTGG CGAACCAAGA AAGTAAAAGG CATGGTCACC 660 

AAGACCATCA AAACCACAGG TGGCACAATC TCGATAAAAG CGTCTT 706 



(2) INFORMATION FOR SEQ ID NO: 222: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3236 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 222: 



CAGCTGATGG GCAATATCAG TCATAGAAAT TTTTTCAATT AACTTTTGAG CAATTTTTTG 60

GTTGATGATA CGAGGGATTT GGTGATTTTT CTTTACCAGG GGAGTCTCAG CAACCATCAT 120

TTTTGAACAG TGATAGCACT TGAAACGGCG TTTTCTAAGG AGAATTCTAG AAGGCATACC 180

AGTTGTTTCG AGGTAAGGGA TCTTAGACGG TTTTTGAAAG TCATATTTCT TCATTAGACT 240

TCCACAATCA GGGCAAGATG GAGCCTCATA ATCCAGCTTA GCGATAATTT CTTTGTGGGT 300

ATCCATATTG ATGATATCTA GAATCTTGAT GTTTGGGTCT TTAATATCGA GCAGTTTTGT 360

GATAAAATGT AATTGTTCCA TATGATTCTT TCTAATGAGT TGTTTTGTCG CTTTTCATTA 420

TAGGTCATAT GGGACTTTTT TTCTACACAA AAATAAGCTC CATAATATCC ATAGGGGATT 480

TACCCACTAC AAATATTATA GAGCCCGAAA ATATGGGAAA ACTGATCCTT GTTTCTGCTT 540

TTGTCTATAG AAGAATAATA AAGATTATCT TCTTCAAATT CTCCGATATT CTCTAAAGTT 600

TTGTGCAAGT TGCACAGAAC TTGTTTATTT TTTTGGTCAT CTTGCCATAG AAATATAAAG 660

CGTTTTCATA TATAATATAA TTATCAAAAG ACAAAAGGAG TTCACCTCAT GGTAGAATTG 720

AATCTTAAAA ATATTTACAA AAAATATCCA AACAGCGAAC ACTATTCAGT TGAAGATTTC 780

AACTTGAACA TCAAAGATAA AGAATTTATC GTTTTCGTAG GACCTTCAGG ATGTGGTAAA 840

TCAACTACAC TCCGTATGAT TGCTGGTCTT GAAGACATTA CAGAAGGTAC TGCATCTATC 900

GATGGCGTAG TTGTCAACGA CGTAGCTCCA AAAGACCGTG ATATCGCCAT GGTATTCCAA 960 

AACTACGCTC TTTACCCACA CATGACTGTT TATGACAACA TGGCTTTCGG TTTGAAATTG 1020 

CGTAAATACA GCAAAGAAGA CATTAACAAA CGTGTTCAAG AAGCAGCTGA AATACTTGGA 1080 

TTGAAAGAAT TCTTGGAACG TAAACCAGCT GACCTTTCAG GTGGTCAACG TCAACGTGTT 1140 

GCCATGGGGC GTGCGATTGT CCGTGATGCG AAAGTATTCT TGATGGACGA ACCTTTGTCA 1200 

AACTTGGATG CCAAACTTCG TGTATCAATG CGTGCTGAAA TCGCTAAAAT TCACCGTCGT 1260

ATCGGAGCTA CAACTATCTA TGTAACTCAC GACCAAACAG AAGCGATGAC ACTTGCAGAC 1320 

CGTATCGTTA TTATGTCAGC TACT7VAGAAC CCTGCTGGTA CAGGTACTAT CGGACGTGTA 1380 

GAACAAATCG GTACTCCTCA AGAAGTTTAC AAAAATCCAG TTAACAAATT CGTTGCAGGA 1440 

TTCATCGGAA GCCCAGCTAT GAACTTCATC ACCGTGAAAT TGGTTGGTAG CGAAATTGTT 1500 

TCTGACGGTT TCCGTTTGAA AGTGCCAGAA GGAGCATTGA AAGTTCTTCG TGAAAAAGGC 1560 

TACGAAGGAA AAGAATTGAT CTTTGGTATC CGTCCAGAAG ACGTGAATGC AGAACCTGCT 1620

TTCCTTGAAA CATTCCCAGA CTGTGTTGTA AAAGCGACTA TCTCTGTATC AGAACTGCTT 1680 

GGTTCAGAAT CTCACCTTTA CTGTCAAGTT GGTAAAGACG AGTTTGTTGC AAAAGTTGAT 1740 

GCTCGTGACT ACTTGCAAAC AGGTGCAACA GTTGAGCTTG GATTTGACTT GAACAAAGCA 1800 

eAei-l'U'lTCG ATGTAGAAAC TGAAAAAACA ATCTACTAAA ATAAATAAAA TTCAAAGCAC 1860 

TACAAGAAAA GATATCTCTT TATCAATTGT AGTGGAGAGA TATCAGTTAA TCTAGGGAGA 1920

GAAACAAAAT GCTTCTCTCC TTTTTGCTAG AGAAGTCATA TTATGCATCT ATATTGTGAT 1980 

GCTCTTTAAT ACTCTTCGAA AATCTCTTCA AACCACGTCA ACGTCGCCTT GCCGTACGTA 2040 

TGATTACTGA TTTCGTCAGT TTTATCTGCA ACCTCAAAGA TGTACTTTGA GCAGCTTACG 2100 

GCTAGTTTCC TAGTTTGCTC TTTGATTTCC ATTGAGTATT ATTTGTGGGT ACCATCTACA 2160 

AGTGAAGCTA TATGCGTAAA CTACGTGAGC AATTGAATTC GAACTAGAGA GGTAATAATA 2220 

AATTTATGCT ATAGTTATGG TGACTTGTAT GCTTTTGATT CTAGTTTATC AAATAATAGA 2280 

TTAGAATTGT CAGATAATAT CATTTTGTGT TATAATGAAG AAAAAACAGA GGTGTTCAAA 2340 

TGTCAGAAGC AGGTCATAAG TTTTTAGCAA AATTGGGGAA AAAACGCTTA CGTCCAGGTG 2400 

GAAAGCGTGC CACAGATTGG TTAATTGCAG AAGGAGGATT TTCAAAAGAA AAGAGAATAC 2460 

TAGAGGTTGC GTGTAATAGG GGAACTACAG CAATTGAGTT GGCACAGCGT TTTGGTTGCA 2520 

AGATAACTGC TGTTGATATG GATGCTCAAG CTTTAGAAGT GGCTAAAAAA TCTGCTGGAA 2580 

CGGCAGGTGT TGCTCATTTA ATCAGTTTTG AAAGAGCAAA TGCAATGAAA CTTCCTTATC 2640

AAGATGCTAG TTTTGATATT GTTATAAATG AAGCTATGCT GACTATGCAA GCCGATCAAG 2700

CTAAGAAAAA ATGTGTAATG GAATATCTAA GGGTATTAAA ACCTGGAGGT CTTCTCTTGA 2760

CACATGATGT GCTTCTTAAG GAAGCTAAAG AGTCTATCAG ACAGGAATTA TCACAAGCAA 2820

TTCATGTAAA TGTAGGTCCT TTAACTCAAG ATGGTTGGGA ACAGGTGATG ATAGAATCAG 2880

GTTATTGTGA TGTGAAAGCA TTGACTGGTG AAATGACATT AATGAAATTA TCGGGTATGA 2940

TTTATGACGA AGGTTTGCTA GGAACTTTGA AAATTTGTGT AAATGCTTGT AAAAAGGAGA 3000

ATAGAAAGCA GTTTTTAACT ATGTATAAAA TGTTTGCTAA GAATAAACAG AAATTGGGCT 3060

TTATTGCGAT GGCTAGTTAT AAATCGTCAA AACGTTAGAT AATTATTGAA GTTAACTTTT 3120

CCTTTTTTCT TTCTTAAAAA ATATGCTATA ATAGAGAGTA AAAAACTTTG AAAGAAAGAA 3180

AAAGATGAAT TTAAAAGATT ACATTGCAAC AATTGAAAAT TATCCAAAGG GTACCG 3236



(2) INFORMATION FOR SEQ ID NO: 223: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2885 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 223:

CCTGACTTTT CAAATTGGTT AGTTTGCCAC ACTTGGTTTA TATGGTCGTG GAAAGCATGG 60 

CTATTACTTC TCAAAGGGCG ATTTCTCACC CCATGAAAAG TGTCTATTTT TGTTTAGGTT 120 

TGTAAGTTAA TTCATTGTCA CATATTACTC TTTAACTGAT TGAGTGAGTA CCGCTTATAT 180

TTGATGCCTIA ACGCCTTAAA AGTGTTACCC TCAAGTCCTT TTAGAATACG GCTATAATTC 240 

CGCTCATTGT AAACTATCTT AAGCTCATCA CTATCTAGGT TGGTATTAAA AATGGTATTT 300 

TCACGATTGT TTAGCACGTC AAAGAGTAAA TCCTGCTCCC AGTCACTCTT AGGCTTAATA 360 

ACAGCATTTT TTGCTCCTAA ATCATCAATA ATTAAGTAAT CAACAGACTT CATGAGTTCA 420 

GTAGCTTCAA ACTCTGTAAG TGTTGCACCT TTACCATAAT TCCACCCCTC TTTAATTTGT 480 

TTGATCATTT CGGTTAGGCT TACAAAAAGC ACACTCTTAG GTTCTCCTTT TGTCTTATAC 540 

CCCTCATTTA TACCTTTGGC AATAGCAACT GATAAAAGTG TTTTTCCAAT CCCTGTACCT 600 

CCTGTGATAA GCGTATTTCC CCTCATGCCA TCAAGATATT TTTGTACCTG ACCTTTTGCA 660 

AATTCTAAAA ATCGCTTTTC TTCTGATGTT ACAGCATTAA AATCATCAAA AGTTTTAGTT 720 

TTAAACTCAT CTGCTACATA GCTCTTATTG CTCATCAACA CATTATAAGT TTGCATATAT 780

AGTTTAGCAT TCAAATTATC AGCAATCGCA TCTTCTTCAT CTTGCTTTTT CTGTTCTTCT 840

TGGCATTGTT CACAATAGGG TGGGATACAG CGAACTTCTT TTATTGCCTC TCCGTTCTCA 900 

TTCCACCCCA CTACTACATG TCTTTCTCCT TTGATTTGTG TTAGCTGTAT TTCATGCTTA 960 

GGACACAATT CGTCTAGTTT AAATGTCTCA ATATTTCCTA AACTAGATTG TAATGATTTC 1020 

ATTTTCTGAC CTCCTAAAAT GGTTTTTCTT GTGTTGGTAT CCAATCTTCA TAGCTGGTAG 1080 

GCTCTAGTTG ATTGGTTTGC TGTTTTTTAG CCTCACGCGC TGCCCTGCTA TTTCTAACAA 1140 

GTTCCACCGT CAATAAATTG TCCTGTTTCC AACGGTTAAG GATTACCTTG ATGTATGCAA 1200 

AGTTTGCTTT ACCCTGACTG ACAGCCTCTT TTAACGCCTC ATGGATAAGC TCTGGGCTAA 1260 

AATCTTCTAG CATATACTGC AATTCTTGAA TCTGTAACGG TGACAATGCT TTACCTGTCT 1320 

CAGCTCGCTT CATATTCAAC AAGTCGTCTA TTTCCACACT GGTTACTTTT TTATTTACAA 1380 

AATCAGAAAT CAGTTGAAAA ATGTTTGGAC TTTGTAGCTG GATTTCAGCC ATTACCTCAT 1440 

CAAATTCTGC TTGTGTCATG TTGTCTAAAT CTAGTGTCAT TGCATTGCCT CCTCAAACTT 1500 

CTCTATAAGA CAACTTTTAT TTGCTTTCTG AGTTCCATTT TTAGAGTTAA AAAGAATATC 1560 

TTTTAAGGTT ACAGTAGCCT CTAAATACTC CTTTTCAGCA TGCTCTATAT ACGCCTGTTG 1620 

CTCTGCTTCG TTCTCAAAAA AGTGCTTAGC TTGGCGTTTA AAGAATGCTT TTCGCATAGC 1680 

GTCCATTTCA AAAATACCAG GGGCGAAAAA CATTCCCGTA GTGCTTTTAG AGACCGCTTC 1740 

GATTTTATGG CTTTCATTCA ATTCAGGAAG TTCAATCCAA AGTAAACGGG ACAACTCATC 1800 

TTTGATGGAT TTTGTCTGAC TTTCCAATAA AGAAAGGATT CTTAGGCCAT TTTCTTCGCT 1860 

AATTTCTCGC ATTTCTGCGC TAATTCTGTC TATACGTCTA GTTAAATTCT CATATGTTGT 1920 

TTGTGTCATG TTTTTACCTC TGTTTCTTTG TTGGTGTGAT TTTTTAGCTT ATTTTTTTAC 1980 

TTCTAAACAT CATTGTCTTA ATTTCCTGAT AACTCATTTT CAATTCAATC ATAGCTATTG 2040 

CCATATCCTC AAATGCCTGG TACTGCTCCA ACTCCTCACT AGTCAAGCTA TCGATACCGT 2100 

TATAGCCCCC ACGCTCTTCT CTTAACTGCT TAGCGTTCAT GTCTGTTACT GCCTTTACTA 2160 

GCAAGTTGTT CATGGTGCTA TGCGCGTGCT TTGGTGCATT AGGCCATGTT TCTATACTGT 2220 

CATGCAAGGT TTTTCTTTTC GGTTTTTCTA GCGCCCTCTG CAGACGAATT TCAGAAAGTT 2280 

CCTCACGCAT TTCAAAGAAT GCTTTGACTA GGTTTAGTTT GAATTGCCGT ACTGTTTCGG 2340 

TATTCTTTAA ATAAGTGATC AGAAAAGTAG CCTGTTGCTC GTTCAGAATA TAGGATTTTT 2400 

TAGGTTGTCC TCTAGTATCT AATTTATGGA TTTTAAATCC AAGTATTCCC AACTCTTCAA 2460 

AGTCAGCCTT ATTTTCTCTT ATTAAGCGCG TGATAGTGTG GTGTTGTACT TCAGCACATT 2520 

CAGCGATGAT CTCGCTTGTG GTGTACGGCT CTTTCTTACC GTCCATGTAA ACTAGTTCCA 2580

TTACGGTTCT ACCTCCTGTA TAAATCTGGT TAGCTTACTT TTTAATTGCC TCCTCTAGCC 2640

TCTTTTTTAG CCTCTAAAAC GGCTTTGGCT AGTGGTTAAT ATTATTTACC ACTTGTCTCT 2700 

ATAAACGTGT TAGAGGCCTT TATAACGACT TGTATCGCTG TATCGATATC CTCCGTGGAA 2760 

TAGTAGATTT ATTTTCTAAT ATCATTCAAG ACTTGTTTAA CCCATTTCTT GAAAGAAATA 2820 

AAATTACATC TTCTTTATCC TTGGCATCTG CTTTGTCTGA GACAAATTAG AATGTCAATA 2880 

CTTGG 2885 



(2) INFORMATION FOR SEQ ID NO: 224:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3144 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 224: 



TATCAATCCT TTCCCATTAT AGGAGCAACA GAGTGGGAGT AGTCATCTAA GGACTAATTT 60

ATGTATTTTT ACGAGTCAGT ATCTTGGGAT ACTGGTTTTT ACTTTTCTAG ACTTTTTGAC 120

TACTTGTTAA AACTGGGATA ATTTTCGACT GTTTAACAGT TATTATGCAA AGTCTAAAAG 180

ATTAGAATTG TCAAAACAAT CCGTCTAGGC TTGATTTTAT CCTTTATTTH PTiTa a R T» mo 240

AGAAGGAAAA ATGTCAAACT TTTATATTGC AAATAGGAGA AATCATGACA AAAACATTAA 300

AACGTCCTGA GGTTTTATCA CCTGCAGGGA CTTTAGAGAA GCTAAAGGTA GCTGTTCAGT 360

ATGGAGCAGA TGCTGTCTTT ATCGGTGGTC AGGCCTATGG TCTTCGTAGC CGTGCGGGAA 420

ACTTTACTTT CGAACAGATG GAAGAAGGCG TGCAGTTTGC GGCCAAGTAT GGTGCCAAGG 480

TCTATGTAGC GGCTAATATG GTTATGCACG AAGGAAATGA AGCTGGTGCT GGTGAGTGGT 540

TCCGTAAACT GCGTGATATC GGGATTGCAG CAGTTATCGT ATCTGACCCA GCCTTGATTA 600

TGATTGCAGT GACTGAAGCA CCAGGCCTTG AAATCCACCT TTCTACCCAA GCCAGTGCCA 660

CTAACTATGA AACCCTTGAG TTCTGGAAAG AGCTAGGCTT GACTCGTGTC GTTTTAGCGC 720

GTGAGGTTTC AATGGAAGAA TTAGCTGAGA TCCGCAAACG TACAGATGTT GAAATTGAAG 780

CCTTTGTCCA TGGAGCTATG TGTATTTCAT ACTCTGGACG TTGTACTCTT TCAAACCACA 840

TGAGTATGCG TGATGCCAAC CGTGGTGGAT GTTCTCAGTC ATGCCGTTGG AAATACGACC 900

TTTACGATAT GCCATTTGGG AAAGAACGTA AGAGTTTGCA GGGTGAGATT CCAGAAGAAT 960

TTTCAATGTC AGCCGTTGAy ATGTCTATGA TTGACCACAT TCcAGATATG ATTGAAAATG 1020

GTGTGGACAG TCTAAAAATC GAAGGACGTA TGrAGTCTAT TCACTAyGTA TCAACAGTAA 1080

CCAACTGCTA CAAGGCGGCT GTGGATGCCT ATCTTGAAAG TCCTGAAAAG TTTGAAGCTA 1140 

TCAAACAAGA CTTGGTGGAC GAGATGTGGA AGGTTGCCCA ACGTGAACTG GCTACAGGAT 1200 

TTTACTATGG TACACCATCT GAAAATGAGC AGTTGTTTGG TGCTCGTCGT AAAATCCCTG 1260 

AGTACAAGTT TGTCGCTGAA GTGGTTTCTT ATGATGATGC GGCACAAACA GCAACTATTC 1320 

GTCAACGAAA CGTCATTAAC GAAGGGGACC AAGTTGAGTT TTATGGTCCA GGTTTCCGTC 1380 

ATTTTGAAAC CTATATTGAA GATTTGCATG ATGCTAAAGG CAATAAAATC GACCGCGCTC 1440 

CAAATCCAAT GGAACTATTG ACTATTAAAG TCCCACAACC TGTTCAATCA GGAGACATGG 1500 

TTCGAGCTCT TAAAGAGGGG CTTATCAATC TTTATAAGGA AGATGGAACC AGCGTCACAG 1560 

TTCGTGCTTA ATGTAGTTGT TTAGTTTTAA AAAACTATGC AAAGCTCCAT ATACAACACT 1620 

TAAACGAGAT TAAAGAATGG CGAAATCCCT TGATGCGCAA GAGATTAGCT GTCTTTTTTA 1680 

TTTTTTAAGT GATAAAGTCG GAGTTTAGGC ATCAAAGCCT ATCAAATTAA ACAAAGAAGC 1740 

GATGTCTTAG ATATTTTGAA AAAAATTAAT AAGCAGAAAA CTCTCTATTA TTTTGTTGTA 1800 

GAGAGTTTTT TGTTAATAAA ATTTCACAAA ATGACATTTA TATATTGCAT TAAGTTAGAT 1860 

ATATGATATA ATATTGTTAA AAAGAGGCGC AACTTTTTAA AATTAATGAG AATCAAAGAG 1920 

AAAACCAATA ATATTAATGG AGGAATAAAA AATGTAAGTA AGCATTATGG TCATTCAATC 1980 

ATTCTCAAAG ATATAAATTT TGCACTTAAC AAGGGTGAAA TTGTTGGTCT AGCAGGGAGA 2040 

AATGGAGTTG GTAAGAGTAC GTTGATGAAA ATTCTTGTTC AGAATAATCA ACCGACTTCA 2100 

GGTAATATTA TAAGCAGTGA TAATGTTGGG TATTTAATCG AAGAACCAAA ATTATTTTTA 2160 

TCTAAAACAG GTTTAGAGAA TTTAAAATAT TTGTCAAATT TATATGGTGT TGACTACAAT 2220 

CAAGAAAGAT TTAGATGTTT GATCCAAGAG TTAGATTTGA CTCAGTCTAT TAATAAAAAA 2280 

GTAAAGACCT ATTCTTTGGG TACAAAACAA AAATTAGCTT TGCTTCTAAC TCTCGTTACG 2340 

GAACCTGATA TATTGATTTT AGATGAACCG ACTAATGGTT TAGATATTGA ATCATCACAA 2400 

ATAGTTTTAG CGGTTCTAAA AAAATTAGCT TTACATGAAA ATGTGGGAAT TTTAATATCG 2460 

AGTCATAAAT TAGAAGACAT TGAAGAAATT TGTGAGAGAG TTCTTTTCTT GGAGAACGGG 2520 

CTTTTGACAT TTCAAAAAGT AGGATIAAGAT AGTCATAATT TCTTGTTTGA GATAGCTTTT 2580 

TCATCAGCTA CAGATAGAGA CATTTTCATT ACCAAACAAG AATTTTGGGA TATTGTTTAG 2640 

GAAGAGGGAT TGAGAATTAC TATGTCTGGG AATATTCAAA ATAGTGAGCT TTTTAAATTT 2700 

TTTAACGAAA ACTCTATTAA AGTAGTTGAT TTTGAAACTA AAAAAGAGAC GCTTAAAGAT 2760 

ATTTACCTAA ATCGTTCAAA ATAAAGGAAG GTTATAATCA TGAAATTAAA TAAACAGAAG 2820 
AATCGGATGA TTTACGTCTT GTCTAATTTT CTATATGCTA TCTCAGTTTC CATTATTTAT 2880

GCTTTGAATG GCATTGTGTT ACTAGTCATA GTAAGTAAAT TGGGTATTCC AGGTGATTTA 2940 

GGATTAAATT TTATAGTAGC TATTGTAGTC AATACAATTT TGTTAGTCCT GTTTTATTTT 3000 

CTATTATCTT ACATTTTCTA TTTATACAAA TTGAAAAGTG GCTTGGTATw TGGTATTTTA 3060 

GTAGCTTTAC TACTCTTTAT CTCTAATATA TTAAATACGA TGATGATGAA TACTAGTAAT 3120 

GATTTGTTTA TCAAAGCAAT TGAA 3144



(2) INFORMATION FOR SEQ ID NO: 225: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3766 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 225: 

TACGGTATTA TTTTTAAGGA GAT^GAATCA TGAAAATCAA AAAATGGCTT GGTCTAGCAG 60 

CCCTTGCTAC AGTCGCAGGT TTGGCTCTTG CAGCTTGCGG AAACTCAGAA AAGAAAGCAG 120 

ACAATGCAAC AACTATCAAA ATCGCAACTG TTAACCGTAG CGGTTCTGAA GAAAAACGTT 180 

GGGACAAAAT CCAAGAATTG G'T'TAAAAAAG ACGCAATTAC CTTGG.^J'.TTT ACAGAGTTCA 240 

CAGACTACTC ACAACCAAAC AAAGCAACTG CTGATGGCGA AGTAGATTTG AACGCTTTCC 300 

AACACTATAA CTTCTTGAAC AACTGGAACA AAGAAAACGG AAAAGACCTT GTAGCGATTG 360

CAGATACTTA CATCTCTCCA ATCCGCCTTT ACTCAGGTTT GAATGGAAGT GCCAACAAGT 420 

ACACTAAAGT AGAAGACATC CCAGCAAACG GAGAAATCGC TGTACCGAAT GACGCTACAA 480 

ACGAAAGCCG TGCGCTTTAT TTGCTTCAAT CAGCTGGCTT GATTAAATTG GATGTTTCTG 540 

GAACTGCTCT TGCAACAGTT GCCAACATCA AAGAAAATCC AAAGAACTTG AAAATCACTG 600 

AATTGGACGC TAGCCAAACA GCTCGTTCAT TGTCATCAGT TGACGCTGCC GTTGTAAACA 660 

ATACCTTCGT TACAGAAGCA AAATTGGACT ACAAGAAATC ACTTTTCAAA GAACAAGCTG 720 

ATGAAAACTC AAAACAATGG TACAACATCA TTGTTGCAAA AAAAGATTGG GAAACATCAC 780 

CTAAGGCTGA TGCTATCAAG AAAGTAATCG CAGCTTACCA CACAGATGAC GTGAAAAAAG 840 

TTATCGAAGA ATCATCAGAT GGTTTGGATC AACCAGTTTG GTAATAAGAA ACAGGGAGGT 900 

GGGAGAGAAA ATTCCACCTC TTGCTTTTGT ATAGAGTATA GATTGTAAAG AAGACTATTC 960 

GTTCATAGAA AGGTAGAGAG AATATGGTTT TTCCTAGCGA ACAAGAACAG ATTGAAAAAT 1020 
TTGAAAAGGA TCATGTAGCC CAGCATTATT TTGAGGTTTT GCGTACCTTG ATTTCTAAGA 1080

AATCAGTCTT TGCCCAGCAG GTTGGACTCA AGGAAGTCGC AAATTATCTG GGTGAGATTT 1140 

TCAAGCGTGT TGGAGCTGAA GTGGAGATTG ATGAGAGCTA TACAGCGCCC TTTGTCATGG 1200 

CACATTTCAA GAGTTCGCGT CCAGATGCCA AGACCTTGAT TTTCTATAAC CACTATGACA 1260

CTGTGCCAGC GGATGGGGAT CAGGTCTGGA CAGAGGATCC kTTTACGCTT TCGGTCCGCA 1320 

ATGGCTTCAT GTATGGGCGT GGGGTTGATG ACGACAAGGG TCATATCACA GCTCGCTTGA 1380 

GTGCTTTGAG AAAATATATG CAGCACCATG ATGATTTACC TGTCAATATC AGCTTTATCA 1440 

TGGAGGGAGC GGAGGAATCG GCTTCAACAG ACCTAGATAA GTATTTGGAA AAGCATGCAG 1500 

ACAAACTCCG TGGGGCGGAT TTGTTGGTCT GGGAACAAGG GACCAAAAAT GCCTTGGAAC 1560 

AGCTGGAAAT TTCTGGTGGC AATAAGGGGA TTGTGACCTT TGATGCCAAG GTAAAAAGCG 1620 

CTGATGTGGA TATCCACTCG AGTTATGGTG GTGTTGTGGA ATCAGCTCCT TGGTATCTCC 1680 

TCCAAGCCTT ACAGTCTCTT CGTGCTGCGG ATGGCCGTAT CTTGGTTGAA GGCTTGTACG 1740 

AAGAAGTACA AGAGCCCAAT GAACGAGAAA TGGCCTTGCT AGAAACTTAT GGTCAACGAA 1800 

ACCCAGAGGA AGTTAGTCGG ATTTATGGAT TGGAGTTGCC TCTCTTACAG GAGGAGCGGA 1860 

TGGCCTTTCT AAAACGTTTC TTTTTCGATC CAGCGCTTAA TATCGAAGGA ATCCAGTCTG 1920 

GTTATCAAGG TCAGGGTGTT AAGACTATTT TACCTGCAGA AGCCAGTGCC AAGCTAGAGG 1980 

TTCGTCTGGT TCCGGGCCTA GAACCGCATG ATGTTCTGGA AAAAATTCGG AAACAGCTAG 2040 

ACAAAAATGG CTTTGATAAG GTAGAATTAT ACTATACCTT GGGAGAGATG AGCTATCGAA 2100 

GCGATATGAG CGCACCAGCC ATTCTCAATG TGATCGAGTT GGCCAAGAAA TTCTATCCAC 2160 

AGGGCGTTTC AGTCTTGCCG ACGACAGCGG GGACAGGACC TATGCATACG GTCTTTGATG 2220 

CCCTAGAGGT ACCAATGGTT GCATTCGGTC TAGGAAATGC CAATAGCCGA GACCACGGTG 2280 

GAGATGAAAA TGTGCGAATC GCTGATTATT ACACCCATAT CGAATTAGTA GAGGAGCTGA 2340 

TTAGAAGCTA TGAGTAGAGA TATTATCAAG TTAGATCAGA TCGATGTGAC TTTTCACCAA 2400 

AAGAAGAGAA CCATCACAGC GGTTAAGGAT GTGACCATTC ACATCCAAGA AGGGGATATC 2460

TACGGAATCG TTGGATATTC TGGAGCAGGA AAATCAACCC TTGTACGGGT GATTAATCTC 2520 

TTGCAAAAAC CATCTGCAGG GAAAATTACC ATTGACGACG ATGTGATTTT TGACGGCAAG 2580 

GTGACCTTGA CGGCAGAGCA GTTGCGTCGT AAACGTCAAG ATATCGGAAT GATTTTCCAG 2640 

CATTTTAACC TGATGAGCCA AAAGACAGCA GAGGAGAATG TAGCCTTTGC CCTTAAACAC 2700 

TCTGAACTCA GCAAGGT^AGA AAAGAAGGCT AAAGTAGCTA AGTTGTTGGA CTTGGTTGGT 2760

TTGGCAGATC GTGCTGAAAA CTACCCTTCA CAACTATCTG GAGGGCAAAA ACAGCGTGTG 2820 
GCAATTGCGC GTGCCTTGGC CAATGATCCA AAAATCTTGA TTTCAGACGA GTCAACTTCT 2880

GCCCTTGATC CGAAGACAAC CAAGCAGATT TTGGCCTTGT TGCAAGATTT GAACCAAAAA 2940 

TTAGGCTTGA CTGTTGTCTT GATTACGCAT GAAATGCAGA TTGTCAAAGA CATTGCCAAC 3000 

CGTGTTGCAG TTATGCAGGA TGGGCATTTG ATTGAAGAGG GTAGTGTGCT TGAAATCTTC 3060 

TCAAACCCTA AACAACCTTT GACTCAAGAC TTTATCTCAA CAGCTACAGG TATTGACGAA 3120 

GCCATGGTCA AAATCGAGAA GCAAGAAATC GTGGAACACT TGTCTGAAAA CAGTCTCTTG 3180 

GTGCAACTCA AGTACGCTGG AGCTTCAACA GACGAGCCAC TTTTGAATGA ATTGTACAAG 3240 

CATTACCAAG TAATGGCTAA TATTCTCTAT GGGAATATCG AAATTCTCGA TGGTACTCCT 3300 

GTTGGAGAAT TGGTGGTGGT TTTGTCAGGT GAAAAAGCAG CGTTGGCAGG TGCCCAAGAA 3360 

GCCATTCGTC AAGCAGGTGT ACAACTAAAA GTATTGAAGG GAGTACAGTA AGATGGAATC 3420 

ATTGATTCAA ACCTATTTAC CAAATGTCTA TAAGATGGGT TGGGCTGGTC AGGCAGGCTG 3480 

GGGAACGGCT ATCTACTTAA CTCTTTATAT GACAGTTCTT TCCTTCATTA TCGGAGGCTT 3540 

CTTGGGGCTA GTGGCAGGTC TCTTTCTCGT CTTGACAGCG CCAGGTGGTG TCTTGGAGAA 3600 

TAAAGTCGTA TTCTGGATTT TAGACAAAAT TACCTCAATT TTTCGTGCGG TTCCCTTTAT 3660 

CATCCTCTTG GCAATCTTGT CACCACTTTC TCACTTGATT GTTAAAACAA GTATCGGGCC 3720 

AAATGCAGCC CTTGTCCCAC TTTCTTTTGC AGTCTTTGCC TTCTCG 3766
(2) INFORMATION FOR SEQ ID NO: 226: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2520 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 226:

TGTTGCTGAG TTAATCGGTA CGTTCATGTT TGTATTCGTC GGGACAGGAG CTGTTGTTTT 60 

TGGAAATGGT CTTGATGGCC TTGGTCACCT TGGAATCGCC TTTGCCTTTG GTTTGGCAAT 120 

CGTGGTGGCA GCCTACTCAA TCGGAACTGT TTCAGGTGCT CACTTGAACC CGGCTGTTTC 180 

GATTGCTATG TTTGTAAACA AACGTTTGTC ATCTTCAGAA CTTGTAAACT ACATCCTTGG 240 

TCAGGTTGTT GGAGCTTTCA TCGCTTCTGG CGCTGTCTTC TTCCTCTTGG CTAACTCAGG 300 

TATGTCAACT GCTAGTCTTG GTGAAAATGC CTTGGCAAAC GGTGTCACTG TCTTTGGTGG 360 

TTTCTTGTTT GAAGTCATCG CAACTTTCTT GTTTGTATTG GTTATCATGA CTGTGACTTC 420 
AGAAAGCAAG GGCAATGGCG CGATTGCTGG TTTGGTAATC GGTTTGTCAT TGATGGCGAT 480

GATTCTTGTC GGATTGAAGA TTACTGGACT TTCAGTAAAC CCAGCTCGTA GCTTGGCACC 540 

AGCTGTCTTG GTAGGCGGCG CAsCCTTCAA CAAGTTTGGA TTTTCATCCT TGCACCAATC 600 

GCTGGTGGAG TTCTTGCAGC CCTTGTTGCA AAAAATTTCC TTGGAACAGA AGAATAATTG 660 

AAACTCAAAA AGCCTTGCTC CTCATCTTGA GGAACAGGGC TTTTTCGTAT GATACTCTTC 720 

GAAAATCTCT TCAAACCACG TCAGCTTCAT CTTGCCGTAG TATGGTTACT GACTTCGTCA 780 

GTTCTATCCA CAACCTCAAA ACAGTGTTTT GATCTGACTT CGTCAGTTCT ATCTGCAACC 840 

TCAAAACAGT GTTTTAAGCT GACTTCGTCA GTTCTATCTG CAACCTCAAA ACAGTGTTTT 900 

AAGCTGACTT CGTCAGTTCT ATCTGCAACC TCAAAACAGT GTTTTAAGCT GACTTCGTCA 960 

GTTCTATCTG CAACCTCAAA ACAGTGTTTT AAGCTGACTT CGTCAGTTCT ATCCACAACC 1020 

TCAAAACAGT GTTTTGATCT GACTTCGTCA GTTCTATCCA CAACCTCAAA ACAGTGTTTT 1080 

GATCTGACTT CGTCAGTTCT ATCCACAACC TCAAAACAGT GCTTTGAGCA ACcTGCGGCT 1140 

AACTTCCTAG TTTGCTCTTT GATTTTCATT GAGTATGACT TTAGCGGTTG TCAATTTTCT 1200 

CTGGATAAAG GTCGTGTTGG AAGAGGCGTT GTTCTGCCAA GCCCTCATAC TTAGTTCCTT 1260

GCTTACCGTA GTTGTAGTAG GGGTCGATTG AAATGCCACC GCGCGGAGTG AATTTTCCCC 1320 

AGACTTCTAA ATAGCGAGGG TCTAGCAAGT TGACCAAGTC TTTCCCGATG GTGTTGATAC 1380 

AGTTTTCGTG GAAATCTCCG TGGTTTCGGT AGCTAAATAG ATATAGTTTG AGGGATTTTG 1440 

ACTCGACACA GAGCTTGTCA GGAATGTAGG AAATATGAAT CGTCGCAAAG TCTGGCTGAG 1500 

CAGTGATTTG TCCCAGCAGA GACATATCGA GGATATGGTG ACGAATGCCC TGTTCCTTAG 1560 

CGATTTCTCT AGTAATTTGA ATTTCGAGGT GATGACGTTG GCCGTAGGCA AAGGTGACAG 1620 

CTTCGACTGT TTCATAGTGT TGCATGACCC AGAAAAGGCA GGTTGTTGAA TCTTGACCAC 1680 

CACTAAAGAC GACCAAGGCT AATTGACGTT TCATAGTACT CCTTCCAAAA TGGGAAATGT 1740 

TCAGAGCACG CAAAAAGCTC CCATTAGGGA GCTAAAAAAT ACCAAATCGA GGTTTTTTTA 1800 

GCGATGGCAT ATCCCAAACA TCGTAATATT CTACTTATAT AGTAAAATGA AATAAGAACA 1860 

GGACAAATCG ATCAGGACAG TCAAATCGAT TTCTAACAAT GTTTTAGAAG TAGAGGTGTA 1920 

CTATTCTAGT TTCAATCTAC TATAGTCTAG CATATTTTTT GAAAAATGGC AAAGGGCAAG 1980 

AAAAAAGAGA CCAAAGAAAG TACTTGGTCT CTCGTTTGAT TAGCTCAATT CAGCAATGAT 2040 

GGCCTTGATT TGTTCTGCTG TGTGAACACC TGCAACTTGT TTGACAACTT GGCCGTCTTT 2100 

TTTGAAGAGA AGAGTTGGAA TAGACATGAT TCCAAAAGCA CGAGCTGTGT TTGGATTTTC 2160 

ATCAACGTCC ATTTTAACGA TTTTCAAGAC ATCTTCTGAA AGTTCTTCAG ACAATTTGTC 2220 
CAAGATTGGA CCTTGCATAC GACATGGACC ACACCAAGTT GCCCAGAAGT CTACTAAGAC 2280

CAAACCGTCT TTTGTTTCTT GTTCGAATGT TGCATCTGTA ATTGCTTTTG CCATTGTATT 2340 

TCTCCTTTTT TTAGTTATAT TGGCTTAAAT CTTGTTTCAT GAGATAGAAG AAGATATCTC 2400 

CATAAGTCCC ATGGTAGTCC AAATTATGAC CCTTGTAAGT TAATTTTTGG ACAGGGTAGT 2460 

AkkCTGCGAC GCCGATAAGG CAAGCTTGTT GCGAACGTTC AAAGTCTTCA TAAGACTCGG 2520 



(2) INFORMATION FOR SEQ ID NO: 227: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5278 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 227: 

ACTCAGTTAG ATTTTGTTTT CAAAAACAAC GAAGAAAAAG ACCATGTTGC TCTACTTGGA 60 

AGAATTGGCT CCGAACGTGT TTATCGATAT ATTAATAAAA AATATTTAGA TTTACCGGAA 120

ACATTCGAAA ATTATAATGT TTTTGTACCA GAAGCTAATG GAAGTGGTGC CTTAGGTGAA 180 

GTCTTATCAA CACCCCTAAT CGGGGAACCC CTAATCGGGC ATACAGATAC TTTTTTATCT 240 

ATTGGTAATT TTAAAACAAA ATTTGAAGCC GATGCTTGTA TTAAATTTAT TT'-VL-.CTAAA 300 

TTCGCTAGAG TATTATTAGG TGTTTTGAAA GTTACTCAGC ATAATTCACG CAAAACTTGG 360

TATTACGTCC CCCTCCAAGA CTTTACGGTC AATTCGGACA TTGATTGGAC ACAATCAGTG 420 

ACTGATATTG ACCGCCAGCT TGATCAAAAA TATGACTTTT CCCCTGAAGA AATTGCCTTT 480

ATTGAGAATC ATGTAAGGGA GATGGATTAG AAAAGTATTT TTATTTGACA AATAGTGCTC 540 

AATGATCTAA AATGACTATA TAGGATTAGG TCAGGAAGCA TACGATGCCC TGACCCTTTT 600 

TGTACTTATG AGATGAGAAA GTCATTTGTT AGATAAATTG ACTCGTTAGC AAACGTTCAA 660 

AAAAGGAAAA CTTATGCCAG TAGAAATTAA AACCACTAAA GAAATTCATC CTAAAATCTA 720 

TGCCTACACC ACACCGACAG TAACCAGTAA TGAAGGCTGG ATTAAGATTG GGTATACAGA 780

ACGTGATGTC ACACAACGTA TCAAGGAGCA AACGCATACA GCTCATATAG CTACAGATGT 840 

CTTATGGACT GGTGATGCAG CTTATACAGA AGAGCCTGAT AAGGGGAAAA CTTTCAAGGA 900 

CCATGATTTC CACCATTTCC TTTCTTTCCA TGATGTAGAA CGTCGTCCCA AGACGGAATG 960 

GTTCTATTTT AATGGAACTC CTGAAAAATC AAAAAATCTT TTTGATAAGT TTGTTCAGCA 1020 

TGATTTGTCT GGTTATCAGC CTGGAAAAGG ACAGGACTAT ACTCTGCGAC AAGAGCAAGA 1080 




AGAAGCAGTT GCTAAGACAT TAGCTTATTT CCAAGAACAT GCTGGAGGCA AGTTTCTCTG 1140 

GAATGCCAAG CCACGCTTTG GTAAAACCTT GTCTACCTAT GACCTAGCTC GACGGATGGA 1200 

AGCTGTCAAT GTCCTAATTG TAACAAACCG CCCTGCCATT GCTAACTCAT GGTATGATGA 1260 

TTTTGAAACA TTCATAGCAG GTCAAACGAC TTACAAGTTT GTTTCTGAAT CAGATAGCCT 1320 

TAAGAGTCGT CCAATCTTGT CACGACAAGA ATTTCTTGGT ATTTTAGCTG ACGATGTAAG 1380 

ACAACTTGCT TTTATCAGTC TCCAAGACTT GAAAGGATCT GTTTATTTAG GTGGAGAGCA 1440

CGATAAACTC AAATGGGTAA CTGATCTGCA TTGGGACTTG TTGGTTATTG ACGAGGCTCA 1500 

TGAAGGAGTT GATACCTTCA AGACTGACCA AGCCTTTAAT AAGATTCGAC GAAATTTTAC 1560 

TCTGCATTTG TCAGGTACAT CATTTAAAGC ATTGGCTAAA GGAGATTTTA CAGAGGAACA 1620 

AATCTACAAC TGGTCTTATG CTGATGAGCA GGCTGCTAAG TATTCGTGGT CTCTTGAGCA 1680 

AGAAGAGGAA AATCCTTATG AAAGCTTGCC TCAGTTGAAT CTCTTTACCT ATCAAATGTC 1740 

TCAGATGATT GGCGAAAAGT TAGAAAAAGG CGCTCAGATC GATGGTGAAA ATATTGACTA 1800 

TGTTTTTGAC TTAAGTGAAT TTTTCGCTAC AGATGATAAA GGGAAATTTA TTCATGAGCA 1860 

TGATGTCAGA AATTGGTTAG ATACTCTATC AAGCAATGAA AAATATCCAT TTTCAACCAA 1920 

AGAACTCCGT AATGAACTCA AGCATACTTT TTGGCTTTTA GAACGTGTCG CTTCGGCCAA 1980 

AGCATTAAAA GCCCTACTAG AAGAACACCC AATCTATGAA AACTATGAGA TCGTTCTAGC 2040 

TGCTGGTGAC GGACGTATGT CCGAAGAAGA CGATAAAGTC AAACTCAAAT CCTTGGACTT 2100 

GGTTAGAAAA GCGATAGCAG AGAATGACAA AACCATTACC CTATCCGTTG GTCAGCTGAC 2160 

GACAGGTGTC ACTATCCCTG AATGGACAGG TGTATTGATG TTATCAAATT TGAAATCACC 2220 

AGCTCTTTAT ATGCAGGCCG CCTTCCGTGC TCAAAATCCT TACTCATGGA GCGATAACAA 2280 

AGGAAATCAC TTTCGCAAAG AAAGAGCCTA TGTATTTGAC TTTGCGCCGG AAAGAACCTT 2340 

GATTCTCTTT GATGAGTTTG CCAACAACTT ATTGCTTGTA ACTGCAGCTG GTAGAGGAAC 2400 

TTCAGCTACA CGCGAAGAAA ATATTAGAGA ATTATTAAAC TTCTTTCCAA TTATTGCCC'-A 2460 

AGACCGTGCT GGTAAGATGG TTGAAATTGA TGCAAAGGCA GTTCTAACCA CTCCTCGCCA 2520 

GATAAAAGCT AGAGAAGTTC TTAAACGAGG TTTTATGTCC AATCTCTTAT TTGATAATAT 2580 

TAGTGGTATT TTCCAAGCAA GTCAAACAGT TTTAGATATT TTAAATGAGC TGCCAGTTGA 2640 

AAAGGAAGGG AAGGTACAAG ATAGTTCTGA TTTATTAGAT TTTTCAGATG TTACAGTCGA 2700 

TGATGAGGGA AATGCAGTAG TAGACCATGA AATTGTAGTT AATCAGCAAA TGCGACTTTT 2760 

TGGTGAAAAA GTTTATGGAC TTGGTGAATC TGTTGCTGAG TTAGTCACAA AAGATGAGGA 2820 

ACGAACTCAA AAACAGCTGG TCAATGACTT GAGTAAGACC GTTTCTTCAG TGATTGTAGA 2880 
GGAATTGAAA GCAGATTATT CTCTAAAAAC AAGGGAAACT GAGCAAATTA AGAAACAAAT 2940

TACAGCAACA CTTGAGAATG AAATTCGAAA AAATGATATC GAAAGAAAAA TTTCTGAAGC 3000 

TCATATCAAG CAAGAGTTGC AACAGCAGCT CAAAGAAGCA AATGATAAAG CGCAAAAAGA 3060 

TAAGATTCAA GAAGATTTGG AAAAACGTTT AGAAGAAAAT AAACTCATTC ATAAAGAAAA 3120 

ACTAGAACAA ACACTCAAAA AAGAAGTGGA AAAAATGCCT GAGAAATTTA TCGAACAGGT 3180 

TGAGATAAAA CGTGTGGAAC AGTTGAAACA ATCAGCTCAA GATGAAATTC GTGACCATTT 3240 

ACGAGGGTTT GCAAGAACAA TTCCAAGTTT TATTATGGCT TACGGTGATC AAACTCTAAC 3300 

ACTTGATAAT TTTGATGCCT TTGTTCCTGA ACATGTTTTT TATGAAGTAA CAGGGATTAC 3360 

GATTGATCAG TTTAGATATT TGCGAGATGG TGGGCAGGAT TTTGCAGGGC ATCTCTTTGA 3420 

TAAAGCAACA TTTGACGAAG CTATTCAAGA ATTTCTTCGC AAGAAAAAGG AGTTGGCGGA 3480 

TTATTTTAAA GATCAAAAAG AAGACATTTT TGACTATATT CCACCGCAGA AGACCAACCA 3540

AATTTTCACT CCTAAACGAG TGGTGAAAAG GATGGTAGAT GATTTGGAAA AGGAAAATCC 3600

AGGGATTTTT GATGATCCAT CTAAGACTTT TATTGATTTA TATATGAAGT CAGGCCTCTA 3660 

TATTGCAGAA CTTGTGAAGC GGTTATATAA TAGCAATGGC TTGAAAGAGG CCTTTCCAAA 3720 

TCCTGAAGAA CGCTTAAAAC ATATTTTGGA AAAGCAAGTT TATGGATTTG CTCCGTCTGA 3780 

GATTATCTAT AACATTTCCA CTAATTTTAT ATTTGGCAJIT CTTTCTAAAC ATATCAGTAG 3840

GAAGAATTTT GTTTTAGCAG ATACCATTCC AGCGGCTAAA GAAGGGAGCA TTCAAAAGTT 3900

GGTTGATTCC TATTTTGAAA ATAATTAAAA AGAAGGCCGA GTCAAAATTC TTTGAAATCA 3960 

GAAAAAACGC ATAATATTGA GTGCTTTTGT ACTGCCCCCC AAAAGTTAGA CAGAAAAAAT 4020

CTAACTTTTG GGGGGCAGTT CAGACAATCC TTGGTATTAT GCGTTTTATT GTGGGAAGAT 4080

GTATAATGGA TTGAAATAAG ATATGAACAA ATCAATTAGG AATTTAAAGC ATTTTATAAC 4140 

AACGTTTTAG AGTAATGGGG GGCTATTTCA ACTTCAACCT ACTATAATAC AGAAAAAAAC 4200 

AACTCCCTGA TAATTCAAGG AGTTGTCTAT AGTTAAATTA GTTTTTAGAA GCTTCTTGGA 4260 

ATTCTGGGTT TTTCCATGCT TCGTCAATGA TAGCTTGTAA TTCTTTAGCA GATGCTTGCA 4320 

TTTTTTGAGT TTCTGCGTCG TTCAATGGGA TATTTACTGG ACGAACGATA CCATGTGCAC 4380 

CAACAACAGC TGGTTGACCG ATAAAGACAT TCTCAACTCC GTATTGACCT TCTTGGAATA 4440 

CTGAAAGTGG AAGTACTGCG TTTTCATCGT CAAGGATTGC TTTAGTGATA CGAGCAAGGG 4500 

CTACTGCGAT ACCGTAGTAT GTTGCACCTT TTTTGTTGAT GATTGTGTAG GCTGCATCAC 4560

GAACACCTTC GAACAATTCA ATCAATTCAG CTTCTTGAAC ATTTTGAGTG TCTTTAAGGA 4620

ATTCTTCAAG GTTTACACCA GCGATGTTAG CGTGTGACCA AACAGCGAAC TCAGAGTCAC 4680

CGTGTTCACC CATGATGTAG GCGTGCACTG AACGAGCATC CACATCCAAT TTTTCAGCAA 4740 

GTGCTTGACG GAAACGAGCT GAGTCAAGTG AAGTACCTGA ACCGATAACG CGTTCTTTAG 4800

GGAAACCAGA GAATTTCCAA GTTGAGTAAG TCAAAACGTC AACTGGGTTA GCAGCAACAA 4860 

GGAAGATACC TTTGAAACCA GATTCAACAA CTTGAGTTAC GATTGATTTG TTGATAGCAA 4920

GGTTTTTACC TACAAGGTCA AGACGAGTTT CACCTGGTTT TTGAGGTGCA CCTGCAGTGA 4980

TCACAACAAG GTCAGCGTCT GCACAGTCAG AGTATTGAGC TGCATAGATT TTTTTAGGTG 5040 

AAGTGAAGGC AAGGGCGTGA CTAAGGTCAA GCGCATCACC AACAGCTTTT TCATGCAATT 5100 

GTGGAATTTC GATAATTCCA AGCTCTTGTG CAATTCCTTG GTTAACAAGT GCAAAAGCGT 5160 

AAGATGAACC TACAGCACCA TCACCGACAA GGATAACTTT TTTGTGTTGT TTAGTTGAAG 5220 

TCATTGTTTT AAACATCTCC TTAATTTTAT TAGGGGATTT TCCCTAGACA ACTTCATT 5278 
(2) INFORMATION FOR SEQ ID NO: 228: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1941 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 228:
ATAAGGAATC TCTA7LAAAAT TTTAAGGAGA ATCTAGCAAA TGGATTTCAC ATGGGCACTG 60 
AAGTATGCCA CTGAATTTTT GGGAACTGCC ATTTTGATCA TTCTTGGGAA TGGTGCAGTT 120 

GCCAACGTTG AACTTAAAGG TACGAAAGGT CACCAAAGTG GCTGGATCGT CATCGCTGTT 180 

GGTTATGGTA TGGGGGTTAT GATCCCAGCC TTGATGTTTG GTAACGTATC TGGGAATCAC 240 

ATCAACCCTG CTTTCACTCT AGGGCTTGCA GTTAGCGGTC TTTTCCCTTG GGCACAAGTG 300 

GTACCTTACA TTATCGCGCA AGTCTTGGGG GCTATCTTTG GCCAAGCCTT AGTTGTGGCA 360

ACATACCGTC CATTCTACTT GAAAACTGAA AACCCAAATA ACATCTTGGG AACTTTCTCA 420 

ACTATTTCAA GTATTGACCA TGGTACAAAA GAAAGTCGCT ATGCAGCAAC TGTCAATGGT 480

TTGATTAATG AGTTTGTTGG TTCATTTGTT TTGTTCTTTG CAGCTCTTGG TTTGACTAAA 540 

AACTTCTTTG GTGCTGAAGT GCTTCAATTC ATGAAACAAA AGGCAACAGA AGCAGGACAA 600 

ACAGTTGATT TTTCTGACTT GGCTATTAAA GCACAGGTGG CTCCACACAC TGCTTCAGGA 660 

CTTTCTGTGG CTCACTTGGC ACTTGGATTC CTCGTTATGG CTTTGGTAAC ATCACTTGGA 720 

GGACCTACAG GACCTGCCTT GAACCCAGCC CGTGACTTGG GACCACGTCT CCTTCATGCT 780 
TTCCTTCCCA AATCAGTTCT TGGTGAGCAT AAAGGCGATT CAAAATGGTG GTATTCTTGG 840

GTACCAGTAG TAGCACCTAT CGCAGCAGCA ATTGCGGCAG TAGCTGTATT CAAATTCCTT 900 

TATCTCTAAG AAATAGCTCC TTTAACATTT GAGTGAGCAC CATCTATAAG TAAGAGAGGA 960 

TCAGACTGGk TCTCTCTTTT kGATTTTTaG GGAAATGAAA GAAcTCTAAA CAAACTCCTC 1020 

TCCAGCAGTG GTTTAGAAGT CTCAGTGGGC TATTCCAGCT TCAATGGACT ATAGTAGGTT 1080 

GCAGTTGAAA TAATAGACCC TTGTTTCTAA AACATTGTGA GAAATTGGTT TGAATTCTCC 1140

AATCAAATTG TGCAGTTTTC ATTCTACTAT ATATTATCGG AATATTATCG GAGATGGGTT 1200 

CCCTATCTTG TAAGTCTGCT TTATAGTGGG TTGAAGTTGG AATAGTCCTC CCTTCTTTCT 1260 

CAAACATTGT GAGGAATTGA TTTACCTTCC TCAACAAAAT GTTCAGTTTC TATTTCATTT 1320 

TACTATAAAA TAAGCGATTA GGGGGGCTAT TCTTCGACCT ACATTGACTC TGCTGAGTCC 1380 

TATGATTGTT ATCGTTTTAT CTGCAATTTT ATACTCAATG AAAATCAAAG GGCAAACTAA 1440 

GAAGCTAGCC GCAGGTTGTT CAAAACACAG TTTTGAGGTT GTATAGTAGA TTGAAACTAG 1500 

AATAGTACAC ATCTACTTCT AAAACATTGT TAGAAATCGA TTTGACTGTC CTGAACGATT 1560

TGCCCTATTC TTGTTTCATT TTACTATATA AACCAGAGAC TGTTTACATT TTCAGCAAGT 1620 

GAGTGGATGG ATAATGCTGA AAACTCCTTG AAGGATAAGT CTATTTAGTA CTTTCTATTA 1680 

ATTAGTTAAA TTTTTACCAA GAATAATTCA CAAAAACGTT GTAAAACACT TGCAATTTAG 1740 

CTGAAATTTG ATAAAATAGT AAGGAAAGTT AGACTGTATT GCCTACTGTC TATCTATAAA 1800 

ATATATTTTA TTGGAGGCTT TTACTCAAAT GGCAAAAGAA AAATACGATC GTAGTAAACC 1860 

ACACGTTAAC ATTGGTACTA TCGGACACGT TGACCACGGT AAAACTACCC TAACTGCAGC 1920 

TATCACAACT GTTTTGGCAC G 1941 



(2) INFORMATION FOR SEQ ID NO: 229: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 755 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 229: 

ATTTGAAGAA ATTGAAGAAA TCGTAGCCCC TACAGATGGT GAATTTTTGG GGGAAGTTTT 60 

ACTTGGAACT GGGGTAGTTC TCTTAATTGG AGTAGCCTGT TGTTAAAAAG ATAGGGAGTG 120 

ATAATCATGC AAGATAACTT TTTATTTGAG GAAATTGAAG AAATTTCAGT ACCAGTTAAT 180 
GATTTTTCAG CTGGACTTGC AACAGGTATC GGATTTGGTT TAGCAATCCT TGCTCTTGCT 240

GGTTGTTGAA GTTTGTTCAT TTACTAACAT CAAGCTTTTT CAATTTCATT TTAGACAGTC 300 

ATTTAAATTT TCCGTATTAG TCTTGCAGCA AGAGATTAAT AGAATTAGTC ATTATTTTAT 360 

TGATTGCGGA CTGAGGGACT AGAGTATGTT TTACTTAACC CCTCTTTTAT TTATTAAAGG 420 

TTAGGTTTGT TATGAGAATT GTTGATAAGA TTAAGATATT ACCTACTCCT TATGAGGGAC 480 

ACTATCATTT ATATATACCA TCCAGTAAGA AACATGTATT AGTTGGGAAA CAGGAAAAAA 540 

ATGGTTAGAG CAACTAATAG GTCAAGAATT TACCATATCG GACTTATTAG TGTTAGTAGG 600 

GAAGAAATAT TTTTAAAATA TCTTGGGACT TTAATATAAC ATTATCTGAA AAATTAAACT 660 

ATAAAAGATT TAATAAGAAT TTTGAAAAAA TCCTATCTTG TTGTCATTAT ATTTGCAACG 720 

ATACATGAAA TTAGTCATGC AATAATTGCT AATAA 755 



(2) INFORMATION FOR SEQ ID NO: 230: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1483 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 230:

CCAGAAAAAC CGTAGTGGAG CTCGTGGAAC AGTGGAATTG ATTTTCCAAA AAGAATACAA 60 

TAAATTTTCA AGTATCTCAA AGAGGGAGGC ATAAGATGTC AGATGCATTT ACAGATGTAG 120 

CCAAGATGAA AAAAATCAAA GAAGAAATCA AGGCACATGA GGGACAAGTC GTAGAAATGA 180 

CTTTGGAGAA TGGTCGTAAG CGCCAAAAAA ATAGATTGGG TAAGCTAATT GAAGTTTATC 240 

CATCTCTATT TATTGTGGAG TTTGGGGATG TGGAAGGAGA TAAACAAGTT AATGTTTACG 300 

TTGAATCCTT TACTTACTCA GATATTCTTA CAGAAAAGAA TTTGATTCAT TATCTTGACT 360

AAAGTGAGAA ATTTTCTCAC TTTTTCTTTT TTCTCCGAAT AATTTAGGTG AAGGCAATCA 420 

TCGCTTTATA TTATTTTTCA AGGAGGAAGA ATGAAAATTT TACCGTTTAT AGCAAGAGGA 480 

ACAAGTTATT ACTTGAAGAT GTCAGTTAAA AAGCTTGTTC CTTTTTTAGT AGTAGGATTG 540 

ATGCTAGCAG CTGGTGATAG TGTCTATGCC TATTCCAGAG GAAATGGATC GATTGCGCGT 600 

GGGGATGATT ATCCTGCTTA TTATAAAAAT GGGAGCCAGG AGATTGATCA GTGGCGCATG 660

TATTCTCGTC AGTGTACTTC TTTTGTAGCC TTTCGTTTGA GTAATGTCAA TGGTTTTGAA 720 

ATTCCGGCAG CTTATGGAAA TGCGAATGAA TGGGGACATC GTGCTCGTCG GGAAGGTTAT 780 

CGTGTAGATA ATACACCGAC GATTGGTTCC ATTACTTGGT CTACTGCAGG AACTTATGGT 840 
CATGTTGCCT GGGTGTCAAA TGTAATGGGA GATCAGATTG AGATTGAGGA ATATAACTAT 900

GGTTATACAG AATCCTATAA TAAACGAGTT ATAAAAGCAA ACACGATGAC AGGATTTATT 960 

CATTTTAAAG ATTTGGATGG TGGCAGTGTT GGGAATAGTC AATCCTCAAC TTCAACAGGC 1020 

GGAACTCATT ATTTTAAGAC CAAGTCTGCT ATTAAAACTG AACCTCTAGC TAGCGGAACT 1080 

GTGATTGATT ACTATTATCC TGGGGAGAAG GTTCATTATG ATCAGATACT TGAAAAAGAC 1140 

GGCTATAAGT GGTTGAGTTA TACTGCCTAT AATGGAAGCT ATCGTTATGT TCAATTGGAG 1200 

GCTGTGAATA AAAATCCTCT AGGTAAtTCT GTTCTTTCTT CAACAGGTGG AACTCATTAT 1260 

TTTAAGACCA AGTCTGCTAT CAAAACTGAA CCCCTAGTTA GTGCAACTGT GATTGATTAC 1320 

TATTATCCTG GAGAGAAGGT TCATTATGAT CAAATTCTCG AAAAAGACGG CTACAAGTGG 1380 

TTGAGTTATA CGGCTTATAA CGGAAGTCGT CGCTATATAC AGCTAGAGGG AGTGACTTCT 1440 

TCACAAAATT ATCAGAATCA ATCAGGAAAC ATCTCTAGCT ATG 1483 
(2) INFORMATION FOR SEQ ID NO: 231: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1027 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 231: 

CCCGGAAAAC AAGTTAAAGT TGAAGTTGGT CAAGCAGTTT ACGTTGAAAA ATTGAACGTT 60 

GAAGCTGGTC AAGAAGTTAC TTTTJ^CGAA TTGTTCTTGT TGGTGGTGAA AACACTGTTG 120 

TCGGAACTCC ACTTGTTGCT GGAGCTACTG TAGTTGGAAC TGTTGAAAAA CAAGGAAAAC 180 

AAAAGAAAGT GGTTACTTAC AAGTACAAAC CTAAAAAAGG TAGCCACCGT AAACAAGGTC 240 

ACCGTCAACC ATATACA/VAA GTTGTCATCA ACGCAATCAA CGCTTAATTT TAAGGAGAAC 300 

ACATGATACA GGCAGTCTTT GAGAGAGCCG AAGATGGCGA GCTGAGGAGT GCGGAAATTA 360 

CTGGACACGC CGAGAGTGGC GAATACGGCT TAGATGTCGT GTGTGCATCG GTTTCTACGC 420 

TTGCCATTAA CTTTATCAAT TCTATTGAGA AATTTGCAGG CTATGAACCA ATCCTAGAAT 480 

TAAACGAAGA TGAAGGTGGC TATCTGATGG TTGAAATACC AAAAGATCTT CCTTCACACC 540 

AGAGAGAAAT GACCCAGTTA TTCTTTGAAT CATTTTTCTT AGGTATGGCA AACTTATCGG 600 

AGAACTATTC TGAGTTCGTC CAAACCAGAG TTATCACAGA AAACTAACAC GGAGGAAAAC 660 

ATTATGTTAA AAATGACTCT TAACAACTTG CAACTTTTCG CCCACAAAAA AGGTGGAGGT 720 
TCTACATCAA ACGGACGTGA TTCACAAGCA AAACGTCTTG GAGCTAAAGC AGCTGACGGA 780

CAAACTGTAA CAGGTGGATC AATCCTTTAC CGTCAACGTG GTACACACAT CTATCCAGGT 840 

GTAAACGTTG GTCGTGGTGG AGATGATACT TTGTTCGCTA AAGTTGAAGG CGTAGTACGC 900 

TTTGAACGTA AAGGACGCGA TAAAAAACAA GTGTCTGTTT ACCCAATCGC TAAATAAAAA 960 

GGTCCATTGA ACCTTTTATC CCGAACCTTG AAATGTAGAG GTGAGGAAGC TAGAAACAGC 1020 

TTAAAAT 1027 



(2) INFORMATION FOR SEQ ID NO: 232: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1990 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 232: 

CGGTTCAAAT GGTGCAGGTA AATCTACGTT AATTAATTCT ATTGTAGGTT TTCAAGAGAT 60 

TTATTTAGGA GAAATAGAGT ATTGTGATAA AGATTTGATA GTTAGTTCTC AACCTTTTGC 120 

TCATTTAGGC TTTACTCCTC AAACCACAGT AATTGATTTT TATACTACTG TGAAGGACAA 180 

TGTAATATTG GGGCTGAACC TTGCTGGAAA GTTTGGGAAA AATGCTGAGA AGTTGTGTCA 240 

AATAGCCTTA GAAATTGTTG GGTTAGCTGA TAAAAAAAAT AATTTGGTAG AAACATTGTC 300 

AGGTGGACAA CTGCAACGCG TCCAGATTGC TAGAGCAATA GCTCATAATC CAGATTTTTA 360 

TATTTTAGAT GAACCTACCG TTGGTTTAGA TACTGAATCT GCCGAAAAAT TTTTAATGTA 420 

TTTAAAAGAT AAGAGTTTGG AAGGAAAAAC TATTATCATA TCTTCACATG ACATAAATCT 480 

ACTCGAAAAG TTTTGTAAAA AAATACTTTT TTTACAAAAT GGCTCCATAT CATTTTTTGG 540 

TGATATGCGT GACTTTGTAG ATAATTCAAC TATCAAATTA AATTTTTCAA TGCAGAATAG 600 

AATTTCTAGA TATCAAATTG AATTTTTAGA AAATTTTAGA TTTAAAGTTC ACATCGAAGA 660 

TAATGATAGT TTTACAATAG AAGTCCCTAT AGAAGAAAAG ATCTTAGATG TTATCAATGA 720 

GGTAGGAAAA GCATGTGAAA TTAAAAACTT TTCAACAAGT AAATTAACCT TACAAGAAAG 780 

TTATTTGCAA AGAATAGGAG GAGAAAAATG AAGGCTGATC AATTAAGGCA CAAATCGGAC 840 

TTAGGTTTAA GAGGTCTAGC GATTATTGCT AAAAATGAGA TTATTGCTTT TTTTAGAAGT 900 

AAAGGTTTAA TTATTTCTCA GTTTCTACAA CCAATCTTAT ATGTTGTTTT TATAATAATA 960

GGATTAAATT CTTCGATAAA GAACATTCAG TTTAATGATA TAAAAACCTC TTATGCAGAA 1020 

TATACAATCA TTGGTGTTAT AGCTTTATTG ATAATCGGGC AGATGACTCA AGTTATTTAT 1080 
AGGGTGACAA TAGATAAAAA ATATGGGCTA CTTGCTCTTA AGTTATGCAG TGGAGTTCGT 1140

CCTTTATATT ATATTTTAGG GATGAGTATC TATTCTATAT TAGGGTTGAT AGTTCAAGAA 1200 

ATTATTATAT ATATAATTAC GTTAGCGTTT GAGATAAATA TCGCAATGGA TAGATTTTTT 1260

TATACAGTTT TGTTATCTAT TGTTGTTTTA TTATTTTGGG ACTCCCTTGC AATTTTACTT 1320 

ACAATGTTTA TCAATGATTA CAGAAGACGT GATATTGTAA TACGTTTTGT ACTAACACCG 1380 

CTTGGTTTTA CAGCTCCTGT TTTCTACTTA ATAGATTCTG CTCCTAGTAT TGTGAGATGG 1440 

ATTGGTCAGT TAAATCCCTT AACTTATCAA TTAACTATTT TGAGAAACTT TTATTTTAAA 1500 

AATTCAACAA CTTTGGAATT AGTTTTCTTA TTGTTAACAT CATTACTTGT CCTTATATCT 1560 

GTATCTTTTA TTATACCAAA GATAAAATTG ATACTGATAG AAAGATAAAA GTTGGGTCAT 1620 

CCAACTTTTT TGTTGTCTCC CGAAAACCAC TAGCTATGCT AGTGGTTCCA TAGAGCTTTT 1680 

AGCGTGGTAA CAAAAAGAAC CTCCTAAAAT GATAAGATAG AAGTGGTTTC TCCGCCACTA 1740 

CAACATATCA TACAGGAGGT ACCTCATGAG AGAGGATAAT CAAAGTTTAT CACATACCAC 1800 

ATGGAATTGT AAATATCATA TTGTTTTTGC ACCCAAATAT CGTCGTCAAA TCATTTATGG 1860 

CAGATACAAA GCTAGTATCG GAAGAATCAT ACGTGACTTA TGTGAGCGTA AGGGTGTAAT 1920 

AATCCATGAA GCGAATGCTT GTTCAGACCA TATTCACATG CTTATCAGTA TTCCTCCGAA 1980 



;iCTTAGTGTT 1990

(2) INFORMATION FOR SEQ ID NO: 233: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4766 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 233: 

GAACTATATT GCATATATTT CTAGCAATGA TCATGGCGAA TCTTGGTCTG CACCAACTTT 60 

ATTACCTCCT ATAATGGGAC TTAATCGGAA TGCGCCATAT TTAGGTCCTG GACGTGGAAT 120

CATTGAAAGC TCAACTGGAC GTATTCTTAT TCCGTCTTAC ACTGGTAAAG AGTCTGCGTT 180 

CATTTATAGT GACGATAATG GAGCATCTTG GAAAGTTAAA GTAGTGCCAC TTCCTTCTAG 240 

TTGGTCAGCA GAAGCACAAT TTGTAGAATT GAGTCCAGGA GTAATTCAAG CATATATGCG 300 

TACAAATAAT GGTAAAATTG CATATTTAAC AAGTAAAGAC GCAGGTACTA CTTGGAGTGC 360 

ACCGGAATAT TTGAAATTTG TTTCAAATCC AAGTTATGGA ACACAATTAT CAATCATCAA 420 
TTATAGCCAA TTGATTGATG GTAAAAAGGC TGTCATTTTA AGTACTCCAA ACTCCACAAA 480

TGGTCGTAAA CACGGACAAA TTTGGATTGG TCTAATTAAT GATGATAATA CAATTGATTG 540

GCGTTATCAT CACGACGTTG ATTATAGTAA CTATGGATAC TCATATTCAA CATTGACAGA 600 

GTTACCAAAT CATGAAATTG GATTGATGTT TGAAAAATTT GATTCATGGT CTCGTAATGA 660 

ACTTCATATG AAAAATGTTG TACCATATAT AACATTTAAG ATTGAAGATC TGAAAAAGAA 720 

TTAAAGCTGA AATTTGAAAA TATATAAAAA GAGGATAAAA ATTATGGTAA ATTACGGTAT 780 

TGTTGGAGCT GGATATTTTG GAGCTGATTT AGCTCGCTCA ATGAACAAAA TTGAAGATGC 840 

AAAAGTGGTT GCGGTATTTG ACCCAAATCA TGGAGAAGAA GTTGCTCAAG AGTTGGGATC 900 

AGATGTTTGT GCAAGTTTAG ATGAACTTGT AGCACGTGAA GATATTGATT GTGTGATCGT 960 

AGCTTCACCT AGCTACCTTC ACCGTGAACC AGTTGTGAAA GCTGCTCAAC ATGGCAAACA 1020 

CGTATTTTGT GAAAAGCCAA TTGCATTGTC TTATGAAGAT TGTAAAGCCA TGGTTGACGC 1080 

ATGTAAAGAA AATAATGTCA TCTTTATGGC TGGTCACATC ATGAACTTGT TTAACGGTGT 1140 

ACACCATGCT AAAGAATTGA TTACTCAAGG TAAAATCGGT AAAGTTCTTT ATTGCCATGC 1200 

TGCTCGTACA GGTTGGGAAG AACAACAACC AACTGTATCA TGGAAGAAAC TTCGTTCTCA 1260

ATCTGGAGGA CATTTGTACC ACCATATTCA TGAATTAGAT TGCATTCAGT TTATCATGGG 1320 

AGGACTTCCT GAAAAAGCGA CAATGGTAGG AGGCAATGTA TATCATAAAG GTGAAAACTT 1380 

TGGTGATGAA GATGATATGC TCATTGTAAA CTTAGAATAC TCTGATGATC GTTATGCTGT 1440 

TTTGGAATAT GGTAATGCTT TCCGTTGGGG TGAACACTAC GTCTTGATTC AAGGAACTGA 1500 

AGGAGCTATC AAACTTGACT TGTTCAATAC TGGCGGTACT CTTCGTGTTA AAGGTGAAGG 1560

AGAATCACAC TTCTTAGTTC ATGAAACTCA AGAGGAAGAT GATGATCGTA CAGCTATCTA 1620 

TACCGGTCGT GGTATGGATG GAGCAATTGC GTACGGTAAA CCAGGAGTAC GTTGCCCATT 1680 

ATGGTTGCAA ACATGTATTG ATAAAGAAAT GGAATATCTA CATGACATCA TTAAAGGTGG 1740 

AGAAATTACA GAAGAATTTG AAAAACTTCT CAATGGTGTA GCTGCTTTAG AATCAATCGC 1800 

TACCGCTGAT GCATGTACTT TATCAGTTAA AGAAGATCGA AAAGTAAGTC TTTCAGAAAT 1860 

CACAAATGCT TAACTTTTGT AAAACAGAAT AGTAAATTCT TGTCATTATA TAATTTCTAA 1920 

AGTTCTGTGA TACAACTCAT TGAATAAAGA AATAGAGATG GGACTGGGAT AATGCCCAGT 1980 

CCCATTTTTT ATCAAAAAGT AATGAGATCA AAAATGTGGG AGTGTTGAAA TGAAGATTAT 2040 

AGGTATCGAT ATTGGCGGAA CAACAATTAA GGCAGATTTA TACGATGAGT TTGGAACGAG 2100 

TTTGAATCAT TTCAAAGAGA TAGAAACAAT TATTGACTAT GATTTGGGAA CGAATCAGAT 2160 

ATTAAATCAG GTCTGTGATT TAATTGGTGA GTATACTTTA AATCATTCAA TTGATGGTGT 2220 
TGGGATTTCC ACTGCTGGAG TTGTTAATGC TAATACTGGA GAAATCATCT ATGCAGGCTA 2280

TAC/ATACCA GGGTATATCG GAGTAAACTT TACTGCCGAA ATAGAAAAAC GTTTTGGGTT 2340 

GTATACTTTT GTTGAAAATG ATGTTAATTG TGCTGCATTA GGTGAATTGT GGAAGGGACA 2400 

AGCCATU^GAT AAGAAAAATG TAGTAATGGT TACTATTGGA ACAGGTATAG GAGGCAGTAT 2460 

TATTGTCAAC GGACAAATTG TTAACGGATT TAACTATACT GCTGGTGAAG TAGGTTATAT 2520 

TCCTGTAGGT AATTCGGATT GGCAAAGTAA AGCCTCAACA ACCGCATTGA TTCATTTATA 2580 

TCAAAAJ\AAG AGCTTGAAAA CTAATCAAAC TGGACGTACT TTCTTCACTG ATTTAAGATC 2640 

TGGAGATAAA GTTGCTGAAG AAACTTTTGA AATTTTTGTA GAAAATCTAA CAAAAGGTTT 2700 

ATTAACGATT TCTTATCTAC TTAATCCAGA AATTCTCATA TTAGGAGGTG GGATTCTGGA 2760 

TAGTAAGGAT ATTTTGTTAC CTGAAATTCA AAGTTCTTTA GCTAAAAATG CAATGGATAA 2820 

TAGGTTTTTA CCTAAAAATC TTGTGGCAGC TACATTAGGA AATGAAGCTG GTCGTATAGG 2880 

AGCTGTAAAA AATTTCTTAG ATAGAATTTC TAATAAATAG TATGTAAGAT AAGGAGGTGT 2940 

CACAATGACT AACTCTGTAT TTTCGACAAT GCAAGATATT GAGAATGTTG CAACCGATAT 3000

TATAAAATCA TATGATAATG AGATTTATAC TTATAAAGCT GTTTCCCAAG AAGAATTGGA 3060 

AAAACTAGAA AAAAGTTATG ATGAAAAAAG TCACGAAGAA TTAGTTTCAA TAGAAAGCAA 3120 

TTTARAAAl^G liJKprIiJ^c^^fl7^ n'^'^TTATTGA TGACGTTAAT AAAACAATCA AGGAAAATGA 3180

TGCAAATATT CAGTATATTT CATCAAGTAG GAGAGGAGAA TTTGTAGAAA AAATTATTGG 3240

TAGGGTGGTA GAAAAATATG GCCATTAGTC AGATGAAAAG AATCTCTCTA CTATTTTCTA 3300 

AAAGTAGTCT TGATGATGTT TTAAAAACTA TTCAAGAACT AGAGTCAGTG CAGTTCCGTG 3360

ATTTAAAGGT TCAGGATAAC TGGTCAGAAG CTCTAGAAAA AGATGAAGTT GTATTTCCAA 3420 

CTATTCAAAT TTTTCATACT TCTAATTCCA ATCATGGGGT TATTGAGGGA AATGATGCCT 3480 

TGACTTATTT GATGAATCAA CAACAACATT TAGAAGCAAC TGTAGAGAAA TTACAAGAAT 3540 

ACCTACCGAA AGAAAACACG TTTAAATTAT TGCAGCAACC TCCGATAACT ACCTCTTATG 3600 

AAGAATTAGA GAAATTTGGT AAAGCTAATG TTGCTGAGGG TGTTCTTAAA AAAGTGAATC 3660 

ATCAAATTAA CAGAGTTCAT GAATTAGAAA GACACATTCA AAGTAATAAT GAGGAAATAG 3720 

AGCGATTAAT AAAGTGGGAA AAATTAGAAA TTGTTCCTGC GAATTTAGAA CAATTTTCTT 3780 

TCTGTAAAGG AAAAGTCGGA ACAATTCCAA GGACTGAAGA TAATCGCTTA TACAATAGTC 3840 

TTTTAGAAAA CAATATTGAA GTTCAAGAAA TATTTTCTAA TGATAGAGAG TACGGTGTTG 3900

TTGTTTTCTA TCAGTCTAGT TACTCTATAG ATTTTGATGA ATACTTATTT GAACCATTTG 3960

ATTATTCTAG AAAGGAATTA CCGAAGCAGC GAGTAGTAGA TTTAGATCAA GAAAACATGC 4020

AGTTAATAAC TGAAAAAGAG AATATTATCG CATCGTTGCA AGATTCAAAG AAATATTTGA 4080 

TAGATTTACA ATGGCAAATA GACTATATTT TATCTATCTA TGCTCGTCAA ATCTCTAAGA 4140 

ATAACTTTTT GTGCACTCCG CATCTAGTTG CATTAGAAGG ATGGATAGAA GAAACTCGTA 4200 

TTTTATATTT TATAAAAGTT ATGGATGAGC ATTTTGGACA TTCTATTTAT ATTTATGAAT 4260 

CGGAAACATT GACGGATAAT CAAGATGAAA TACCTATCAA ATTAACGAAT CATTCTTTAA 4320

TTGAACCATT TGAATTATTG ACAGAAATGT ATGCTCTGCC CAAATATTAT GAGAAAGATC 4380 

CTACACCTGT ATTAGCACCA TTTTACTTTA CATTTTTTGG AATGATGGTT GCTGATTTAG 4440 

GCTATGGTTT ACTATTGTTT TTAGGAACAA TGTTAGCATT AAAAATTTTT CATCTACCTT 4500 

CAGCAACTAA GAGATTTTTA AAATTCTTTA ATATATTAGG GGTAGCCGTT GCAATTTGGG 4560 

GTGGAATCTA TGGCTCATTT TTTGGATATG AGTTGCCATT TCATCTGATA TCTACAACCT 4620 

CTGATGTCAT GACTATATTA GTAGTGTCAG TTGTGTTTGG GTTTATTACA GTATTTGCAG 4680

GTTTGTTAGC TTCAGGACTA CAAAAAGTAA GAATGAATAA ATATGCAGAA GCATATAATT 4740 

CAGGATTTGC GTGGTGTGTT ATTCTG 4766 



(2) INFORMATION FOR SEQ ID NO: 234: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2484 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 234: 

CCTTTTAGAA AAAATTAAAG AATACGACAC CATTATCATT CATCGTCATA TGAAACCAGA 60 

CCCTGATGCC TTGGGAAGTC AGGTGGGATT GAAAGCCTTG CTGGAACATC ATTTCCCAGA 120 

AAAAACCATC AAAGCCGTCG GTTTTGATGA ACCAACTCTT ACTTGGATGG CTGAGATGGA 180 

TCTTGTTGAA GATAGAGCCT ACCAAGGCGC ACTTGTCATC GTCTGTGATA CAGCTAATAC 240 

TGCTCGTATC GATGATAAGC GCTATAGTCA AGGTGATTTT CTCATTAAGA TTGACCACCA 300 

TCCAAATGAT GATGTATACG GTGACCTGTC TTGGGTCGAT ACTAGTTCAA GTAGCGCTAg 360 

aGaTGATTAC CCTATTTGCC CAAACAACCC AACTAGCCTT GGCAGATCGC GATGCTGAGT 420 

TGCTCTTTGC AGGAATTGTC GGTGATACAG GTCGCTTCCT CTACCCTTCT ACCACTGCAC 480 

GGACTCTTCG CCTGGCTGCT TATTTGAGAG AACATAACTT TGACTTTGCG GCTCTCACTC 540 

GCAAAATGGA CACTATGAGC TACAAAATTG CTAAACTGCA AGGCTACATC TACGACCATC 600 
TGGAAGTGGA TGAAAATGGT GCTGCTCGCG TTATCCTGAG TCAGAAAATC TTGAAACAAT 660

ACAATATAAC CGATGCTGAA ACTGCGGCCA TTGTAGGTGC ACCTGGACGC ATTGACAGAG 720 

TGAGTCTCTG GGGAATTTTT GTCGAACAGG CTGATGGCCA CTACCGAGTT CGCTTACGCA 780

GTAAAGTCCA TCCTATCAAT GAAATTGCCA AGGAGCATGA TGGTGGAGGC CACCCTCTAG 840 

CAAGTGGTGC TAATTCCTAT AGCCTAGAAG AAAACGAAAT CATCTACCAA AAGTTAGAAG 900 

ACTTGCTTAA AAACTGATAA AATACTTGCC AAACTTTTCA GAATCTGATA GACTAGTATA 960 

GTAACAATCT ATGGCTCGCA AAGAGACCAT GGCAGAAAGG AAATATTGCA AAATGAAAAr 1020 

AGATATCCAT CCAGAATATC GCCCAGTTGT CTTCATGGAC ACAACTACTG GTTACCArTT 1080 

CCTTAGCGGT TCAACAAAAC GCTCTAACGA AACAGTTGAG TTCGAAGGCG AAACTTACCC 1140 

ATTGATCCGT GTGGAAATTT CATCAGACTC ACACCCATTC TACACTGGAC GTCAAAAGTT 1200

CACTCAAGCA GATGGACGCG TGGATCGTTT CAACAAAAAA TACGGTCTCA AATAATGATA 1260

AGAGAACAGT TTTGGCTGTT CTTTTTTGTT TCTTGAAATC AACTGCTGTT TTCATGTTCC 1320 

AGACTCATCT GTAGGTTCGA TTTCCATGCT ACTAGGCAGG AAGGAAATAG CTGTTTCAAC 1380 

ACGTCCATAA TGAGCTATAC TATTGTCACG AACCACACTT TCATTGATGG TCCAAGTGGA 1440

ATTCATTTTC TTAAAAGCTT CTCGGACTTT TTCCAAATCT TTGGAGGCAA TGGCCTGCTC 1500 

TAAGGTTTCA AAACGAGGAC TTATACTCAT CTGCTTTCAA AAAGCATTCT AGTCCATCTC 1560 

CGATTACCGA TGGACTTTAT CACCTCCTTC TCCAGTCCTT GTATGACATC TTGAAGTTGA 1620 

TTCATGACAT CTTCCAAAGT TCgAAAGGCT TTATTCTTAA ATCCACGTTT ACGAATCTCT 1680 

TTCCACACTT GTTCAATGGG TTCATCTCTG GTGTGTATGG AGGAATAAAG GTAAAATCAA 1740 

TATTAGTCGG AATATTTAAG GTACTTGATT TATGCCATAT AGCATTGTCC ATAACGAGTA 1800 

AAAGGATAAG CTTGTGAAAG CTCTTCTAAA AAGGCGTTCA TCCACACTCC TTTTTATAAA 1860 

CCTGAAATAA GGCATCAATT GTAACAAATT CTCCTGCCTC TGTAGCCTTC AAATGACGGG 1920 

CAAGAAAGGC TTTCTCTTCC TCAACTGTCA TATATGCATG GTTACGACCA CCACGTGTTT 1980 

CTTGAAGGAG AGAGTCGAGT CCGAACTCCT CATATTTTTT TACGTTTCGC CAAATCGTTG 2040 

TTTGATTACA GTCTAAAAGC TCTATAATCT CTTTATAAGA TTTGCCCATC AGACGAAATA 2100 

TAGTAGATTG AAACTAGAAT AGTACACCTC TACTTCTAAA ACATTGTTAG AAATCGATTT 2160 

GTCCTGTTCT TGTTTCATTT TACTATAGAA CGATTTGAAG GCGTTTATAA TATTTAGCTG 2220 

TACGAGAGTC TTTTAAAAGT GTTTTGATGG TTTGGATTTC TTCTTTAGTT GATTTCATAT 2280 

TACTATTATA TAATGCTTTT TGATTTTAGT CTGGTATAAA TATTGCTTTC CTCCAAAATG 2340

GTCATAGTTT TACTGGCAAA TCTAACATAT CACGGATAAA TTAACAAGTG ATTTCTGAAT 2400

TGCTAAACAT TTTCTTTTCT TATAGCATAC TTTAAGATTT TGTCTTTGAG AAAGATATTT 2460 

CCAAGAAAAA CGTTCGTTTT TTGG 2484 



(2) INFORMATION FOR SEQ ID NO: 235: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1766 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 235: 

CTAGATATAG CTATAATTTT ATTTATAACA AGAGGATAGA AATGACCGAA TTAGAAAGAA 60 

AAAATCGAAA AATTAGCTAA GAAATATTCT GATAACTTAA ACATCAAAGT TCAAGAGAGA 120 

GTTCGTGAAA TGGCAAATGA TAATAAGAGC CATTATTTGA TATACAGAGT TTTAGGTATT 180 

TCATTTGAAG AAGGAGAAAA TATCGATTTG TATCAAAATA AAGGTCGTTT TTTATACAAA 240 

TATGCTGGTT CATTTTTAGA AGAAGCTGCA GTACTATGCT TTAACGAAAA ATTTGGTACA 300 

GAAAATACTT AAAAAGTTAA CATTCCTAAT TCTGAAAGTA CAAAACCTAA GACTTTTGAA 360 

ATTGATTGTT TAGTCGGAGA AAAACACGCA TACGAAATAA AATGGTGGGA TGCAACTACA 420 

GATGGAGACC ATATAACTAA AGAACACACT AGAATAAAAG TTATTCATAA CAAAGGATAT 480 

ATACCAATTC GGTTAATGTT CTACTATCCA AATAGAACTC AAGCTATAAA AATTCAGCAA 540 

ACTTTAGAAA CATTGTATAA CGGTATTGGA GGGAAATATT ATTATGGAGA TTCTGCCTGG 600 

GAACATTTAA GAGCAGTGAC CGGTATTGAT TTACTTAGTA TTCTAACAGA TATTGCAAAT 660 

AAAAAAACAG GGGTAAAATC AAAATGACAG TATTAAAAGG AGATAACTTA GAAATATTAA 720 

AAACTATTGA ATCCTCAAGT ATTGATTTAA TCTATATGGA CCCTCCTTTC TTTACACAGA 780 

AAACCCAAAA ATTATCTAAT AACAAAAATA TTATGTATTC ATTCGAAGAT ACGTGGACTT 840 

CGATTGAGGA TTACAAAGAA TTTTTGTCTG TAAGATTAGA AGAATGCAAA AGAGTGCTAA 900 

AAAATAGTGG CAGTATTTTC GTTCATTGTG ATAAAATTGC AAATCATCAT ATTAGATTAA 960 

TTTTAGATAA TATCTTTGGA GTAGATATGT TTCAAAGCGA AATTATATGG AACTATAAAC 1020 

GGTGGTCTAA TTCAAAAAAG GGATTATTGA ACAATCATCA AAACATTTAC TTTTATTCAA 1080 

AGTCAAAAGA TTTTAAATTT AATACAATTT TTACAGAGTA TTCTTCTACT ACAAATATCG 1140 

ACCAAATACT AGTGGAACGA AAACGAGATG GAAACTCTAA AACTATATAT AAGGTTGATA 1200 

ATAATGGTAA CTATATTCTA GCAAAAGAGA AAAATGGAGT TCCCCTTTCA GATGTTTGGA 1260

ATATACCATT TCTTAATCCA AAAGCTAAAG AAAGAGTAGG TTATCCTACA CAAAAACCTA 1320

TTCTGTTATT AGAACAAATT ATAAAGATTG CTACTGATAA AAATGATATA GTTTTAGACC 1380 

CGTTCTGTGG AAGTGGAACT ACTTTAGTAG CCTCCAAGAT TTTGAATAGA AATTATATGG 1440 

GGATTGATTT ATCTGAGGAA GCTATCAATA TAACTCAGCA ACGTCTGGAA AATGTTATAA 1500 

AAACAAGTTC AAATTTATTG AATAAAGGAA TCGAAGCATA TAGAACCAAA ACTGAGGAAG 1560

AGGAAAACAT TCTTAAATTA TTACAGGCAA AAATTGTTCA AAGAAATAAA GGAATTGATG 1620 

GTTTTTTACC TAAACATTTT CAAAAAAAAC CGATACCTAT AAAAATTCAA AAAAATAATG 1680 

AATGTCTGAA TGAGAGTATC TCTTTATTAC AGAATGCTAT AAACTCCAAA AAACTTGATT 1740 

TTGGAGTAGT TATAAAAACT CATTCG 1766 



(2) INFORMATION FOR SEQ ID NO: 236: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 748 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 236: 

CCGAAAATCA AATTCAAACC ACGTCAACGT CGCCTTGCCG TACTCAAGTA CAGCCTCCCG 60

CTAGTTTCCT AGTTTGCTCT TTGATTTTCA TTGAGTATTA AACTAAATTA AATAATATTA 120

GCGCGGAGAA TTTCTAATTC TTCCTTGGTC AAGCGACGCC ATTCCCCTCG TTCTAGGTTC 180 

TCATCTAATA CTAAAGTTCC CATAGTCAAT CGTTGCAAGT CCACCACTTC CTTGCCACAG 240 

TAGCCCACCA TACGCTTGAT CTGATGAAAC TTCCCTTCTG CAATGGTCAC ACGGATTTGG 300 

CTTTGATTCT TTTCTGTATC TATGGATACA AGCTCCAGTA TAGCGGGTTG ACAGGTAAAG 360 

TCTTTGAGAG GAATACCCTC AGCAAATGTC TCCACATCTT CTTGGGTCAT GATTCCCTTG 420 

ACTTGTGCCA GATAAGTCTT GTCCACATGA CGCTTGGGCG AAAGAAGAAC ATGAGCCAGC 480

TGACCATCAT TGGTCAAGAG CAAAAGACCA TGCGTGTCAA TATCCAAGCG TCCTACTGGG 540 

AAAACTTCCT TACTCCGCGC CAAGTCATCC AACAAGTCCA GAACGGTTCT GTGCTTGGGA 600 

TCCTCAGTCG CTGAGATAAC TCCTTTGGGC TTGTTCATCA TGTAGTAGAC AAACTCTTCA 660 

TACTCCAACA CTTGCCCATC AAAGCGAATC TCATCTATTT TTTCATCAAT CTGCAATTTA 720 

GCTGATTTTT CTTTTTGACC ATTTACAG 748 
(2) INFORMATION FOR SEQ ID NO: 237: 
(2) INFORMATION FOR SEQ ID NO: 238:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 904 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 238: 

TACCCGCTTC TTTCAAGAGT TGGAGCAGGG CTTGTTTGCG ATCTTTTGTC ATAGTTCTTC 60 

CTTTTAACGG CGTTTTCGAA GCACTTTATA GACAGCTAGT GCTAATGTAT AGTCTACCAT 120 

ACTATGGATA ATTGTACCAA ATCCAACTAG TACAAATAGA ACATAAAACA TATTTTCTAC 180 

ATTGGTACCA GAAGTTGCGT AAAAAACGAC ACAGGCCAAT ACTTCAGCAA GGGCATGAAC 240 

AACAGCCAAA ACAAAGTTGA AAATCCAGGA AGATTTTGGT TTATCTAGGG TATCGGGGAA 300 

TTTTTGTAGG TAAAGAGCTC CTAAAGCACC AAAAGATATA TGGGAAAAAG CCCGAAAAAC 360 

GATAACCATG GGATAGCCAG CCATCAAAAA TCCAAAACTA GAGGCTAGGA TGACAAAAAC 420 

TGCCATCAAG GGCGACAAGA ACATGGCTAT AAAAATAGCG ATGTGGCTCC CCAAAGTATA 480

GGAAGCAGGT GGAATGACAA TCTTGAAAGG CATAACAATT GGAATCAAAA TCGCAATAGC 540 

CGTTAAAAGG GCTGTCATTG TCATAAATTG TGTCTTTTTC CGTGTATTCA CAAGAATCTC 600 

CTTTTTAACT GCATATACAC TAGTATGGTA CAATAAACCA GACAATAAAG CAAGAATTTA 660 

CTTGGGTTTA TAGATCATTT TTTAGTTAAA AGTTATAGTA GATTGAAACT AGAATAGTCC 720 

ACCTCTACTT CTAAAACATT GTTAGAAATC GATTTGGCTG TCCTGATCGA TTTGTCCTGT 780 

TCTTATTTCG TTTTACTATA GTAAAGATTT CATTAAAAAG AAACTGTATA GAGCAAAATC 840 

TCCACCTTCA GGTTTGGAAA GCGGAGATTG TTTnTTATTT TTTCCAGGGT TTGTAGTCGT 900 

GGGA 904
(2) INFORMATION FOR SEQ ID NO: 239: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 946 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 239: 

CACTCAAACA TGACTTATAT CAAGACGGAT GGACTTCAAG ACGATGCCAA TCGCTTGAAT 60




CGTAACATTC AGTTTGGTGT TCGTGAATTT GCAATGGGAA CAATCTTGAA CGGGATGGCC 120 

CTTCATGGTG GACtTCGTGT ATACGGTGGA ACTTTCTTCG TCTTCTCTGA CTATGTGAAG 180 

GCAGCTGTCC GCTTGTCAGC CTTACAAGGA CTTCCTGTGA CTTATGTCTT TACCCATGAT 240 

TCAATCGCAG TTGGGGAAGA TGGTCCGACT CATGAACCAG TTGAGCATTT AGCAGGTCTT 300 

CGTGCTATGC CAAATCTAAA TGTTTTCCGT CCAGCAGATG CGCGTGAAAC GCAAGCAGCT 360 

TGGTACCTTG CAGTGACAAG TGAGAAAACA CCAACTGCCC TTGTCTTGAC ACGTCAAAAT 420 

TTGACTGTTG AAGATGGAAC AGACTTCGAC AAGGTTGCTA AAGGTGCTTA TGTTGTATAT 480 

GAAAATGCAG CCGACTTTGA TACCATCTTG ATTGCGACAG GTTCAGAGGT TAATCTTGCT 540 

GTCTCAGCTG CTAAAGAATT GGCTAGTCAA GGCGAAAAAA TCCGCGTAGT CAGCATGCCA 600 

TCTACAGATG TCTTTGATAA ACAAGATGCA GCTTACAAGG AAGAAATTCT TCCAAATGCA 660 

GTCCGCCGTC GTGTTGCAGT CGAAATGGGT GCAAGTCAAA ACTGGTACAA ATATGTTGGT 720 

CTCGATGGTG CCGTTCTAGG TATTGATACT TCGGAGCCTC TGCCCCAGCA CCAAAAGTAT 780 

TGGCAGAATA TGGCTTTACT GTAGAAAATC TTGTAAAAGT TGTTCGAAAC TTGAAATAAT 840

CCTAAAAATC AGGGCGTAAG CTCTGGTTTT TCTTACCAGA AAAGTAAGGT ACAATCTTGT 900 

AAAAGTAGCT GAAATTTGAT ATAGTAGTCC TATGTAAAAG ACAAAG 946



(2) INFORMATION FOR SEQ ID NO: 240: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2764 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 240: 

CGGGGCTCCc TAGTTCTTAG GGAGCTATTT TTGTTTTTTC AAGAAGTTAT CTTCTTGTAT 60 

TTTATACTCA ATGAAAATCA AAGAGCAAGC TAGGAAACTA GCCGTAssTG CTCAAAACAC 120 

TGTTTTGAGG TTGTAGATAA GACTGACAAA GTCAGGAACA CATATCTACG GCAAGGCGAC 180 

GTTGACGCGG TTTGAAGAGA TTTTCGAAGA GTATTAGTTG TGAATCTGGT GCAGTCGTCC 240 

CAGATTATTC TTATTAGTAG GGTCTTGTTT TCTATATCCC CTCGTAGTTA ACAAGACCTT 300 

GAGCATTTTA GAAAGAGGAA TCTATGTCTA CGAAATATAT TTTTGTAACT GGTGGTGTGG 360 

TATCGTCCAT TGGGAAAGGG ATTGTGGCAG CGAGTCTAGG CCGTCTCTTG AAAAATCGTG 420 

GTCTCAAAGT AACCATTCAA AAGTTTGACC CTTATATCAA TATTGATCCG GGAACCATGA 480 

GTCCTTACCA GCACGGGGAA GTTTTTGTGA CAGATGACGG AGCTGAGACA GATTTGGACT 540 
TGGGTCACTA TGAACGTTTC ATCGATATCA ATCTCAACAA ATATTCCAAC GTGACAACTG 600

GGAAAATTTA CAGTGAAGTT CTTCGTAAAG AACGCCGTGG AGAATACCTT GGGGCAACTG 660 

TTCAAGTCAT TCCTCATATC ACAGATGCTT TGAAAGAAAA AATCAAGCGT GCCGCTCTAA 720 

CGACCGACTC TGATGTCATT ATCACAGAGG TTGGTGGAAC AGTAGGAGAT ATCGAGTCCT 780 

TGCCATTCCT AGAGGCTCTT CGTCAGATGA AGGCAGATGT GGGTGCGGAT AATGTCATGT 840 

ATATCCATAC AACCTTGCTT CCTTACCTCA AGGCTGCTGG TGAAATGAAA ACCAAACCAA 900 

CCCAACACTC TGTCAAAGAA TTGCGTGGCT TGGGAATCCA ACCAAATATG TTGGTTATTC 960 

GTACAGAAGA GCCAGCTGGT CAAGGAATTA AAAATAAACT GGCCCAGTTC TGTGATGTGG 1020 

CACCAGAAGC CGTTATCGAA TCGTTGGATG TTGAACACCT TTACCAAATT CCACTGAACT 1080 

TGCAGGCACA AGGGATGGAC CAAATTGTTT GTGATCATTT GAAATTAGAC GCACCAGCAG 1140 

CGGATATGAC AGAATGGTCA GCCATGGTGG ACAAGGTCAT GAACCTCAAG AAACAAGTTA 1200 

AGATTTCCCT TGTTGGTAAG TATGTGGAGT TGCAAGATGC CTATATCTCA GTGGTCGAAG 1260

CCTTGAAACA CTCTGGCTAT GTCAATGATG CAGAAGTTAA AATCAATTGG GTCAATGCCA 1320 

ATGATGTGAC AGCAGAGAAT GTAGCAGAAC TCTTGTCTGA TGCGGACGGG ATCATCGTAC 1380 

CAGGTGGTTT TGGTCAACGT GGTACAGAAG GGAAAATCCA AGCCATCCGC TATGCGCGTG 1440 

AAAATGATGT TCCAATGTTG GGAGTCTGCT TGGGAATGCA GTTGACATGT ATCGAGTTTG 1500

CTCGTCACGT TTTAGGTCTT GAAGGTGCCA ATTCTGCAGA GCTTGCACCA GAAACAAAAT 1560 

ACCCTATCAT TGATATCATG CGTGATCAGA TTGATATTGA GGATATGGGT GGAACCcTTC 1620 

GTTTGGGACT TTATCCGTCT AAGTTGAAAC GTGGCTCTAA GGCTGCTGCT GCTTATCACA 1680 

ATCAAGAAGT GGTGCAACGC CGTCACCGTC ACCGTTATGA GTTTAATAAT GCCTTCCGTG 1740 

AGCAGTTTGA GGCAGCAGGT TTTGTCTTTT CAGGAGTTTC TCCAGACAAT CGTTTGGTAG 1800 

AAATCGTGGA AATTCCTGAA AATAAATTCT TTGTAGCTTG TCAGTATCAC CCTGAACTGT 1860 

CAAGCCGTCC AAACCGACCA GAAGAACTCT ACACTGCCTT TGTTACTGCA GCAGTTGAGA 1920 

ACAGCAATTA GCAAAATCAG AACCTTTGAG AAAAATCTCA GAGGTTTTTT GCATACGATG 1980 

ATATTGCAGT ATATCTGAGG TAGGGGTCCT CTGTATGTAC CTGCTACCGT TGAAATCAAT 2040 

AGCGACTCCC TCTTGCCCTG TGCTAGTGAA TGGATTTATC AGTATATTGA AATGAAATAA 2100 

AATTTGAACA AATTAATTCG GAAAGCCAAA TCAATTTCTA GCAAAGTTTT AGGAACTGGA 2160 

TTGTATAGTG AATTGAAATA AGATGTGAAC ATCTCTATCA GGAAAGTCAA ATTAATTTAT 2220 

AGAAATATTT TAGCAGTCAA GATGTACTGT TATAGATTCA ATACATTATA CTTTTTTAAT 2280 




TTAATCCACT ATAGTAAAAT GAAATAATAA CAGGACAAAT CGATCAGGAC AGTCAAATCG 2340 

ATTTCTAACA ATGTTTTAGA AATAGAGGTG TACTATTCTA GTTTCAATAT ACTATCCCAA 2400

ATCATTCATA CCTCTCTCAA CTAGATGTAA CTTACAAAAC CCCTGACCTC ATGAGCCACT 2460

TTCTTCCTCC TCATGAGGTC AGTTTTACTT TCTGCTGTTC CAGTATCGTT TTTCCTCGCT 2520 

AGATTTCCTC AAAAGGGCAG ACTCCTCCCT TGGTGCGTCA CACGATTTTT TCATCTCGAC 2580 

TGTTCTTTAA TGCATCATTA ACGACGCTTT TCTTCTAGGT GGTTCATAAG GAACAGGAAG 2640

ATTCAGGTTG ACTTTTCTAA TCCTAGAATA AAGTGCTGAA AACAATTCGG AATAGGCATA 2700 

GAGACTAGAC AATTTGAGGA GCTGCTTGCG TCCTGTTCGA ACACATTTTC CCACCACGTG 2760

AAGA 2764 



(2) INFORMATION FOR SEQ ID NO: 241: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1682 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 241:

CCGTTTTTTT CATTGTTCAG TACTACAACT TACGTTGTAG CGCCCTGCAC ATTGGTTCGT 60 

CTTGTTCAGT TTTCAAAGGT CTTTGTCACT TGCTTCTCTC AAGCGACAAC TATATTAGTA 120 

TATCACAACT GCTTTCGCTT GTCAACACTT TTTTGAAGAT TTTTAAGTTT TTTTAAACTT 180 

TTTTTCATCA AGTGGTCCTG ACGCAACATA CCATAGTCCG TACGGGATTC GAACCCGTGT 240 

TACCGCCGTG AAAAGGCGGT GTCTTAACCC CTTGACCAAC GGACCTGAGT TGTTATTTTC 300 

AACTCTTACT ATTATACAGT CTTTTCAAAC TTTGTCAACT ACTTTTTTAA ACTTTTTTTA 360 

TTAATTTTAC AACAGCTTCA GTTCGAGCTG TATGTGGGAA CATATCGACC GACTGGATAT 420 

AATGAAGATC ATAGACTTCT ACTAAGCGTA CCAAATCACG AGCCAAGGTC GAAACATTAC 480 

AAGAAATATA AACCATTTTT TCTGGTACAT AAGTAAGAAT AGTATCTAAT AACTTATCAT 540 

CCAGACCTGT ACGTGGTGGG TCAACAATCA AAGCATCTGC TCGGTAGCCT TCCTTGTACC 600 

AACGAGGAAT AATCTCTTCT GCCGTTCCAG CTTCATAATG AGTATTGTCA AATCCCATTC 660 

TTTTAGCATT TCGCTTGGCA TCTTCAATAG CTTCTGGAAT AATATCCATA CCTCTGAGTG 720 

TTTTTACTTT CTTTGCAAAG GCAAATCCAA TCGTTCCAAC TCCACAATAA GCGTCAATCA 780 

AATGGTCTTC TTTATCAACA TCCAGCGCTT TTACTGCTTC GCTATAGAGG ACTTCTGTTT 840 

GCTCAGGATT TAGTTGATAA AAAGCTCGAG GGGATAGTGA AAATTCATAA TTGAGTACAC 900 

CTTCTTGAAT ACTCTCTTGC CCCCAGATAA TCTCTGTCTT TTCACCATAT ATCTCACTGG 960

TTTTAGCTGT ATTTGTATTA ACAGCTACTG TCACAACTTC TGGGAAATCT TTAACCAACT 1020

CTTTTACCAA TTGAGTTAAA TTAAGCTGGC GGTTTGTAAC AATAAT7VATC TGAACCTGTC 1080

CGGTCTTTCT CGCGCGTCGG ACCATAATAG TACGGACACC TAGAACTTTT CTCTCATCCG 1140

TGATTGGAAT CTGGTGATAA GTAAGTAATT CTGCTAAGCG ATTAGCAATC ACTTGGGTTT 1200

CCTTATCTTG TACCAGGCAG TCTTTCAACT CTACTAAATA GTGAGAGTTT TGTGCATATA 1260

AGCCCGCCTT GACCTGATTT TTAAATTTTC GAGTCTGAAA TTGTAACTTA GCTCTGTAAT 1320

ATTTTGGTTC CTGCATTCCA ATAGTTGGAC GAATTTCATA ATTTTCATAT CCTGCAGGAG 1380

CAAATTTTTT CAGCGCTTGA TGAAGTAAGT CCGTCTTGAA CTCCAGCTGC TTATCATAAT 1440

GCAGGTGCAT GATTTGGCAG CCTCCGCATT CATTATAAAT AGTACAAGAT GGCACAATTC 1500

GAAATTTAGA CTTCTTGTTG ACCTTCAGTA ATTTTGCTTC AACAAAGTTG CGTCTAATAG 1560

AAGTAATCTG ACAATAGATA TCTTCGCCTT TGAGAGCTCC TGGTACAAAG ACTAATGTTT 1620

TTTGGTAAAA GCCGATTCCC TCACCGTTAA TTCCCATGCG CTTGATTTTT AATGGTATTT 1680

TT 1682



(2) INFORMATION FOR SEQ ID NO: 242:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2524 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 242: 

TTAACTTTGG TCAATTCTTT AAAGTCATCC TCTGTAAGCA TGTCTAACCA TTGATGTTTC 60 

CCTTTATTGC TAAAATCACC AATTCCGACT ACAGCTATAT CTAAATCTTT CCAACTATTT 120 

TTCAAATTTT CAAAATATCT TGATTGCAAA ATACCATCTG CTAACAATTT ATTTTCTTGC 180 

ACAATCGTTG CATTCATAAA TGTACACTCT CCATGAAATT TTCTAGACAT TTCATAAATC 240 

AGTGTATTCA CATGGTATTT AGCGTGTATG TGACTAGGAC CACCTGCTAG AGGATAGAAG 300 

TGAACATTTC GGACACTTTT ACTGTGAATT AAATCTACTA AATTACTTAA ACTTTTCCCC 360 

CAAGAAAAGC CAATTTTCAT ATTATCATCA ATTAGATTCC TAAGGACGCC TGCTGCAACT 420

TGAGAAATTC TTTCAGATAA AATTGTTGGA GTATCATCAA ATTCATTTGG AATAATTTCT 480 

AAACTTTCCA AACTGTATTT TTCTTTTACA TAATTTTCCA ACTTAAACAT ATTGGTATCA 540 
AAATTCTCTA TTTCAATTTT AACAATTCCT ACATTCCTTG CTTCTGTTAA CATTCTACTA 600

ATAGAGGTTC TATAAATTCC TAATTTTGCT GCTATTTGTG ACTGATTTAA GTTTTCAATA 660 

TAATACAGAT AAGCAATTTT AGAAAGCAGT TTATTCCTAT CTTGATTCAT ACACTTAACC 720

TCTTACGAAA CTACCTTAAC CATTATCCCA GCATTTTCTA ATGTAGCTAT ATTTTGTTTA 780 

GAAAGTTTTT CGTCTGTTAT TACTTCATAG ACTTGACTTA AAGCAAATCT TCTTACTGTA 840 

CCTCTTTTAT CAAATTTACT TGAGTCAGTT AGGACAATGA CTTTATCCGA CACTGCTGAA 900 

ATATATTGAA CTACCTCACT GCGCATTAAA TCTTTTCCGG TAAAGCCCAT CTCTTTATCG 960 

TAACCATCTG TCCCAACAAA AGCTTGACAC ACATGAAAAG TCTGTATCAT TTCTTTTAAT 1020 

AAAGGTCCTA CAGTCACCTG TGAATCTTTC TGAAACTCAC CACCAAGAAC AATAACACGA 1080 

CATGAATCAT AAGCTCTCAC AAAATTTGCT ATAAAAAACG AATTTGTTAC AATCGTAACA 1140 

TTTCTTTTTT GCTTGCAAAT TTCCTCAGCA AGTAAAGCAC AGGTCGATCC AGATTCTATC 1200 

ATTATTGTTT CATTATCTGA CACCAATTTT ACTGCTTCCT GAACAATTTT TCTCTTAGTT 1260

TCATAATTAA TTGACAAACG TACATTTAAG TCATCTCCAC TATTTAATAC AGCATATCCA 1320

TGCTCTCTGT GTAATAAACC TTTTGACTCT AATTTATCTA AATCTTTTCT AATCGTTACT 1380 

TTCGATACAT TTAATTTTTC CGATAATGTA TTAACGTCGA TCTTTTCATA TTCTGATACT 1440 

AATTTAATAA TTTGTTCCAA TCTTTTCATT TTACACCTCC GTTTTATTCT ACCAAAATAA 1500 

AAAGCAAAAA ACAACAAATT AACCTTTCGT TCGTAATTGT TTTTCTTTCG TTTTTGTGAT 1560 

AGGATAGACT TATGAAGAGG AGGAACTCTT ATGGAAATAT CTAAAGGAAT TATTTTTAAT 1620 

ATTCAACACT TTTCAATTCA TGACGGTCCG GGTATTCGTA CAACTGTTTT TTTAAAAGGA 1680 

TGTCCTCTGC GCTGTCCATG GTGTTCTAAT CCTGAATCTC AAAGAATGAA ACCTGAAAAA 1740 

ATGAAAGATG CTCAACGAGA GAAATTCACC TTAGTCGGTG AAGAAAAGAC TGTAGAAGAA 1800 

ATTATTACAG AGGTATTAAA AGACAAAGAA TTTTACGAAG AATCCGGTGG AGGTTTAACT 1860 

TTATCAGGAG GTGAAATATT TGCTCAGTTT GAATTTGCTA AAGCCATCTT AAAATCAGCT 1920 

AAAGAACATC ACATACACAC TGCCATTGAA ACTACTGCCT TTGTTGATCA TGAAAAATTT 1980 

ATTGATTTAA TTCAATATGT GGATTTTATC TACACAGACC TAAAACATTA TAATTCTATA 2040 

AAACATAAAA AAGTGACTGG GGTTTTTAAT CAAATGATTA TTAAAAACAT TCATTATGCT 2100 

TTTTCACAAA ATAAAACTAT CGTTTTAAGA ATCCCAGTTA TTCCTAATTT TAACAATAGT 2160 

TTAGAGGATG CAGAAAAATT CGCTACTCTA TTTAACTCAT TAAATATCGA CCAAGTTCAA 2220 

CTACTCCCTT TTCATCAATT TGGTG7VAAAC AAATATCGTT TATTAAATCG GAAATATGAA 2280 

ATGGATGGAA TCAACGCACT TCATCCwGAA GATCTTATTG ATTATCAAAA GGTATTTCTG 2340 
AACCACCATA TTAATTGTTA TTTCTAGTTT ATTTCCTTGA AATGCTCTAG CTATTTGCAG 2400

ATAACAAGCA TCTATAATAC ATACTTAACT TTTCAAAAGG TTTAGCTAAA AAATTTTAGC 2460 

CAAACCTTTT CTATTTTACC TTGCTCTAGA ATTTTTAAAC TGCTATACTT ATCACAAAAA 2520 

AACG 2524 



(2) INFORMATION FOR SEQ ID NO: 243: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2359 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 243: 



CGTGCTTGGG GGCTTGTGGT CAAAAGGAAA GTCAGACAGG AAAGGGGATG AAAATTGTGA 60

CCAGTTTTTA TCCTATCTAC GCTATGGTTA AGGAAGTATC TGGTGACTTG AATGATGTTC 120

GGATGATTCA GTCAAGTAGT GGTATTCACT CCTTTGAACC TTCGGCAAAT GATATCGCAG 180

CCATCTATGA TGCAGATGTC TTTGTTTACC ATTCTCATAC ACTCGAATCT TGGGCAGGAA 240

GTCTGGATCC AAATCTAAAA AAATCCAAAG TGAAGGTCTT AGAGGCTTCT GAGGGAATGA 300

CCTTGGAACG TGTCCCTGGA CTAGAGGATG TGGAAGCAfiG OGATGG.AGTT GATCi/JU\AAA 360

CGCTCTATGA CCCTCACACA TGGCTAGATC CTGAAAAAGC TGGAGAAGAA GCCCAAATTA 420

TCGCTGATAA ACTTTCAGAG GTGGATAGTG AGCATAAAGA GACTTATCAA AAAAATGCGC 480

AAGCCTTTAT CAAAAAAGCT CAGGAATTGA CTAAGAAATT CCAACCAAAA TTTGAAAAAG 540

CGACTCAGAA AACATTTGTA ACACAACATA CAGCCTTTTC TTATCTAGCG AAGAGATTTG 600

GGCTTAATCA ACTTGGTATT GCAGGTATCT CTCCTGAACA AGAACCAAGT CCACGACAAC 660

TAACAGAAAT TCAGGAATTT GTTAAGACCT ATAAGGTTAA AACGATTTTT ACAGAAAGTA 720

ACGCTTCTTC AAAAGTAGCT GAAACTCTTG TCAAATCAAC AGGTGTGGGT CTTAAAACTC 780

TGAATCCTTT AGAGTCAGAC CCACAAAATG ACAAGACCTA TTTAGAAAAT CTTGAAGAAA 840

ATATGAGTAT TCTAGCAGAA GAATTAAAGT GAGGAAAGAA TGAAAATTAA TAAAAAATAT 900

CTAGCAGGTT CAGTGGCAGT CCTTGCCCTA AGTGTTTGTT CCTATGAGCT TGGACGTTAC 960

CAAGCTGGTC AGGATAAGAA AGAGTCTAAT CGAGTTGCTT ATATAGATGG TGATCAGGCT 1020

GGTCAAAAGG CAGAAAACTT GACACCAGAT GAAGTCAGTA AGAGGGAGGG GATCAACGCC 1080

GAACAT^TTG TTATCAAGAT TACGGATCAA GGTTATGTGA CCTCTCATGG AGACCATTAT 1140

CATTACTATA ATGGCAAGGT TCCTTATGAT GCCATCATCA GTGAAGAGCT CCTCATGAAA 1200

GATCCGAATT ATCAGTTGAA GGATTCAGAC ATTGTCAATG AAATCAAGGG TGGTTATGTC 1260

ATTAAGGTAA ACGGTAAATA CTATGTTTAC CTTAAGGATG CAGCTCATGC GGATAATATT 1320 

CGGACAAAAG AAGAGATTAA ACGTCAGAAG CAGGAACGCA GTCATAATCA TAACTCAAGA 1380 

GCAGATAATG CTGTTGCTGC AGCCAGAGCC CTlAGGACGTT ATACAACGGA TGATGGGTAT 1440 

ATCTTCAATG CATCTGATAT CATTGAGGAC ACGGGTGATG CTTATATCGT TCCTCACGGC 1500 

GACCATTACC ATTACATTCC TAAGAATGAG TTATCAGCTA GCGAGTTAGC TGCTGCAGAA 1560 

GCCTATTGGA ATGGGAAGCA GGGATCTCGT CCTTCTTCAA GTTCTAGTTA TAATGCAAAT 1620 

CCAGCTCAAC CAAGATTGTC AGAGAACCAC AATCTGACTG TCACTCCAAC TTATCATCAA 1680 

AATCAAGGGG AAAACATTTC AAGCCTTTTA CGTGAATTGT ATGCTAAACC CTTATCAGAA 1740 

CGCCATGTGG AATCTGATGG CCTTATTTTC GACCCAGCGC AAATCACAAG TCGAACCGCC 1800 

AGAGGTGTAG CTGTCCCTCA TGGTAACCAT TACCACTTTA TCCCTTATGA ACAAATGTCT 1860

GAATTGGAAA AACGAATTGC TCGTATTATT CCCCTTCGTT ATCGTTCAAA CCATTGGGTA 1920 

CCAGATTCAA GACCAGAAGA ACCAAGTCCA CAACCGACTC CAGTIACCTAG TCCAAGTCCG 1980 

CAACCAGCTC CAAGCAATCC AATTGATGAG AAATTGGTCA AAGAAGCTGT TCGAAAAGTA 2040 

GGCGATGGTT ATGTCTTTGA GGAGAATGGA GTTTCTCGTT ATATCCCAGC CAAGGATCTT 2100 

TCAGCAviAAA CAGCAGCAGG CATTGATAGC AAACTGGCCA AGCAGGAAAG TTTATCTCAT 2160 

AAGCTAGGAA CTAAGAAAAC TGACCTCCCA TCTAGTGATC GAGAATTTTA CAATAAGGCT 2220 

TATGACTTAC TAGCAAGAAT TCACCAAGAT TTACTTGATA ATAAAGGTCG ACAAGTTGAT 2280 

TTTGAGGCTT TGGATAACCT GTTGGAACGA CTCAAGGATG TCTCAAGTGA TAAAGTCAAG 2340

TTAGTGGAAG ATATTCTTG 2359 
(2) INFORMATION FOR SEQ ID NO: 244:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1052 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 244: 

TTCTTTCTGC TATAATCGTA TAAAATACTT ACTTTAGGAG TTCTTATGAA AGTTGTTAAA 60 

TTTGGAGGTA GTTCTCTTGC CTCTGCTAGT CAATTAGAAA AAGTTTTAAA CATCGTCAAA 120

AGCGATTCAG AGCGTCGTTT TGTAGTCGTT TCTGCGCCTG GTAAACGCAA TGCTGAAGAT 180 
ACTAAGGTTA CGGATGCCCT GATTAAATAC TACCGCGACT ATGTTGCGGG TAACGATATT 240

AGCAAGAACC AAAGCTGGAT TATCGACCGC TATGCTGCTA TGGTTAGTGA ATTGGGACTA 300 

AAACCAGCTG TGCTAGAAAA AATTTCTAAA AGCATTCACG CCTTGGCCAC TCTTCCTATT 360 

GAAGAAAATG AATTTCTCTA CGATACTTTC CTAGCAGCCG GTGAAAATAA CAATGCCAAA 420 

TTGATTGCTG CCTACTTTAA CCAAAATGGT ATCGATGCAC GCTATATGCA CCCTAGAGAA 480

GCTGGGATTG TGGTCACAAG TGAACCTGGT CACGCTCGCA TCATTCCATC AAGTTATGAC 540 

AAGATTGAAG AATTGACAAA CACCAATGAA GTCCTTGTCA TTCCTGGTTT CTTTGGTGTC 600 

ACTAAGGAAA ATCAAATCTG TACTTTCTCA CGTGGAGGTT CTGATATTAC AGGTTCTATC 660 

ATTGCTGCTG GTGTCAAAGC TGACCTCTAT GAAAACTTTA CGGACGTTGA TGGTATCTTT 720 

GCAGCCCACC CTGGTATTAT CCACCAACCA CACTCGATTC CTGAGTTGAC CTACCGTGAA 780 

ATGCGCGAGT TGGCCTATGC AGGCTTCTCA GTCCTTCATG ACGAGGCTCT TCTTCCTGCC 840 

TACCGTGGAA AAATTCCTCT GGTTATCAAG AATACCAACA ACCCTGACCA TCCAGGTACT 900 

CGTATCGTTC TAAAACACAG TAATGATGAA TTTCCAGTTG TGGGAATTGC TGGTGACTCA 960 

GGCTTTGTCA GCATTAACAT GTCGAAATAC CTCATGAACC GTGAGGTTGG ATTTGGCCGC 1020 

AAGGTTCTGC AAATCCTGGA AGAACTTAAC AT 1052 



(2) INFORMATION FOR SEQ ID NO: 245: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 855 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 245: 

CCCTCGAAAA CTAAGCCGAT GAAGTCAGAA CACTTCAATC CTGTTCGTGA CTGGTGGGAA 60 

AATCGTGAAG AGATTCTGGA AGGTAAGTTC TACAAATCTA AATCATTTAC ACCTAGTGAA 120 

TTGGCTGAGT TGAATTATAA TTTAGACCAG TGTGACTTTC CAAAAGAGGA AGAGGAAATC 180 

TTAAATCCCT TTGAGTTGAT TCAGAATTAT CAAGCGGAAA GAGCAACTTT AAATCATAAG 240 

ATTGATAATG TATTAGCTGA TATTTTGCAG TTGTTGGAGG ACAAATAATG ACACCAGAAC 300 

AACTTAAAGC AAGTATTCTC CAAAGAGCGA TGGAAGGGAA ATTAGTGCCG CAAAATCCCA 360

ATGACGAACC TGCAAGTGAA TTATTAAAGA GAATTAAAGC TGAAAAAGAA A7VACTTATCA 420

GTGAAGGAAA AATCAAACGA GATAAAAAGG AAACTGAGAT ATTTCGTGGT GATGATGGGA 480 



AACATTATGG GAAGTTTGCT GATGGAAGCA CTCAAGAAAT TGATGTTCCT TATGATATTC 540

CTGATACTTG GGAGTGGGTG AGGATAAAAT CAATTTATTG GAATTTTGGG CAAAATAAGC 600 

CAGAGAAATC CTTTAGGTAT ATAGATACGT CTAGTATTGA TAGAAAAAAG AACATAATCA 660 

ACTACAAAAA TCTACAATAT CTTTCACCTG AACAAGCGCC TTCCCGTGCT AGAAAATTAG 720 

TTTCGCAGAA TAGTGTCTTA TTTTCAACAG TTAGACCATA TCTAAAAAAT ATTGCTGTAG 780 

TTAGAGAACT TAAAGAGTAT TTGATAGCTA GTACAGCATT TAATGTTTTG GGATACTTTA 840

CTTAACGAAA CATAT 855 



(2) INFORMATION FOR SEQ ID NO: 246: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 660 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 246:



TTTAGGAAGG CTATCCGTAA TTTTACAAAG GATTTAGATA TTACAGAGGA ACATTTAGAT 60

ATTATCAAAA GAGAGATGTT TGGCGAATTT TTCAGTAGCA TGAACTCTCT TGAATTTATT 120

GCAACGCAAT ATGATGCTTT TGAAAATGGT GAGATAATTT TTGATTTGCC GAAAATTTTA 180

CAGGAAATTA CTTTAGAGGA TGTCCTTGAT GCTGGACATC ATTTAATAGA TGATGGTGAC 240

ATAGTTGATT TTACAATATT CCCATCGTAG TAACCTATTA TAATAGACAC TAGAAAGAAG 300

GGATGACAAG TATGAGAAAA AAAACAATTG GAGAGGTTTT ACGATTAGCT AGAATCAATC 360

AGGGATTGAG TTTAGATGAA TTGCAGAAAA AGACAGAAAT CCAGTTAGAT ATGTTGGAAG 420

CAATGGAAGC AGACGATTTC GATCAACTTC CAAGTCCTTT TTACACGCGT TCTTTCTTGA 480

AAAAATATGC ATGGGCTGTT GAGTTAGATG ACCAAATTGT TTTGGATGCT TATGATTCTG 540

GGAGTATGAT TACTTATGAG GAAGTAGATG TTGATGAAGA TGAGTTGACA GGTCGTAGAC 600

GTTCAAGTAA GAAAAAGAAG AAAAAAACAT CATTTTTACC TTTATTTTAT TTTATCCTGG 660



(2) INFORMATION FOR SEQ ID NO: 247: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1805 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 247: 

CCGGTTGCAC AGGATCGTGC ATAGTCAACT CTTCAAGTAT AGCATATCTC CTATTTTCTT 60

ACAAGTAATA ACACCTAAAA TGAAGCTTTT TCTTTTACTT TTTTCTGCCA AGAGGCAAAA 120 

AGCATGCTGA GGTAAAAAAC GCTCATCATA ATAGGAACAC CAAGAATGGT CTTTTCATGA 180 

TAGAAAATCG TCAAATAGGC TGAAAAGACA ACGCCAAGGA CAAAACTACT AAGCAGGCTA 240 

ACAAATATGA ATCCTTCACG CAAAAAAGGA GTGTGCTTGG TTCGGAAATA ATCTCCAAAA 300 

GCCAGCATGG TCCGTTTGAT ATTCCCTGTC ATAAAAGCGT TATTATAGGC AATACCCGAC 360 

ACTTCTCCAA AAGCAGTTGT CACCAGTCCC ATACAGAAGG CCAAGGGCGG CACTAGATAG 420 

ATATTATCCA CAGTTTGCGG CACAAAAGCA ATAATGATTG ATAAGATTGC CAAGGGAATC 480 

AAGGACAGAA TAGGTTTTTT CACAATTCTC AATTTTTCCT TATAAATCGT TAATAAAAAG 540 

ACTCCCATCA TAAACGCTAG CAAGGTGAGA ACCTTGTCCC TAACATCCGA AACATTATTT 600 

TTAATTAATT CTACTGAAAG AAAGACAACA TTTCCAGTTT GTCCAGCTAC AAGGGTATTC 660 

CCGCGAACAA TAAAAGTGTA AGCATCCACA TATCCAGCAC AAAACGTCAA AAAAAGTGCT 720

AACCTTTTAG ACTGACGTGA TATTTTTCTT ATAGGTAATA ACCTCATTTT ACCTCCCATT 780 

GTATTTTCTC TTAGAAATAT TGTACCATTT TCTTTCTAAA AAATCGTAGG CTACCATTTA 840 

GATTTTACTA TTAGCATAAA AATAATAATA GACAACTATT TATCCAAAAA TAGATAGATG 900 

TAACATGTTT GCAAACAAAG CATACGAACC TTTAGTAAAA TCATTTCrAT GAAACTACAA 960

TAGAGCCCTC TTAGCAAAAA TCATTATTTT AATTTATTTC TAATCACTCC TTGACATAAA 1020

TAACTCTCAC CAATAAAAGA CTATGTCTTA AAAAAATGGT ATAATAAAAT CAATACTTGG 1080 

GCTTGATGGC TATGCTACTA ATAACAATTA GGAGAGAAAA TCAGGCACTT GTTAACAACA 1140

AGGATTATCC CCTTGAGATG AAAGGAACTT TAGAAATCTT ATGATGAACA TGCAAAACAT 1200 

GATGCGTCAA GCACAAAAAC TTCAAAAACA AATGGAACAA AGCCAAGCTG AACTTGCTGC 1260 

TATGCAATTT GTTGGCAAAT CTGCTCAAGA TCTTGTCCAA GCGACCTTAA CTGGCGATAA 1320 

GAAAGTTGTC AGCATTGATT TCAATCCAGC TGTCGTTGAC CCAGAGGACC TTGAGACTCT 1380 

TTCTGATATG ACCGTTCAAG CCATCAACTC TGCTCTTGAA CAAATCGATG AAACTACCAA 1440 

GAAAAAACTG GGTGCTTTCG CTGGGAAATT ACCTTTCTAA AAACAAGGAG CTAGAACAAT 1500 

GCTTGTCGAT AACAAAGGCT AAGAAAGGTG CAAAAATGAC TCTATAATAT TTGTAGTGGG 1560 

TAAATCCCCT ATGGATATTA TGGAGCCTAT TTTTGTGTAG AAAAAAGTCC CATATGACCT 1620 

ATAATGAAAA GCGACAAAAC AACTCATTAG AAAGAATCAT ATGGAACAAT TACATTTTAT 1680 

CACTiAAATTA CTAGACATTA AAGACCCTAA TATCCAGATT TTAGACATCG TCAATAAGGA 1740 
TACACACAAG GwAATCATCG CCAAACTGGr CTATGAAGCT CCATCTTGTC CTGAGTGCGG 1800
AAGTC 1805 
(2) INFORMATION FOR SEQ ID NO: 248: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2516 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 248: 

CTGCATCTAG TTTGTTTCTC CCTACAGTTT TAGCTAGACA GATTGGAGAT TATGATTTAA 60 

CGTCGCCGCG TTGGGGTTCG GATACAACTA GTGAGCTTGA GAAAGAAAAC TCCTCTGCTG 120 

GAATTAATAA TAATGACAGC ACTGGTGGCG GTAAAAGGTT AAATACCTCT ATTCGTAGCG 180 

CCTATAGTGG GTCAGATATT ACCCCGGTAT ATTCATTGGG GTCTGGCTCT AGGATTGTCA 240 

TGTACTATAA TGGAGGTGGT GACAATTATA TTGGTTCTGG TACTAGATTA GCTATGGCGC 300 

CACAATTTGG AAATCATGTA AGAATTCATA CTTCAGGTTC TTGGAATCCA GATTCTTATT 360 

AACTTACTTG TCAGAGTAAG CCTTAAAGAT GGTTGATTGT GGGTGTAGCA TGAAAAAAGA 420 

ATGCTACACC CTATTTTTAT TATAAGGAGG AGTAAGGATG GAATTTTTCA TTTGTAATCT 480

TGTACGAGTC GTTCAATCAC CTCGATTTTA TATGTCTTTA TTTTTGACCC TTCTTTGCAT 540 

GAGTTTAGGA AATTTCCTTG CTTTCAATGG TATTTATAAA ATTGAAGGTT TATCGATTTT 600 

TTTTGCCGCT TCTTCTATTC GAGGATTTTC ACCGATTAGC CTAGTAGCTG CACTTATCTG 660 

TACACTGCCC TATTCTAGTC AGATAATAGA GGATGCTGAG AGTCATTTTC TAACAGCACA 720 

ATTGTGTCGA ATTTCTAAAA AGAAGTATCT GGCTATTGTG GGTAGTACTG TAATTATTTC 780 

TTCTTTTCTA GTCTTTTTTC TCCCCTATTT ATTATTATTA GGAATTAATC TTTTAGTGAC 840 

TCCTTATCAG GAAATTTATA TTGGAGATTA TAGTGGTGCC TTAAAAGAAT TATTTGATTC 900 

CAATCAGTTT CTCTATAGTC TTGTAACGAC TCTCTGGTAT GGAGTTTGGG GCGCTGTGTT 960 

CTCTATTTTT GGACTAGCTA GTGCTTTGCT AGTGAAGAAA AAAATAGGAG CTATTTTCAT 1020 

CCCAGTTGCC TATATGATGG TTGGTGGTAT TTTTTGGGCT ATTTTAGGGC TATCTTACTT 1080 

AGAACCTGTG ACAACGCTAG CTTTGGGATA TCAGAAAGAT ATCAGTCTTT CCTTAGTTAG 1140 

TGCTCATCTT GCTTTTATTT TATTTGTTAG TTGTTTGGTT GTTTATGGTA CATTTTTTCT 1200 

ACATTCAGAG GACTATGTAT AATGAAACAA TTTGTTCAAT TTTATAAAAA AGATTTCTTA 1260 

GCAGTATTGG TTTATTTTAT ATTACTGCTA TCCTGTGTTT TATCTAGTAC AGTATATTTA 1320 
TTGCGCtGTC GCCAATATTC AATCCATCCA AATGTATTAG AATGGATCTT AGTTTTACTT 1380

CAAGATATGA CGACTGGAGT ATATTGCTTT CCGTTCACAT ATATATTGTT CTTTTTTTAT 1440 

TTGATGAATA ACTATTTTAA TAGGTTGGAG TGTCGCATTC GTCTGAAATC AATTAAGCAC 1500 

TTTACCAGTT TTAGTTTCAA ATTAGCAGCT CTTAGTACGG GGATTTGGAC GGCGACTTTA 1560 

TTTTTATTGA TTTTTCTAAT TGCATTTAGT AATGGTTTTA GCTTCTCTTT GGAGATAAAG 1620 

GAGGTTGATT TTTTAAGAGA ATTTTATGGT ATAAGTATTG CAAACAATGC TAGTTTCTTT 1680 

ATAGGATTTT TTTTCTCTTA TATAGCATAC TATTTCTTTT TATCCTTACT TACTATTAGC 1740 

AGTTTTTCTT GGTTTAAAAA ATCAAACATG AGCTTAGTAT TTCTGTTTAC TTTTTTATTT 1800 
GTAGAATCCT TATTCTGGAT TTATCAGTTG GACAATGGGA TAATTGGATT ATTGCCAATT 1860

TTTCAGTATA TGGTAAATTC CAATCCGTAT GCATTGATTT ATTGGCTTAC ATTACTATCT 1920 

ATCATAATTC CATTGACTGT ATTTTCTGTT CATAGAAACT GGAGGAGAGT GTAAAAGTTG 1980 

GAAATGGGAA AGTTAAGTAG TCACATGTGG AGGTTGAATC AGATAATCTA TACCAAGTAC 2040

TTTTGGGGTT ATGTTCTTTT TTGGATATTG ATTTGTTTAG GATTATGGTA TTGGTTAGAA 2100 

GGAAATGATA GACTTGTTAT AGAAATTTTA AAAGGGCCTA ATCTGAGTCA AAACTCTTTT 2160 

TTAGTCTTAT CTATATGGTT GCTTCATTGG TTTATTATTC ATACATTTTT TCTAGCAGTT 2220 

GTATATCGTA GAAGAGCATC CGATTTCTTT ATGGAAGTGA TTCGATTTTC TTCTATTAAG 2280

CTCTGGATTA GGTATCAGAT TTGGACCTGT TTTCTTTATG GACTCATTTT AATCATGGTA 2340

AAAGTTCTAG TGATTCAATT TATGTTACAG TTACCAAACT GGGATATAGG AGTTTTGTTT 2400 

ATAGTTGATT CTTTGAATGC TTGTGTGTTA GTCTTGTTTT GCTTTATGTT ATACGCACTA 2460

GGAGCGAATG TACAAATGAA CTTTGCTTGC GTTAGTTTCT TTTTACTCAT GATTGG 2516 
(2) INFORMATION FOR SEQ ID NO: 249: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1364 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 249: 

CGGTGTTTTT TTGTAAATTT TCTAGCACTT GTATGGTAAA ATAGATACAG GTGTTCATTA 60 

AACTAGACTA AAAACCTATT TAAGCAGGCA AAATGAAGAA ATACCAACAA TTATTTAAGC 120 

AAATCCAAGA AACCATTCAA AACGAGACTT ACGCTGTCGG AGATTTCCTT CCTAGCGAGC 180 
ACGACCTTAT GGAGCAATAT CAAGTGAGTC GTGATACCGT CCGAAAGcCC TGTCTCTCCT 240

CCAAGAGGAA GGATTGATCA AAAAGATAAG AGGGCAAGGT TCTCAAGTCG TCAAAGAAGA 300 

AACCGTCAAT TTCCCTGTAT CCAACCTAAC CAGCTACCAA GAACTAGTTA AAGAACTTGG 360 

ACTGCGCTCT AAAACCAACG TGGTCAGTCT GGACAAGATT ATTATTGATA AAAAATCCTC 420 

ACTGATAACC GGTTTCCCAG AGTTTCGGAT GGTTTGGAAG GTGGTCCGCC AGCGTGTGGT 480 

GGATGATCTG GTATCCGTTC TGGATACGGA CTATCTGGAT ATGGAACTCA TCCCAAATCT 540 

CACTCGCCAA ATTGCTGAGC AGTCTATCTA TTCTTATATA GAAAATGGCC TCAAACTCCT 600 

TATTGATTAT GCTCAGAAGG AAATCACCAT TGACCACTCA AGCGACCGAG ACAAGATTCT 660 

CATGGACATT GGCAAAGACC CTTATGTCGT TTCGATTAAA TCAAAAGTCT ATCTCCAAGA 720 

CGGACGCCAA TTTCAGTTTA CCGAAAGTCG CCATAAGTTA GAGAAATTTA GATTTGTAGA 780 

TTTTGCAAAA CGCAAGAAAT AAAAGACTGA GACACCAGAT CTCAGCCTTT TTCGGCTCTA 840

TAATATTTGT AGTGGGTAAC CCCCCTATGG ATATTATGGA GCCTATTTTG TGTAGAAAAA 900 

AAGTCCCATA TGACCTATAA TGAAAAGCGA CAAAACAACT CATTAGAAAG ATTCATATGG 960 

AACAATTACA TTTTATCACA AAACTGCTCG ATATTAAAGA CCCAAACATC AAGATTCTAG 1020 

ACATCATCAA TATGGATACC CACAAAGAAA TTATCGCTAA GCTGGATTAT GAGGCTCCAT 1080 

CTTGCCCTGA TTGTGGAAGT CTAATGAAGA AATATGACTT TCAAAAACCG TCTAAGATCC 1140 

CTTACCTCGA AACAACTGGT ATGCCTACTA GAATTCTCCT TAGAAAGCGT CGTTTCAAGT 1200 

GCTATCATTG TTCTAAAATG ATGGTCGCTG AAACTTCTAT CGTCAAGAAG AATCATCAAA 1260 

TTCCTCGTAT TATCAACCAA AAAATTGCGC AAAAGTTGAT TGAGAAGATT TCTATGACCG 1320 

ATATTGCTCA TCAGCTGGCC ATTTCAACTT CAACTGTCAT TCGG 1364



(2) INFORMATION FOR SEQ ID NO: 250: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1227 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 250: 

CCATGAAGAC CGCTTGGAAT TGGAATGGCA CAAGTCTTTG TTGAATGGTC TATTCCCATT 60 

GACAATCGGT GGAGGAATTG GACAATCTCG TATGGCCATG TTCCTACTTC GCAAGAGACA 120 

CATCGGAGAA GTGCAAACAA GTGTTTGGCC TCAAGAAGTC CGCGATACTT ACGAAAATAT 180 

TTTGTAGAGA ATCGAACCGC AAGGTTCGGT TTTCTTTCTC TTTTTGTCTA TAATTTGGTA 240 
TAATAAACAG TATGAAAATC GTATCAGGAA TCTATGGGGG ACGTCCCCTC AAGACACTAG 300

AAGGCAAGAC GACAAGACCT ACTTCGGATA AGGTTAGGGG AGCCATTTTT AACATGATTG 360

GTCCCTACTT TGAAGTGGGA CGAGTCTTGG ACCTTTATGC AGGTAGTGGT GGTTTATCTA 420 

TCGAAGCAGT ATCGCGTGGC ATGTCCAGTG CTGTTTTGGT GGAGCGAGAC CGTAAGcTCA 480 

GACCATCGTG GCTGAAAATA TCCAGATGAC CAAGGAAGTT GGAAAATTTC AACTCCTCAA 540 

GATGGATGCA GAAAGGGCAT TGGAACAGGT ATCTGGGGAA TTTGACCTCG TTTTCTTAGA 600 

CCCTCCCTAT GCCAAGGAAC AAATCGTAGC AGATATTGAA AAAATGGCTG AGAGAGAGCT 660 

TTTTTCTGAA GATGTTATGG TTGTGTGCGA GACGGATAAA GCCGTTGAAC TTCCAGAAGA 720 

AATTGCCTGT CTGGGTATCT GGAAGGAAAA GATTTATGGA ATTAGTAAGG TGACAGTCTA 780 

TGTCAGATAA GATTGGCTTA TTCACAGGCT CATTTGATCC GATGACAAAT GGGCATCTGG 840 

ATATCATTGA ACGGGCGAGC AGACTTTTTG ATAAGCTTTA TGTGGGTATT TTTTTTAATC 900 

CCCACAAACA AGGATTTCTC CCTCTTGAAA ATCGTAAACG GGGGTTAGAA AAGGCTGTGA 960

AACATTTGGG AAATGTTAAA GTCGTGTCTT CTCATGATAA ATTGGTGGTC GATGTCGCAA 1020 

AAAGACTGGG GGCTACTTGC CTAGTGCGAG GTTTGAGAAA TGCGTCGGAT TTGCAATATG 1080 

AAGCCAGTTT TGATTACTAC AATCATCAGC TGTCTTCTGA TATAGAGACT ATTTATTTAC 1140 

ATAGTCGACC TGAACATCTC TATATCAGTT CATCAGGCGT TAf?Af7_AGCTT TTGAJIGTTTG 1200

GTCAGGATAT TGCCTGCTAT GTTCCCG 1227 



(2) INFORMATION FOR SEQ ID NO: 251: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3652 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 251: 

CCGGTCAAGT TAAAAACGCT ATTTCTTCCC ATTTTATTTA TTTTTTAGGA GTGGTAACGT 60 

ATCAAAATAG CCCAAGCGTT CTCACCCGTG TGAGTTTGAA TAATGGAACC CGTTTCCAAA 120 

ACAGAAATTG GCTTTTCAAC ATAAGCTTGT AAGCTTTCTT TCATCTCTTT TGCCCAATCA 180 

TCACTACCAG AATATGAAAT TCCAATCTCT GCTACAGCAC GTTCAGAAAG CGATGTTATC 240 

AACTCATCTA ACCATTTTTT AAATGTTTTA GTTCCACGAC CTTTAACCAT TGGCTGCAAT 300 

TCATGGTCTT TCATTTGCAT GACAGCACGG ATATTGAGAA GAGAGCTCAA CAAGCCAGTT 360 
ACACGGCTAA TTCGTCCACC TTTGACAAGA TTTTCCAAAG TTGAAACACC AATATAAAGC 420

TCTGTATGGT TTTTAACCTC TTCTACATGA GATAAAATTG CCTCCATATC TTTACCTTCT 480 

TGAGCTAACT TCGCAGCCTC AACAACTTGG AATTTCAGGG CTTGGTCAGT GAAGGAACTA 540 

TCAACAACAG TCACATCTGC AGTAGATAGG CTAGCACCTT GGCGTGCTGC TTCTACCGTA 600 

CCCGAAAGAG CATGGGACAT ATGAATAGCA AGAATCTGGC CACCATCTTT GCATAGGTCT 660 

TCAAAAATCT CAGCAAAGAC ACCTACAGGT GGCTGACTTG TTTTCGGAAG ATTCTTACTT 720

TCTTGCATCA ACTGAAGAAA TTTACCTTCT TCTTTCAAAT CCGCATCAGA ATAAACAACA 780

TTATCAATCA TTACAGATAA TGGAACAATT GTAATATCTA ATTGCTTTAC TAGTTCAGGT 840 

TCAATAGTAA CAGATGAATC GGTTACAATC TTAATTTTTG TCATAGTATC AATCTTTCTA 900 

TTTTAGGATT CAGATTGGTT TCCTTACTTC TAATTATATC AAAAAAAAGA TTAAAAATCC 960 

TAATGGAGTC AATCAAATTT TCCGTAAAAT TTGATATAAT CAACTTATAA GAAAAGAGGT 1020 

GTCCTATGAT TAAAAAAATT TACCCCATTT TTACCATTTT ACTAGGTGCT GCTATTTATG 1080 

CTTTTGGACT GACTTATTTT GTAGTTCCCC ATCATCTCTT TGAAGGAGGG GCGACAGGCA 1140 

TTACCCTCAT CACCTTTTAT CTTTTTAAAA TCCCTGTTTC CCTCATGAAC CTGCTGATTA 1200 

ATATTCCCCT TTTCATCCTA GCTTGGAAGA TTTTTGGAGC CAAATCCCTC TATTCTAGTT 1260 

TACTAGGAAC CTTAGCTTTG TCCGGCTGGT TAGCTTTTTT TGAGCATATT CCCCTTCATA 1320 

TTGATCTTCA AGGTGATTTA CTAATCACAG CCCTTATAGC GGGAATCCTA TTGGGAATTG 1380 

GCCTTGGAAT TATTTTTAAT GCTGGAGGTA CAACTGGCGG AACTGATATT CTAGCTCGTA 1440 

TTCTCAACAA ATACACTCAT ATATCCATAG GAAAACTGCT CTTTATCTTA GATTTTTGTA 1500 

TTCTCATGTT GATTCTCCTA ATCTTCAAGG ATTTGAGATT GGTTTCCTAC ACGCTTTTGT 1560 

TTGATTTTAT TGTTTCTCGT GTTATTGATT TGATTGGTGA AGGAGGATAT GCCGGCAAAG 1620 

GCTTTATGAT TATCACAAAA CGTCCTGACC AACTTGCTAA GGCGATTAAT GATGACCTCG 1680 

GAAGAGGTGT TACTTTTATT TCTGGTCAAG GCTACTATAG TAAAGAAAAT TTGAAAATCA 1740 

TCTACTGTAT TGTCGGAAGA AATGAAATTG TGAAAACGAA GGAAATGATT CATCGAATCG 1800 

ATCCTCAAGC CTTTATAACT ATTACAGAAG CCCATGAAAT CCTAGGAGAA GGCTTCACCT 1860 

TTGAAAAAGA ATAAAAAGAG GTAATGTCGT GACCTCAAAA GTTAGACTAA ATCATCTATC 1920 

TTTTGGGTTA CAGACAACCT CTTTTTTATT TTATTTACTC AAGCTCTTAA GACCAATTCC 1980 

GAGTTACTTC TTCATCAGCC TTTAACTGAT CCACTAATTG GTCAACTGAG TCAAATTTGG 2040 

TCATATCTCG AATGCGATCA AGCCAATAAA CCATGACGGT TTCCCCATAA ATATCTTGAT 2100 

TAAAATCAAA AATATTGACT TCAAAACGTG CTTCTTCTCC ATCAAAGGTC ACATTTTTCC 2160 
CGACACTAGC CATAGCACGA TACTTCTGTC TTTGAATCTC AACATCAACA ACATAAACGC 2220

CATCTGCTGG CATATAAGTA CGGTCTAAAA GCACTAAATT CGCTGTCGGA TAACCAATTG 2280

TACGACCACG AGCATTACCA TGAACCACCA TACCTCTTGA TGGAAGCGGT GCCCCCAAAA 2340 

GTTTTCCTGC TTCTTTCACA TTTCCATCTA AAATAGCTTG ACGGATACGA GTTGAACTAA 2400 

TCTTTCCTTT CTCATCTTCT ACAGGTGGAA CAATGATAAC TTCTCCATCA AAGTAATTCT 2460

TTAAATCTTC TGCTGTTTTT TTGTCAGAAC CAAATGTATA ATCAAAACCT GCAACAATAA 2520 

TTTTGGCATT CATAGCCTTG ATATAAGTTG CAAAGAATTC TTGTGCAGTG AGACTAGCGA 2580 

ATTGACTACT AAAATCAAGG AGATATAATT CTTCTACACC TTCGCGCTTT AATTTTCTTT 2640 

CACGTTCAGC AGGGTTCAAA ATATGCAAAA ACAAATCTGG ATGATAAGGC TCTAAAGCGA 2700 

TCTTTGGAGA TTCATTAAAG GTCATAACGA CGATAGGCAA CAAATCCTTT CTCGCAGCCT 2760 

TGTTGGCAAC ACGAAATAAT TCTTGATGCC CCTTATGTAT GCCATCAAAA TAGCCGAGAA 2820 

CAACGACTGA ATCAGATGGT GTGCCAATAT CTTTTTGGTT TTTTATAGGA ATAGTAATAA 2880 

TCATAAAATA ATTATATCAT AGCGATAGCT ATTTCTGGAA CAGAAAATCT GAAATGTTGT 2940 

TTTTTTCACA TGAAGTGTAC CTGTTTTCAA AAAGCACTTT ATTCTATCGT TGCTTAACTA 3000 

TGAACTTTGC AATATTCTTC TCAAAAACTT GTAGGACATC TTCAAAATTT TGCAAGGAGT 3060 

GATTAGACTT GTTCGGTAJ^C CATAAAGTGT CATACTATGC TTATGTATGA AAJ'JVGCT'J'iTC 3120 

CAACTAACTC CTGAGAACTT TAAATTACTA ATTGGTGCCG AAAAGGTAGA ATTTAGAATC 3180 

GAGGTACACC TATGGCTGTA AAATTTACAA AATGAGACAA CTTGGGCAAG ATGTTTGAAG 3240 

AATTTCCTAA ACTCCCTGAT TTGAAGCAAG TCACTTTCCC TAATGACAAA GAAAAAAGCC 3300

AAAACAGCAA AGAAAAACTA GATGACTGCT TTCCAACAAC TCCCATCTAG TGTGCTTCAG 3360

ACTGGGCTAT TTTTCTCTCC ATCTGTTAGC TTGGATTCTC AGACCGTTTC AGCTAAAGAA 3420 

TATCTTTTCC CTTATCAGAA GGAACGGCTC AAGCCATTCA GACAAGTGAA GGGACGACAA 3480 

GCCAATATTT GAAACCAGAT AGCAGTTCTT ATAGTCAATT GAAATAAAAT CTGAAGAAAT 3540 

CGAGTAGGAA ACTCATATCA ATGTTTAACA GTGTTCTATT CCAGATTCAT ACTCAATGAw 3600 

AATTAAAGTG CAAACTAGGA AGTTAGCCGC AGGTGATACT TTGGGTACGG CA 3652 
(2) INFORMATION FOR SEQ ID NO: 252: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 743 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 252:

GTACCGTGGT GCCAAAGTAC AGCAAGGTTG GCTTTTTGAC AAACAATACC AATCTTGGTT 60 

TTACATCAAA GAAAATGGAA ACTATGCTGA TAAAGAATGG ATTTTCGAGA ATGGTCACTA 120 

TTATTATCTA AAATCCGGTG GCTACATGGC AGCCAATGAA TGGATTTGGG ATAAGGAATC 180 

TTGGTTTTAT CTCAAATTTG ATGGGAAAAT GGCTGAAAAA GAATGGGTCT ACGATTCTCA 240 

TAGTCAAGCT TGGTACTACT TCAAATCCGG TGGTTACATG ACAGCCAATG AATGGATTTG 300

GGATAAGGAA TCTTGGTTTT ATCTCAAATC TGATGGGAAA ATAGCTGAAA AAGAATGGGT 360

CTACGATTCT CATAGTCAAG CTTGGTACTA CTTCAAATCC GGTGGTTACA TGACAGCCAA 420 

TGAATGGATT TGGGATAAGG AATCTTGGTT TTACCTCAAA TCTGATGGGA AAATAGCTGA 480

AAAAGAATGG GTCTACGATT CTCATAGTCA AGCTTGGTAC TACTTCAAAT CTGGTGGCTA 540 

CATGGCGAAA AATGAGACAG TAGATGGTTA TCAGCTTGGA AGCGATGGTA AATGGCTTGG 600 

AGGAAAAACT ACAAATGAAA ATGCTGCTTA CTATCAAGTA GTGCCTGTTA CAGCCAATGT 660 

TTATGATTCA GATGGTGAAA AGCTTTCCTA TATATCGCAA AGTAGTGTCG TATGGCTAGA 720 

TAAGGATAGA AAAAGTGATG ACA 743 



(2) INFORMATION FOR SEQ ID NO: 253: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4010 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 253: 

TTTTGGTTGA TGATACGAGG GATTTGGTGA TTCTTCTTGA CGATAGAAGT TTCAGCGACC 60 

ATCATTTTTG AACAGTGATA GCACTTGAAT CGACGCTTTC TAAGGAGAAT TCTAGTAGGC 120 

ATACCAGTCG TTTCAAGATA AGGAATTTTA GAAGGTTTTT GAAAGTCATA TTTCTTCAAT 180

TGGTTTCCGC ACTCAGGGCA AGATGGGGCG TCGTAGTCCA GTTTGGCGAT GATTTCCTTG 240

TGTGTATCCT TATTGATGAT GTCTAAAATC TGGATATTAG GGTCTTTAAT GTCTAGTAAT 300 

TTTGTGATAA AATGTAATTG TTCCATATGA TTCTTTCTAA TGAGTTGTTT TGTCGCTTTT 360 

CATTATAGGT CATATGGGAC TTTTTTTCTA CAATAAAATA GGCTCCATAA TATCTATAGT 420 

GGATTTACCC ACTACAAATA TTATAGAACC GAATTAATTT AATTAGAGAG CCAACTTTCT 480

AATATAGTAA TCGCGTCATA ACAAGGTATC TATCATTCAT GGAGTTCCTC CTGTATACTA 540 
TTAGTAAAGT AAAACTATTG GAGGATATTT TAATGCCACA ACCTATTGTT CCTGTAGAGA 600

TTCCACAATC TCGTCGTTTT GATTCTAAAA AGAGAAATGA TATTCTGCTT AAAATTCGTA 660 

TTGGCAAGCT TGAAGTAAGT TTTTTTCAAT CTCTCAATCT CGAAATGGTA GAACAGCTTT 720 

TGGATAAAGT GTTGCTCTAT GACAATTCAT CTATCTAGCC TAGGGCAGGT CTATCTCGTA 780 

TGTGGGAAAA CGGATATGAG GCAAGGCATT GATTCATTGG CTTATCTGGT TAAAACCCAC 840 

TTTGAATTAG ATCCTTTCTC CGGTCAAGTT TTTCTCTTTT GTGGTGGACG TAAAGACCGC 900 

TTTAAAGCCC TTTACTGGGA TGGTCAAGGA TTTTGGCTAC TATATAAACG CTTTGAGAAC 960 

GGAAAACTGA CTTGGCCCAG TACAGAAAAG GATGTCAAAG CTCTCACACC TGAACAAGTA 1020 

GATTGGCTTA TGAAGGGCTT TTCTATCACT CCAAAAATAA ATTTATCAGA AAGTCGTGAT 1080 

TTCTATTGAA ATGAGGACTT TCTTTTTAGT TATAATAAAG TTAGGAAATA AGGAGAGGAA 1140 

GCCCATGGAA GAAGATTGAA AATCATTCAA CAACAGAGTG CTACAATTGA TAGTCTCACC 1200 

AATGAACTTG CCCTTCTTCG TGAACAAGTG GCTTATCTAA CGCAAAAGCT CTATGGAAAA 1260 

TCCTCTGAGA AAAGTGTTTG CCCATCTGGA CAACTCAGTC TTTTTGAAGA GGAACAAAAT 1320 

ATGGAAGAAG ACTCTGACTT ACCCAGTTGA AAGAGAAGAA ATCACCTATA AACGTAAGAA 1380 

AGCTAAAGGG AAACGTCAAG CTCTTCTTGC CCAATTTGAT TCAGAAGAAG TTCATCATCA 1440 

AGTAGAAGAG AGCATTTGCC CTGATTGTf^A GGGAGATCTA AAACACATTG GAGCAACCCT 1500 

TCAACGACAA GAATTAGTCT TTATTCCTGC GCAATTAAAA CGAATAGATC ATATCCAACA 1560 

CGCTTATAAG TGCCAAGCAT GCAGTGATAA AAATCCGAGT GATAAAATCG TGAAAGCTCC 1620 

TATTCCTAAA GCCCCTTTGG CGCATAGCCT TGGCTCAGCT TCTATTATCG CTCACACCAT 1680 

CCATCAGAAG TTTAATCTGA AGGTACCCAA TTATCGCCAA GAAGAAGATT GGGCTAAGAT 1740 

GGGTTTACCA ATCACACGTA AGGAAATTGC TAATTGGCAT ATCAAGGCGA GTCAATACTA 1800 

TTTQGAGCCC CTTTATAATC TTTTACGAGA AAAGTTGTTA GAACAAGCTC TTCTTCATGC 1860 

GGATGAAACC TCTTATCGGG TTCTAGAGAG TGATAGTCAG TTGCCTTACT ATTGGACTTT 1920 

TTTGTCTGGG AAAGCTGAGA ATCAAGCAAT CACGCTGTAC CACCATGATC AGCGTCGGAG 1980 

TGGTTTAGTA GTACAAGAAT TCCTAGGAGA TTATTCTGGC TATGTTCATT GTGACATGTT 2040 

GCGGCAGTAA CTTAGGACTT TAGTCCTCTA GTTCTGCCTA TGCGATAGCA GTCCAAGGTT 2100 

TAGGAGTAAG GCGACGCTAA GCTTGGTAAA CTGCGAACAG CTAGAAGCTT ATCGTCAACT 2160 

GGAAGAAGCT GCACTTGTTG GATGTTGGGC GCATGTGAGA AGGAAGTTTT TTGAAGTGCC 2220 

CCCCAAGCAA GCAGATAAAT CATCCTTAGG AGCTAAAGGT TTAGCTTATT GTGATCAGTT 2280 
ATTTTCCTTG GAAAGAGACT GGGAGGCTTT GCCAGCTGAT GAACGACTAC AGAAACGTCA 2340

AGAACATCTC CAGCCCCTAA TGGAAGACTT CTTTGCTTGG TGCCGCCGTC AGTCAGTTTT 2400 

AGCAGGTTCA AAACTAGGAA GGGCAATTGA ATACAGCCTC AAGTATGAAG AAACCTTTAA 2460 

GACTATTTTG AAAGACGGAC ATCTGGTCCT TTCCAATAAT CTAGCTGAAC GCGCCATTAA 2520 

ATCATTGGTT ATGGGACGGA GTAAAAGAGT CCAGTGGACT CTTTTAGCCT GAGCTCAGTT 2580 

TAAAAAAGCG AGGGTGGTTA TTTTCTCAAA GTTTTGAAGG AGCTAAAGCA AGAGCTATTG 2640 

TTATGAGCTT GTTGGAAACA GCTAAACGTC ATCAATTATA GTGCGTTGAA TCTATAACAG 2700 

TACGCATCGA CTGCTAAAAC ATTTCTATAA ATCAATTTTC CTTTCCTAAT CGATTTGTTC 2760 

ATATCTTATT TCAATCCATT ATAAATAGCG AGAAATATCT ATCCTATCTT CTAGAATGTC 2820

TTCCAAACGA GGAAACTCTC GTAAACAAAG AGGTTTTAGA GGCCTATTTA CCGTGGACTA 2880 

AAGTTGTACA AGAAAAGTGC AAATAAGAAA TCTCCAGATT AGGAACTATC CGTGAGTTCT 2940 

CTAGTCTGGA GATTTTTCAA TAGACTTCGT TATTGGACGG TTACAATTTA TTATATGAAA 3000 

ATCCCATATT ATTCTCCAAT TCTATATTTT ACCTTTCTAA ATGTATAGAT TAACTACCTA 3060

ATTATAGCAT ATAACGCAGA TTCCTTTCAA TCGTATGATT TACTGCATTA AATTAAGTAA 3120 

AAAAATAAAG GCAGTCCGAA GACTGCCGAT ATTTATCTCT CATCTCTTTA ATTATGGTAA 3180 

GTAAATAAAT AATTTCCCTA AAGATATGGA AATTATTAAT ACTATAAATA CATATTATAA 3240 

AGTTTATAAA TACTGTAAAA ATCCTGAAGT TAATTTTCTA ATAAATATCA ATATGTGTTA 3300 

GTATCTTTTA AATTTTTAGA CAATTTACTA GTTCTATAGA CATGTTTAAC AGACTCTATT 3360 

TTACAATTCA AAAATTTCAT CTGCCACTTC ATTTAAAAAT TCTATATCAT GGGAAACAAT 3420 

AAAAATTATT TTATCCATGG TTTTATACTT ATTAATCAGT TCAGATATTT TTATCATATT 3480 

GGAATAATCC ATACCACTTG AAGGTTCGTC AAAAAAGACA AATGGAGAAT TCTTGCACAT 3540 

AACAGATGCT ATTGCAAGCC TTTGCTTTTG CCCTCCTGAT AAACTCATCG GATGCCTTTC 3600 

AATAAATTCG TCCAGGCATA AATCTTTTAA CCCAAATCAT TCATACCTCT CTCAACTAGA 3660 

TGTAACTTAC AAAACCCCTG ACCTCATGAG CCACTTTCTT CCTCCTCATG AGGTCAGTTT 3720 

TACTTTCTGC TGTTCCAGTA TCGTTTTTCC TCGCTAGATT TCCTCAAAAG GGCAGACTCC 3780 

TCCCTTGGTT CGTCACACGA TTTTTTCATC TCGACTGTTC TTTAATGCAT CATTAACGAC 3840 

GCTTTTCTTC TAGGTGGTTC ATAAGGAACA GGAAGATTCA GGTTGACTTT TCTAATCCTA 3900 

GAATAAAGTG CTGAAAACAA TTCGGAATAG GCATAGAGAC TAGACAATTT GAGGAGCTGC 3960 

TTGCGTCCTG TTCGAACACA TTTTCCCACC ACGTGAAGAA AAAGATGGCG 4010 
(2) INFORMATION FOR SEQ ID NO: 254: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2789 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 254: 

ATGCATCCGT TTGTCAAGCC TAAATTGTAA TTTTTTTCAA TTTAAAACAG AAAAACCCAG 60 

GAAAATGACA TAAAAATATC ATTCCTAGGC CTATTTATGC TATTTCTCTC TGAAAAATAT 120

GAGTATTCAG TCGGTCAAAT GAAGCTGAAC GAACTCATTT TCCCTCGCCT AATTCAATGA 180 

TTCGATGACA TTGTTGGGCT ACATAAGCAT CGTGGGTCAC GATAATGACT GTTTTCCCCT 240 

CTCGATTCAT CTCTAAGAGA AACTTCAAGA CCAAATCTCT ATTTTCAGGA TCCAGAGAAC 300 

CTGTCGGTTC ATCGGCTAAA ATCAGCTGGC TGGGTTTTAA GATGGCTCTA GCAACTGCAA 360

TTCGTTGTTG TTCGCCCCCA GACAACTCGG AGACCCTTTG ATGCAAAGTA GCTGATAAAC 420 

CTACTCTCTC TAAAATCTCT TCCACCTTTT TGAGCTTGTC TTTCTTAGGC AATTTCACAT 480 

ATTTCAGCGC CACATGAGAT TGTACTCGAC CGTTTCATCA TCAATCAGGG CAAAATTTTG 540 

AAACAGATAA GAGATATGTT CACGGATTAT TGTTTGCGAC TTAGCAGAAT TAACCGCTAG 600 

.'TTTGTCTGA CCAAAAATCT CATACCGTCC GCTATAATCA CCATCTATCA AACCCAATAA 660 

ATTTAACAAG GTCGACTTCC CACTACCACT CTTACCAACA ATAGCTACCA AATCCCCCTG 720 

ATCAATCCTG AGAGATAAGT TATCCAAAAT CACTTTTCCC CCAATGGTTT TGGTAATATT 780 

TTTCAACTCA ATCATAAGAT GCCCCCTTTC AATAACTCTA CTAGACTTCT TTTCTCCATC 840 

CTAGAAGCTA AGCCTAGCAC AAATAGTATA TCCAGACATG TAAAACCTGC AAACAGTAGA 900 

AGTGGTAAGA ACGCATGGGC AAAGAAAATC AAGACTAGAA GAGGGAAACT ATAGCCCAGC 960

AAGAGCAGAA CGAGGAGAGG ACGGTAGCGA TCGACCAGTT TCCACCCCAT AAACTTCTTG 1020 

GTAATGATAT CCCTGCGCTT CAATAAGAAA GTTGTTACTA GTAAGAAGTA GGAAATCATC 1080 

ATGCTAAGGA GACCAAACAA AGCAAAGAGT AGGTTAAAAT TCCGAACAGC ATCTCGATAA 1140 

GAATCCACTT TCTCTTGTTG AATGGCTTGA ATAGATGAAA ATTTTAAATA ATTTCCATCT 1200 

GACAATTTCT CAACTAACTC TGTAATCTCT TTTTGATGTT GAACCGTATT TTCAATTTTA 1260 

ATCGGATTAT TTAAGCCAGT TGTTGACAGG GAGGCTTTCT CATCCCACAT CATATCAGAA 1320 

TCATTGACCA AGCTAATAAT TGGATTGGAG AGATTTTCCT TTCGCTTATC ACTATATGGG 1380 

AAAAATGACC AATCTCCTTC ATAATAGGCA ATCTCGACAT CCATCTCCTC TATCGTTCGT 1440 
TTTTGCTGCT CTTCATACTT CATCGAATGA AAGGCAATTA ACTTCCCCAA GAGCTGATTT 1500

TTATCTTCTT CACCTTTCGT ACTTGCTGGC ATCAAAATAA CTTTTTTAAT ACCGGTATTT 1560 

GGTAGCTTGA ATCCCTTGCT CTTTAGAAAA TTGCGATTGG CATAGTAAAC ATCCACCGTA 1620 

TCTGTTAACT GATATTGCTG AATCTGTTCT GATTGGACAA AATTTTTTAC AGGAAGACTG 1680 

CTACTCTGCA CATAGCCCGC CTGCGTTTTT TCTACCAAAT CCTGATAAAA TCGATAGAAA 1740 

TAATCTGTAG ATTTCCCTGA CCCTGCTAGC TCTTCTTGCC ACAGATTATC ATTGAGTTTG 1800 

AAGGTTTCTA AGGTCAGGTA ATTACCTTGA CTTACCCACT GTTGCTGATA AGCAAGTTCT 1860

TTGTTTTCTT GTTCTAAACT TCTGCCCACC CCAATCAGTA AGGCCGTCAG TAAAATAGTT 1920 

GTCCCTATTT TCATCACATA ATTGAAGATA AGACCAAATT TGAAAGATGA AAAACCTTTC 1980 

AGCAGAGAGC TGATTGTCAT TTTTTGGATT AAGAGGTAAG TCAACCAACT GATAAAGAGA 2040 

TAAAGCTGCA ACAGCAAAAA ATGAGACAAC CACAGCATAG GAAACAAATC TTTTGGCTTA 2100 

TAATCAAGCA AGAAAAACAC GCCTAGATTG ATCACAAGAG CCCCACCTAG GAGGAGGTAA 2160 

AGGTTGCCTT TTACAACATC AGCTAAAACA GCCCTATCTT GAAAACCAAG TAATTTTTGT 2220 

ACCCCAACTC TTTTCATCTC CATCATCGGT TGATACACTG TCACTAACAC AAGAAGCAAA 2280 

ATAGCCAAGA CAAAAACAAT GGCAGATAAA AGCAAATCTC GATTTATGAC TTCCACTGCA 2340 

CTTTTGTAGG TCGGCTCTAG CAAGGTAGCC TGGTCTATCT TGAAAAAATC GCTCCATTTC 2400 

TGTACAATCC TATCCTTGTC CATCTCTTGT GTAGAAGTTA TCGTATAGCG ACCATTTAAA 2460

CTACGAGATG TATCCTTGAT ATAGGTTTGA AAAGTCATAA GCTGAATAGG TTTGGCTTTT 2520

AGAAAGGTCG GAATCGTACC AAGTTTATTG GT^TTTCTT TATTACTATA GACTCCTTCA 2580 

CCATCTGTGG TAAAATCAAG AGAAGAAATC CCAAACTCTT GGTAGGGGAA GGTATCTTTA 2640

TCAAAAACAC CAGACTTGAC CACCTCATCA CCACTGTCTG TTTTGATGAT GGAGACTTTA 2700 

TACTCCTTTG ATACATCCTC AAAAAATCGA AGAACAGACG CTGCAGGTTC GTTAATATCT 2760

TTCAAATACA AATCCAAAGA ATCTACAGG 2789 
(2) INFORMATION FOR SEQ ID NO: 255: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2495 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 255: 

CTGCGTIATTT TATTA7VAGAT AATGTGTTAA TTACAGCGGC TCACAACTAC TACAGACATG 60 
ACTATGGGAA AGAAGCGGAT GATATTTATG TTCTTCCGGC TGTTAGTCCA AGTCAAGAAC 120

CATTTGGAAA GATCAAAGTA AAGGAAGTTC GTTATTTGAA GGAATTTAGA AATTTAAATT 180 

CTAAGGATGC AAGGGAATAT GACTTGGCTT TATTAATTCT AGAAGAGCCC ATTGGTGCAA 240 

AATTAGGGAC TTTGGGTCTT CCTACTAGTC AAAAAAATTT GACAGGAATA ACTGTGACTA 300 

TCACAGGCTA TCCATCATAT AATTTTAAAA TTCATCAAAT GTATACAGAT AAGAAACAAG 360 

TTTTAAGTGA TGATGGCATG TTCTTGGATT ACCAAGTTGA TACTTTAGAG GGGTCTAGTG 420

GATCTACAGT TTATGATGCT AGTCACCGTG TAGTAGGAGT GCATACTTTA GGAGATGGAG 480

CTT^TCAAAT TAACAGTGCA GTTAAATTAA ATGAACGAAA TTTGCCATTT ATTTAwTCGG 540 

TTCTTAAAGG TTACTCTCTT GAAGGATGGA AGAAAATAAA TGGTAGTTGG TACCATTATA 600 

GACAACATGA TAAACAAACG GGTTGGCAGG AGATAAATGA TACCTGGTAT TATTTAGACA 660 

GTTCCGGTAA GATGCTTACA GATTGGCAAA AAGTCCATGG AAAATGGTAT TATCTCAATT 720 

CAAATGGAGC AATGGTTACA GGTAGCCAAA CTATCGATGG TAAAGTTTAT AACTTCGCTT 780 

CATCTGGTGA GTGGATTTAA TGTTGGAGGA TATATAAAAT GAAGCTTTTG AAAAAAATGA 840 

TGCAAATCGC ACTAGCCACA TTTTTCTTCG GTTTGTTAGC GACAAATACA GTATTTGCAG 900 

ATGATTCTGA AGGATGGCAG TTTGTCCAAG AAAATGGTAG AACCTACTAC AAAAAGGGGG 960 

ATCTAAA_AGA AACCTACTGG AGAGTGATAG ATGCC.V.CTA CTATTATTTT GATCCTTTAT 1020 

CCGGAGAGAT GGTTGTCGGC TGGCAATATA TACCTGCTCC ACACAAGGGG GTTACGATTG 1080 

GTCCTTCTCC AAGAATAGAG ATTGCTCTTA GACCAGATTG GTTTTATTTT GGTCAAGATG 1140 

GTGTATTACA AGAATTTGTT GGCAAGCAAG TTTTAGAAGC AAAAACTGCT ACGAATACCA 1200 

ACAAACATCA TGGGGAAGAA TATGATAGCC AAGCAGAGAA ACGAGTCTAT TATTTTGAAG 1260

ATCAGCGTAG TTATCATACT TTAAAAACTG GTTGGATTTA TGAAGAGGGT CATTGGTATT 1320 

ATTTACAGAA GGATGGTGGC TTTGATTCGC GCATCAACAG ATTGACGGTT GGAGAGCTAG 1380

CACGTGGTTG GGTTAAGGAT TACCCTCTTA CGTATGATGA AGAGAAGCTA AAAGCAGCTC 1440 

CATGGTACTA TCTAAATCCA GCAACTGGCA TTATGCAAAC AGGTTGGCAA TATCTAGGTA 1500 

ATAGATGGTA CTACCTCCAT TCGTCAGGAG CTATGGCAAC TGGCTGGTAT AAGGAAGGCT 1560 

CAACTTGGTA CTATCTAGAT GCTGAAAATG GTGATATGAG AACTGGCTGG CAAAACCTTG 1620 

GGAACAAATG GTACTATCTC CGTTCATCAG GAGCTATGGC AACTGGTTGG TATCAGGAAA 1680 

GTTCGACTTG GTACTATCTA AATGCAAGTA ATGGAGATAT GAAAACAGGC TGGTTCCAAG 1740 

TCAATGGTAA CTGGTACTAT GCCTATGATT CAGGTGCTTT AGCTGTTAAT ACCACAGTAG 1800 
GTGGTTACTA CTTAAACTAT AATGGTGAAT GGGTTAAGTA ATGAAGGCTA ATTGTAAACT 1860

GTGATGGATA CTTAACTTTG TATAATAGGT GGATAAAAGT CTTCACAATC AAAAAACGCA 1920 

TAGTATCAAG GTTTTTCTGT ACTGCCCTCA AACAGTTAGA CAATTAATTT ATCCGAAGgA 1980 

TTTAGTTCTG TATTGCACAG GGCTAAGTCC TTTTAGTTTT ACCTTAATTC GTTTATTGTT 2040 

GTAGTAATCA ATATAGTCTA TAATGGCTTG TTCCAATTGC TTAAGCGACT GAAACGACTT 2100 

CTCATAACCG TAAAACATTT CCGATTTCAG AATCCCAAAG AAGGACTCCA TCATACTATT 2160

GTCTGGGCTG TTTCCCTTAC GTGACATGGA TGCTTGAATT CCCTTACTCT CTAGGAACCG 2220 

ATGATAAGAA TCGTGTTGGT ATTGCCAGCC TTGGTCACTA TGGAGAATCG TATTCTCGTA 2280 

GTGCTTCTCT GTGAATGCCT GTTCCAACAT TGTTTGTACT TGTTCTAAGT TGGGTGAAGT 2340 

TGAAAGATTA TAGGCGATAA TTTCGCTATT AAAGCCATCT AAAACTGGTG ATAAGTAAAG 2400 

CTTTTGAGTA CTTGCTGGAA TGGCAAATTC TGTCACATCT GTGTAGCACT TTTCCATTGT 2460 

TTTAGAGCCT TCAAATTGGC CTTGAATGAG ATTCG 2495 



(2) INFORMATION FOR SEQ ID NO: 256: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 870 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 256:

TACCACCGTA TTCATCCAGC AAGATTGCCA TTTGTCTTTG GGTATTTCGC AGTTCTTTTA 60 

GCAAGTCATC CACAAAAATA GTTTCAGGTA CAAAAAGTGG ATCTTGTAAA ATTCTCTTCC 120 

AAACAATATT GTCAAAACCG TCCACAAAGC CTGCCTTAAG GAGACTCTTG GTGTGAATGA 180 

TTCCAATTAC ATTGTCCTTA TCCCCATCAT AAACCGGGAT ACGAGAATAA TTTTGTTTTA 240 

AAATACTTTG GATAATGGCT TGACTATCAT CCTGAATATC CACCATAAAG GCATCCGTTC 300 

GAGGAACCAT AACCTCTCGT GCCATCAGTT CATCGAGCGA AAAGACACCT TGTAGCATCT 360 

CAATCTCATC AGCATCCAAT GTTTCTTCAC TATTTGTCAG CATATAGGCA ATTTCATCAC 420 

GGGTCATCTT TTCATCCGCA TCATCGAATG ACATAGGAGT CAAATGGCTC AAGAAATTGG 480 

TCGAAGCAGC TAAAAGCCAA ACAAAAGGAC TGACTAGTTT TCCGATCCCA ATGATAATCG 540 

GCGCTGTACG AATTGCCAAG GCATCCTTTA GATTAAGAGC GATTCTCTTA GGATATAATT 600 

CCCCAAAAAC GATGGAAATA TAGGTCAAAA ATGCCAAGGA TAGAAAAGTT GCCACGGCTT 660 

GTGCTGTTTC GCCATTCCCA AGCCAAGAGG CAATCACACG TCCTAGAGTA TCAGTTAAAC 720 
TCGCCCCTGA TAAGATTGTA ATCAGGGTGA TTCCTACCTG GATGGTTGAT AAAAAGTGGT 780

TAGGATTTTC TAGTACCTTC AGCAGGCGGA TGTAGCGTCT GTCTCCTTCT TCCGCCTTTT 840 

GTTCAACTCG GGCACGATTA AGAGAAACGG 870 



(2) INFORMATION FOR SEQ ID NO: 257: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1245 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 257:

CGTTCCCAGA AGCCCGCATT CTCATCGCCA ATGTCGTGAT TGATTTGGCC CTTTCTCCAA 60 

AATCCAACTC AGCCTATGTA GCTATGGATA AGGCACTTGC TGACCTCAAA ACATCAGGGC 120 

ACTTGCCTAT TCCGCGACAC CTGCGTGATG GGCACTACAG TGGAAGCAAG GAACTGGGGA 180 

ATGCCCAAGA CTATCTCTAT CCACACAACT ATCCTGGAAA TTGGGTCAAG CAAGACTATC 240 

TGCCAGAAAA AATTCGTAAT CATCACTATT TCCAAGCAGA AGATACTGGT AAATATGAAC 300 

GGGCTTTGGC TCAAAGAAAG GAAGCTATCG ACCGTTTGCG AAAAATCTGA AATCCTTTTC 360 

AAAAARTTGC A.CTTTCCTCT TGATTTTTTT TGAA.*J''-\CTG CTATCATATA AATATAGAAA 420 

CGCTGTGGTG TACGACTTCA CACTTAAGTG TTGACCGACT ATTTTTTGTA TTATTAGGGA 480 

AACAAAAGTC TTCTAACAGC ATGTAGGCCG TCTCACACGG AAACAGCTTC AGTTAGAGCG 540 

AGTTGCCCAC CTGCTTAATT GCGCGGGTTC AATACAAACC GTGAAGTTTC GGCACCAATA 600 

CAGCTTTTTT CTTTGCCTCC TTAGCTCAGC TGGCAGAGCA GCGGACTCTT AATCCGTGGG 660 

TCACAGGTTC GATCCCTGTA GGGGGCATAT AAATACAACA GGAAAAGCCT TATAATATAG 720 

GGCTTTTTTT GCTTTCCTTT TAAAAATTGT CGTGCAATTT GCCGTGTTTT TACAACAAAC 780

TTTTCACAGC CATAAACTCC TCACTAATTT TTTCCTCCAA GGTATGCCCA TAAACGTCAA 840 

TCAACATGGA GATATCTTTA TGTCCTAAAA TTTGGCTCTT TGTCAACTGT AGTGGGTTGA 900 

AGTCAGCTAA GCTCGAGAAA GGACAAATTT TGTCCTTTCT TTTTTGATAT T.CAGAGCGAT 960 

AAAAATCCGT TTTTTGAAGT TTTCAAAGTT CCGAAAACCA AAGGCATTGC GCTTGATAAG 1020 

TTTGATGAGA TTATTGGTCG CTTCCAATTT GGCGTTAGAA TAGTGTAGTT GAAGGGCGTT 1080 

GACGATTTTC TCTTTGTCCT TTAGAAAGGT TTTAAAGACA GTCTGAAAAA GAGGAGGAAC 1140 

CTGCTTTAGA TTGTCCTCAA TGAGTCCGAA AAATTTCTCC GGTGCCTTAT TCTGAAAGTG 1200 
AAACAGCAAG AGTTGATAGA GCTGATAGTG ATGTTTCAAG TCTTG 1245
(2) INFORMATION FOR SEQ ID NO: 258: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1684 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 258:

ATGCCTATGT AACTCCACAT ATGACCCATA GCCACTGGAT TAAAAAAGAT AGTTTGTCTG 60 

AAGCTGAGAG AGCGGCAcCC AGGCTTATGC TAAAGAGAAA GGTTTGACCC CTCCTTCGAC 120 

AGACCATCAG GATTCAGGAA ATACTGAGGC AAAAGGAGCA GAAGCTATCT ACAACCGCGT 180 

GAAAGCAGCT AAGAAGGTGC CACTTGATCG TATGCCTTAC AATCTTCAAT ATACTGTAGA 240 

AGTCAAAAAC GGTAGTTTAA TCATACCTCA TTATGACCAT TACCATAACA TCAAATTTGA 300 

GTGGTTTGAC GAAGGCCTTT ATGAGGCACC TAAGGGGTAT ACTCTTGAGG ATCTTTTGGC 360

GACTGTCAAG TACTATGTCG AACATCCAAA CGAACGTCCG CATTCAGATA ATGGTTTTGG 420 

TAACGCTAGC GACCATGTTC AAAGAAACAA AAATGGTCAA GCTGATACCA ATCAAACGGA 480 

AAAACCAAGC GAGGAGAAAC CTCAGACAGA AAAACCTGAG GAAGAAACCC CTCGAGAAGA 540 

GAAACCGCAA AGCGAGAAAC CAGAGTCTCC AAAACCAACA GAGGAACCAG AAGAATCACC 600 

AGAGGAATCA GAAGAACCTC AGGTCGAGAC TGAAAAGGTT GAAGAAAAAC TGAGAGAGGC 660 

TGAAGATTTA CTTGGAAAAA TCCAGGATCC AATTATCAAG TCCAATGCCA AAGAGACTCT 720 

CACAGGATTA AAAAATAATT TACTATTTGG CACCCAGGAC AACAATACTA TTATGGCAGA 780 

AGCTGAAAAA CTATTGGCTT TATTAAAGGA GAGTAAGTAA AGGTAGCAGC ATTTTCTAAC 840 

TCCTAAAAAC AGGATAGGAG AACGGGAAAA CGAAAAATGA GAGCAGAATG TGAGTTCTAG 900 

TTCTCATTTT TTTCATGAAA ATGTGCAAAA TATAGTAGAT TGAAACTAGA ATAGTATACC 960 

TCTACTTCTA AAACATTGTT AGAAATCGAT TTGACTGTCC TGTTCTTATT TCATTTTACT 1020 

ATATCTTAAC AGATAGTGTA AATAAAGATA AACTATTTAC TGGCTAATTA ATCAGTTAAA 1080 

CACTAGTTAA GGAGTAATGA TGAAAAAAAG AACAATACTA TTATTGATGG CCAGTCTGTT 1140 

AGCTCTTGTC TTAGGAGCAT GTGGTTTCTT GGACATATTG ATCCTGGATC ATTCTCATCA 1200 

GGATTACTCT TTACTGCTAT TTTAGAAACT GGGGTGGTTT GATGGAAAGT ATTGGTCTTG 1260 

TTATCGTTTC ACATTCCAAA CACATTGCAG AAGGTGTTGT TGAACTGATT AGTAAAGTAG 1320 

CTAAAGATGT TCCGATTACT TATGTAAGAG GAACCGAGGG CGGAGGAATT GGAACGAGTT 1380 
TTGAACAAGT AGATAGGGTT GTTTCCGAAA ATCCAGCAGA TACTTTACTT GCCTTTTTTG 1440

ACCTAGGTTC TGCTAAAATG AACTTAAAAA TGGTGACTGA TTTCAGTGAT AAAAGTATCA 1500 

TCATCAACAG GGTTCCAATT GTAGAAGGTG CCTATAATGC AGCTGCTCTT CTTCAGGCTG 1560 

GTGCAGAACT GTCAGTTATT CAAACACAGT TaGCGGAgCt TGAAATCAAT AAATAAGGAA 1620 

TTTTACTATA ACTCTTTTTA TAGATAAGCT ATTGaTTATC TCAACTATAA TAATGTTAAG 1680 

TnAA 1684 



(2) INFORMATION FOR SEQ ID NO: 259: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 970 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 259: 

AGGAGTGGAG AnATATGAAG ACACAAATTT TCACATTATT GAAAATCGTT GCTGAGATTA 60 

TTATTATTTT GCCATTTCTA ACTAATCTAT AAGTTCTTTA TATTGCTGAA AACGCAATTC 120 

AAAAAGGGCT ATTAATTGTG GATTTTCTAA TACCTGCAGA GATTGGATAA AGCGTTCAAT 180 

CTCTTTTTGA TTGCTTCCCT TTGTTTGAAG AAAGACACTC ATCTTCTTTA AAAATTGCCA 240 

CGATACTTTT TCAAAAACAT CATACGGTCG TAACATCCTC TCCAACTCGG CTTCGAAGAT 300 

TGGGATGTAG GAGAAAAGTT TTCGCTCCAT GAGTTCTGAT AAGATATTTA AGAGTCCTTG 360 

CTTCATATAC AATCGATTGT GTACTAACTC TTTAAATTCT TTGGATTTTT CGAGTAAGGA 420 

GGTTGATAAA AAAATCAGAT CTTGATTGCT CAAGAAGGGC ATGGTATTGC AAAAGAGATA 480 

GAGTTCAAAC CAGGTCCAAG ACTCGATAGC ATAGAGATAG GTGGTCAAAA ACTCGCTATC 540 

CTCCTCTGCT AGTGGGTAGC TTTTATTTAG TGAATGGATG GCATCTTTAA TCACGATGGC 600 

ATTCAAACGA CGATAGGTCT GCGCCATCTG TTCTTGATCG ACTTCCTCCA ATAGCTGCTC 660 

TAAAGCAGCT ATATCCTGAT GGGCAAAGCG ATTCACAACC TTTCGACCGA TTCGCATATG 720 

TGGAGATTCT TGATAGTTGT TGAGCTTGTG CCCAAACTCA TCAAAGGTCA CATTTATACC 780 

TTGGATAGCT AGAATCAACT TATCCGCAGA CAGCATAGAC TGCCCTAGTT CAAACTTGGA 840 

CAACTGAGAA GCTGTTAGAC CCTCACAAGC CACATCTGAC TGCTTGAGCT TTCTCGCCAA 900 

ACGTAATTCC TTGTAAAATT CCCCCAGTTC CATTCTCTCA ATCATCTGAC CACCTCCTAG 960

CTTTTGCAGG 970 
(2) INFORMATION FOR SEQ ID NO: 260:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2996 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 260: 

GTTGACCACG GGTAAAACTA CCCTAACTGC AGCTATCACA ACTGTTTTGG CACGTCGCTT 60 

GCCTTCATCA GTTAACCAAC CTAAAGACTA TGCGTCTATC GATGCTGCTC CAGAAGAACG 120 

CGAACGCGGT ATCACTATCA ACACTGCGCA CGTTGAGTAC GAAACTGAAA AACGTCACTA 180 

CGCTCACATC GACGCTCCAG GACACGCGGA CTACGTTAAA AACATGATCA CTGGTGCTGC 240 

TCAAATGGAC GGAGCTATCC TTGTAGTAGC TTCAACTGAC GGACCAATGC CACAAACTCG 300 

TGAGCACATC CTTCTTTCAC GTCAGGTTGG TGTTAAACAC CTTATCGTCT TCATGAACAA 360 

AGTTGACTTG GTTGACGACG AAGAATTGCT TGAATTGGTT GAAATGGAAA TCCGTGACCT 420 

ATTGTCAGAA TACGACTTCC CAGGTGACGA TCTTCCAGTT ATCCAAGGTT CAGCACTTAA 480 

AGCTCTTGAA GGTGACTCTA AATACGAAGA CATCGTTATG GAATTGATGA ACACAGTTGA 540 

TGAGTATATC CCAGAACCAG AACGTGACAC TGACAAACCA TTGCTTCTTC CAGTCGAGGA 600 

CGTATTCTCA ATCACTGGAC GTGGTACAGT TGCTTCAGGA CGTATCGACC GTGGTATCGT 660 

TAAAGTCAAC GACGAAATCG AAATCGTTGG TATCAAAGAA GAAACTGAAA AAGCAGTTGT 720 

TACTGGTGTT GAAATGTTCC GTAAACAACT TGACGAAGGT CTTGCTGGAG ATAACGTAGG 780 

TGTCCTTCTT CGTGGTGTTC AACGTGATGA AATCGAACGT GGACAAGTTA TCGCTAAACC 840 

AGGTTCAATC AACCCACACA CTAAATTCAA AGGTGAAGTC TACATCCTTA CTAAAGAAGA 900 

AGGTGGACGT CACACTCCAT TCTTCAACAA CTACCGTCCA CAATTCTACT TCCGTACTAC 960 

TGACGTTACA GGTTCAATCG AACTTCCAGC AGGTACTGAA ATGGTAATGC CTGGTGATAA 1020 

CGTGACAATC GACGTTGAGT TGATTCACCC AATCGCCGTA GAACAAGGTA CTACATTCTC 1080 

TATCCGTGAG GGTGGACGTA CTGTTGGTTC AGGTATGGTT ACAGAAATCG AAGCTTAATT 1140 

CGATTTAGTT CCCAGAAGAA CAATTATTTA AGTTAGACAC TAAAAGAATC TTGCTTGGCA 1200 

AGGTTCTTTT TTTAGATATT GAACTAATAC TCAATGAAAA TCAAAGAGCA AACTATAATA 1260 

TATTGAAACT AGAATAGTAC ACATCTACTT CTAAAACATT GTTAGAAATC GATTTGACTG 1320 

TCCTGATCGA TTTGTCTTGT TCTTATTTCA TTTTACTATA GAAAGTTAGC TACAGACTGC 1380 

TCAAAACATT GTTTTTAGGT TGTAGATAGA ACTGACGAAG TCAGtAACAT CTATACGACA 1440 
AGGCGAAGCT GACGCGGTTT GAAGAGATTT TCGAAGAGTA TAATACTAGA CTAAAATCAA 1500

AAAGCATTAT ACAATAGTAA TATGA7ATCA ATTAAAGAAG AAATCCAAAC CATCAAAACA 1560 

CTTTTAAAAG ACTCTCGTAC AGCTAAATAT CATAAACGCC TTCAAATCGT TCTATTTCGT 1620 

CTGATGGGCA AATCTTATAA AGAGATTATA GAACTTTTAT AGTGGTTTGA AATAAGATGT 1680 

GAACAACTCT ATCAGGAAAG TCAAACTAAT TTATAGAAAT ATTTTAGCAG CCAAGGTGTA 1740 

CTGTTATAGA TTCAATACAC TTTAGACTGT AATCAAACAA CGATTTGGCG AAATGTAAAA 1800 

AATATGAGGA GTTCGGACTC GACTCTCTCC TTCAAGAAAC ACGTGGTGGT CGTAACCATG 1860 

CTTATATGAC GGTTGAGCAA GAGAAAGTCT TTCTTGCCCG CCATTTGAAG GCTACAGAGG 1920 

CAGGAGAATT TGTTACAATT GATGCCTTAT TTCAGGCTTA TAAAAAGGAG TTAGGTCGTT 1980 

CCTACACACG TGATGCCTTC TATCAACTGT TGAAGCGCCA TGGTTGGCGA AATATTACGC 2040 

CACGTCCAGA ACATCCTAAG AAAGCAGATG CTCAAACCAT TGTCGCGTCT AAAAATAAAG 2100 

TCTCAATTCA AGAAGACAAG TGAACTGCAC CCCTAAAGTT AGACAGAAAA AATCTAACTT 2160

TTGGGGTGTT TTTATTATGA AATTAACTTA TGATGATAAA GTTCAGATCT ATGAACTTAG 2220 

AAAACAAGGA TATAGCTTAG AGAAGCTTTC AAATAAATTT GGGATAAACA ATTCTAATCT 2280

TAGGTACATG ATTAAATTGA TTGATCGTTA CGGAATAGAG TTCGTCAAAA AAGGAAAAAA 2340 

TCGTTACTAT TCTCCTGATT TAAAACAAGA AATGATTCAT AAAGTCTCAC ATGAAGGCTG 2400 

GACTAAAGAT AGAGTTTCTC TTGAATACTG TCTCCCAAGT CGTACGATAC TTCTTAACTG 2460

GCTAGCACAA TACAGGAAAA ACGGGTATAC TATTGTTGAG AAAACAAGAG GGAGAGTACC 2520 

TGAGAGCGGA GAATGCCATC CTAAAAAAGT TAAGAGAACT CCGATTGAAG GAGGAAAAAG 2580

AGAAAGAAGA AAGACAGAAA TTATTCAAGA ATTAATGACT GAGTTTTCGT TAGATATTCT 2640

TCTAAAAGCC ATTAAACTAG CTCGTTTGAC CTACTACTAT CACTTGAAAC AGCTAGATAA 2700 

ACCAGATAAG GACCAAGAGC TTAAAGCTGA AATTCAATCC ATTTTTATCG AACACAAGGG 2760 

AAATTATGCT TATCGTCGGA TTTATTTAGA ACTAAGAAAT CGTGGTTATC TGGTAAATCA 2820

TAAAAGAGTT CAAGGCTTGA TAAAAGTACT CAATTTACAA GCTAAAATGC GACAGAAACG 2880

AAAATATTCT TCTCATAAAG GAGACGTTGG CAAGAAGGCA GAGAATCTCA TTCAAGGACA 2940

ATTTGAAGGC TCTAAAACAA TGGAAAAGTG CTACACAGAT GTGACAGAAT TTGCCG 2996 
(2) INFORMATION FOR SEQ ID NO: 261: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 837 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 261:

CTTATCAACT CCCGACATGG CTCTCAGACC AATCCAAATC CCTAAAAAAA TCAGAACAAG 60 

GATGGTGGTC AAGATCAAAC TCTCGAAATA TAAAGAAAAT AGTTGCAGTA GCATGATTTC 120 

TCTCATTTCT ATCTTTTTTA AAGAGTAAAC TCAGCTAGTC CAACTAACTG AGTTTTCCTT 180 

TATCTATTAT ATCAAATATA AGTCCGTTTG TAACTAGCGA AGAATTCTTT TGTCCGCTCT 240 

TCTTTAGGGG TGTGGATAAT CTCATCCGGA GTTCCAGACT CGATGATTTT CCCCTTATCT 300 

AAGAAGAGAA TTTTATCCGC AACTTGGGCT ACAAAGGACA TGTCATGACT GACCAAAATC 360 

ATGGTCTGAC CTGACTTAGC AGCATCTGCA ATAGACTTTT CTACTTCACC GACCAATTCT 420 

GGGTCAAGGG CTGAAGTTGG TTCGTCTAAG AGCAAAACAT CTGGTTTCAT AGCAAGCGCA 480 

CGCGCTAGGG CAACCCGTTG CTTCTGTCCA CCTGATAAAT GGCGAGGATA ATGGTTTTCA 540 

CGGTCCGAAA GCCCAACCTT AGCCAACTCT TCCTTGGCAA TCTTAGTCGC TTCTTGGTCA 600

GATAATTTCT TGACAACAAC CAAGCCTTCT TTCACATTAT CAAGTGCTGT TCGGCGTTCA 660 

AACAAATTAA ACTGTTGGAA AACCATAGAC AACTTACGAC GTAGGGCAAG GATTTCTTCT 720 

TGAGTGATTT TAGAAAAATC AACTGAAAAA CCATCAATCT GAATAGAGCC ACTGTCAGGT 780 

GTTTCTAGAT AATTGAGACT GCGAGAAAGG TTGATTTTCA GCTCTGAAGA CCAATCA 837 



(2) INFORMATION FOR SEQ ID NO: 262: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 868 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 262:



CCGAACAAAA TGGGCTAATT AGATTATAGT AAGAAAGGTA AGTTAAAAAT GAGAATTGCA 60

ATTGGATGTG ACCACATCGT AACTGATGAA AAAATGGCGG TTTCAGAATT TTTGAAATCA 120

AAAGGATATG AAGTCATTGA CTTTGGTACC TATGACCATA CACGGACTCA CTACCCAATC 180

TTTGGTAAAA AAGTAGGGGA AGCTGTAACT AGCGGTCAAG CTGATCTTGG AGTATGTATC 240

TGTGGTACTG GTGTTGGTAT CAACAACGCT GTAAATAAAG TTCCAGGTGT TCGTTCTGCC 300

TTGGTTCGTG ATATGACAAC AGCCCTTTAT GCTAAAGAAC AATTGAACGC TAACGTTATT 360

GGTTTTGGTG GTAAAATTAC TGGTGAATTG CTTATGTGTG ATATCATCGA AGCTTTCATC 420

CATGCTGAAT ACAAACCAAC TGAAGAAAAC AAAAAATTGA TTGCGAAAAT TGAACATGTT 480

GAAAGTCACA ATGCTCAACA AACAGACGCA AACTTCTTTA CAGAATTCCT TGAGAAATGG 540

GATCGTGGAG AATACCACGA CTAAGAGGTG ACCTATGATT TTAACAGTCA CAATGAACCC 600 

ATCCATCGAT ATTTCCTATC CCTTGGATGA GTTGAAGATT GATACTGTCA ATCGTGTGGT 660 

GGATGTAACC AAAACGGCTG GTGGTAAGGG ACTCAATGTT ACCCGAGTAC TTTCAGAATT 720 

TGGCGATTCT GTTCTTGCTA CTGGTTTAGT GGGTGGCAAA CTTGGTGAGT TTTTGGTTGA 780 

ACATATCGAT AATCAAGTAA AGAAAGATTT CTTCTCAATT AAGGGAGAAA CTCGTAACTG 840 

TATCGCTATT CTCCACGGAG ACAACCAA 868 



(2) INFORMATION FOR SEQ ID NO: 263: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3744 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 263: 

CCGTTCAAAG TCTTCATAAG ACTCGAAAGT CACAGTTCTT TCGTTCTTGC TGGCATCTAT 60

ATAGGTAATT TCAATCATOT TTAAAACTCC TTTGTTTAAT GCTA.\CTTTA TTTTACTCCT 120 

TATAAAAGAG AATGTCAAGA AAAATGATTG CGCACGCAAC TTTTTTTAAA ATCATCTTAA 180 

ATCAAGAAAT CCAAACCTGC TTCCAAGCTT TCTTCGACAG TCTTTTGTAG CGAGGCCAGT 240 

GTCTTTTGCC CATCATTTGT CAGGCAGATA AAACTAGAGC GTCTATCTTG ATGGCAACAC 300 

ATGCGACTGA GTAGACCGCA ATTTTTAGCT TCCAAGCGAG CCACCATCCT AGAAACTGCG 360 

CTCGGGCTCA GATGAAGCTT ATCTGGCAGG TCAATCTGGC GTAGAGATTT TTCTTCAGCC 420 

AAGTCCAGAT AGTAGAGCAG GTAGAACTCT TTCAAGGTCA GACTTTGCTC GCTCTGTTGG 480 

GCAATGGTCT CTTCCAAGAG ACTTTCAATT TCTTTCTGAC GCCGATTGAA GTCAAACCAT 540

TTTTCCAAAT AGGTCATAGT GTCTCCTTTC TTTTTAGAGT CATAAATAGA AGAAAGTCCA 600 

TTAACGGGCA GTCTCTGCGT CACAAGATGA TTGCGCATGC AATAATTATA CTACTTTTCA 660 

AGAATGCTGG CAAGCTCTGT TTTTTAGTGG TTTTATTTTT GTGTGAATAA TGGGGGAATC 720 

CTATTGTTTC AATTTCTAAC TCCTTATCAC ATTCGAATTC AGATTTTATT TCATTTCTCT 780

ATCTATAGTT GCTTAGTTTA AAATAAGCAT GGTCTAATAA AGCTATGCAT ATAGTACTGA 840

TTTTAAACAA GGAGCATTAG ATTCCATTAA AGGAGGGCAC AGACATGTCG AGGCGGCCAA 900 
AGTTTTTGAT GTCGGCGTCA GAACTCTCTT CACGTGGGAA AAGAAAGACG TAAACAAGGG 960

AACTTAGAGC GGAAAAAGCG AGTCGTCAAA AAGCGTAAGA TCCCTTTAGA AGAATTGAAA 1020 

GCCTTTGTAG AGGCTCATCC AGACGCTTTT TTACGGGAAA TTGCGGCCCG TTTTGATTGT 1080 

GCTTTGCCCT CCGTATGGGC AGTTTTAAAG CAGATTAAGG TCATTTTAAA AAAGACGACC 1140 

AGTTTTAGGG AACAAAAGCC TGAGAAAGTT TCTGAGTTTC TTGATATTTT GGATAACCTA 1200 

AAAGATTTAC CAGTCCTATA TATTGACGAA ACGGGAATCG ACCGCTACCT CTATCGTCCT 1260

TATGCAGGGG CTCCTAGAGG GGAGAAAGTC TATGGCAAGA TTAGCGGACG GCGTTTTGAG 1320 

CGGACTAATG AGGTGGAGCA AAAACTCAAT GGTAGTTTTC TAATCAGATA TATTGATTCA 1380 

CTU^TTAGAG AATGAAAGAA TAATTATGCA TAAAAATAGG AATATAAACC AAAAATTAGC 1440 

TGATTTATAC TCATTTGCGT GTCTTTATAA AAAACTTATC TTATAATATA TATATATATA 1500 

TATACAAAAT AGTAAAATGC tTTTTTTTTT TAGCAAAAAT ACCTCAAGTT TCTTGCTATT 1560 

TTGGGTTCCC TATTCTATAA TTATAGTATG GTAATTTATT TATATCCATA CATGAAAATA 1620 

ATACTCGAAA GGAAATTTCA AAATATTTTT TAGACGTCAG AAGGGTGAAT ATAGAGAAAC 1680 

AGACCGAGTA ACTCGGTTCA AATTAATCAA ATCAGGGAAG CATTGGCTAC GGGCCTCGAC 1740 

TTCTCTTTTT GGCTTGTTTA AGGTCTTGCG AGGTGGTGTT GATACTACTC AGGTCATGAC 1800 

CGAAACGGTA GAAGATAAAG TAAGTCATTC AATTACTGGG CTTGATATCC TCAAGGGGAT 1860 

AGTTGCTGCG GGAGCTGTCA TAAGTGGAAC CGTTGCAACT CAAACGAAGG TATTTACAAA 1920 

TGAGTCAGCA GTACTTGAAA AAACTGTAGA GAAAACGGAT GCTTTGGCAA CAAATGATAC 1980 

AGTAGTTCTA GGTACGATAT CTACAAGTAA TTCAGCGAGT TCAACTAGTT TGTCAGCTTC 2040 

AGAGTCGGCA AGTACATCTG CATCTGAGTC AGCCTCAACC AGCGCTTCGA CCTCAGCAAG 2100 

TACAAGTGCA TCAGAATCAG CAAGTACATC GGCTTCGACA AGTATTTCTG CATCATCTAC 2160 

TGTGGTAGGT TCACAAACAG CTGCCGCTAC AGAAGCAACT GCTAAGAAGG TCGAAGAAGA 2220

TCGTAAGAAA CCAGCTAGTG ATTATGTAGC ATCAGTTACA AATGTCAATC TCCAATCTTA 2280 

TGCTAAGCGA CGCAAGCGTT CAGTGGATTC CATCGAGCAA TTGCTGGCTT CTATAAAAAA 2340

TGCTGCTGTT TTTTCTGGCA ATACGATTGT AAATGGCGCC CCTGCAATTA ATGCAAGTCT 2400 

AAACATTGCT AAAAGTGAGA CAAAAGTTTA TACAGGTGAA GGTGTAGATT CGGTATATCG 2460

TGTTCCAATT TACTATAAAT TGAAAGTGAC AAATGATGGT TCAAAATTGA CCTTTACCTA 2520 

TACGGTTACG TATGTGAATC CTAAAACAAA TGATCTTGGT AATATATCAA GTATGCGTCC 2580 

TGGATATTCT ATCTATAATT CAGGTACTTC AACACAAACA ATGTTAACCC TTGGCAGTGA 2640

TCTTGGTAAA CCTTCAGGTG TAAAGAACTA CATTACTGAC AAAAATGGTA GACAGGTTCT 2700 
ATCCTATAAT ACATCTACAA TGACGACGCA GGGTAGTGGG TATACTTGGG GAAATGGTGC 2760

CCAAATGAAT GGTTTCTTTG CTAAGAAAGG ATATGGATTA ACATCATCTT GGACTGTACC 2820

AATTACTGGA ACGGATACAT CCTTTACATT TACCCCTTAC GCTGCTAGAA CAGATAGAAT 2880 

TGGAATTAAC TACTTCAATG GTGGAGGAAA GGTAGTTGAA TCTAGCACGA CCAGTCAGTC 2940 

ACTTTCACAG TCTAAGTCAC TCTCAGTAAG TGCTAGTCAA AGCGCCTCAG CTTCAGCATC 3000 

AACAAGTGCG TCGGCTTCAG CATCAACCAG TGCCTCGGCT TCAGCGTCAA CCAGTGCGTC 3060 

AGCTTCAGCA AGTACCAGTG CTTCAGTCTC AGCATCAACA AGTGCTTCAG CCTCAGCATC 3120 

GACAAGTGCC TCGGCTTCAG CAAGCACATC AGCATCTGAA TCAGCGTCAA CCAGTGCTTC 3180 

GGCTTCAGCA AGTACCAGTG CTTCAGCTTC AGCATCAACC AGCGCCTCGG CCTCAGCAAG 3240 

CACCTCAGCT TCTGAATCGG CCTCAACCAG CGCCTCGGCC TCAGCAAGCA CCTCAGCTTC 3300

TGAATCGGCC TCAACCAGCG CCTCAGCCTC AGCATCAACG AGTGCTTCGG CTTCAGCAAG 3360

CACAAGCGCC TCGGCTTCAG CATCAACGAG TACGTCAGCT TCAGCGTCAA CCAGTGCTTC 3420 

AGCCTCAGCA TCAACAAGTG CGTCAGCTCA GCAAGTATCT CAGCGTCTGA ATCGGCATCA 3480 

ACGAGTGCGT CTGAGTCAGC ATCAACGAGT ACGTCAGCCT CAGCAAGCAC CTCAGCTTCT 3540 

GAATCGGCCT CAACCAGTGC GTCACCTCAG CATCGACAAG CGCCTCAGCT TCAGCAAGTA 3600 

CCAGTGCTTC AGCCTCAGCG TCGACAAGTG CGTCGGCCTC AACCAGTGCA TCTGAATCGG 3660

CATCAACCAG TGCGTCAGCC TCAGCAAGTA CTAGTGCATC GGCTTCAGCA TCAACCAGTG 3720 

CCTCGGCTTC AGCGTCAAAC AGTG 3744 

(2) INFORMATION FOR SEQ ID NO: 264: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 795 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 264: 

CGATAAAGAG GCCTTGAGTA ATCTCAATTT GCAGATTGAA AATGGAGAGA TTATGGGCTT 60 

GATTGGTCAT AATGGGGCTG GAAAATCGAC CACTATAAAA TCCCTAGTCA GTATCATTTC 120 

ACCCAGCAGT GGTCGTATTT TGGTAGACGG TCAGGAGTTA TCGGAAAATC GCTTGGCTAT 180 

TAAACGAAAG ATTGGCTACG TAGCAGACTC GCCTGACTTA TTTTTACGCT TAACGGCCAA 240 

TGAATTTTGG GAATTGATCG CCTCATCCTA TGATCTGAGT AGATCTGACT TGGAGGCTAG 300 
TCTAGCTAGG CTATTGAACG TTTTTGATTT TGCTGAAAAT CGCTATCAGG TTATTGAAAC 360

TCTTTCTCAC GGAATGCGTC AGAAAGTCTT TGTCATCGGA GCACTCTTGT CTGATCCCGA 420 

TATTTGGGTC TTGGATGAAC CCTTGACTGG TTTGGATCCC CAGGCTGCCT TTGATTTGAA 480 

ACAGATGATG AAGGAACATG CACAAAAAGG GAAGACAGTC TTGTTTTCAA CTCATGTCCT 540 

AGAGGTGGCA GAGCAAGTCT GTGATCGGAT TGCCATTTTG AAAAAGGGGC ATTTGATTTA 600 

TTGTGGTAGT GTAGAGGACT TGAGAAAAGA TTACCCAGAC CAGTCTTTGG AAAGTATCTA 660 

CCTTAGTCTT GCTGGTAGAA AAGAGGAGGT TGCGGATGCG TCTCAAGGTC ATTAAAAAAT 720 

TAGTTGATAT CAATATCCTT TATTCATCTC AAGAAGCTAA TCTGGCTAAT CTACGAAAGA 780 

AGCAGGCTAA GAATC 795 
(2) INFORMATION FOR SEQ ID NO: 265:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2231 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 265: 



TGGTAATGTG CTTGGCAGCw TCCTTGACAC TGCTACTACC ATTTCCCATA GCGACCGACA 60

TACCAACtiCC AGCCAGCATT TCAAGATCAT TATCTGAGTC ACCAAAAGCC ATGACTTGGT 120

TGAGGTCAAA GCCATATTCT TTCCCAACTC GGCGAATGCC TTCTAATTTA GAATTTCCCT 180

GATTGATGAC ATCCGATGCA AAAGGATTGC TACGTGTCAA TTTCAAGTCT TCAAAATCAG 240

CTGCCGCCTT CTCAGATTCT TCTGGTGTCA TCAGCATCAA AACTTGGTAG ATAGGCTGAT 300

TCATCAGGTG AAGCAGGTCC TCTTCCTTTT GGGGAACAAC CTTGCTGACC ATGCGATTAA 360

AAGACTGACT CACCGTCCGA GTTAAAACAG AGGGAACGAA GCGACTAATT CGTTGGGAAA 420

AAGAACCCAG ACCAAAGGAC ATGATTTTAG AACCCAACAT GGCATCCTTG GTCCCTAGAG 480

CAATCTCCGT GCCCTCTTTT TTAGCATAGC TAATTAGATG GCGCAAATGT AACTTGGAAA 540

TAGGGCTCGT GAACAAGACT CTGTCTTTAC TAAAGATATA CTGGCCATTA TAGGTTACCG 600

CAAAATCCAG ATCCAAATCG TCCATCAATT CCTTAACAAA AAAAGGTCCT CGCCCTGTCG 660

CTACGCCAAC TAGTACCCCT TGTTCTTTGA CAATCTTAAT CGCATCCTTA GTGGATTTCA 720

AAACACTCTT GCGATTGTTG ACCAAGGTTC CATCGATATC AAAAAAAACA GCTTTGACTT 780

CCATCCTATC CCAATCTCCC CTTTTGTGAT ACAATGATTA TACCACATTT CAGAAAGAGT 840

GAGTAAATCA TGCCTAAGAA AATCCTTGTT TTACATACGG GTGGAACTAT TTCCATGCAG 900

GCCGATGCTT CTGGCGCTGT TGTGACGAGT TCAGATAATC CCATGAACCA TGTGTCCAAC 960

CCACTTGAAG GAATCCAAGT CCACGCCTTG GACTTTTTTA ACCTTCCAAG TCCCCATATC 1020 

AAACCCAAAC ATATGCTGGT CCTCTACCAG AAAATTAAAG AGGAAGCAGA TAACTACGAT 1080 

GGAGTGGTGA TCACACACGG AACCGATACT TTAGAGGAAA CAGCCTATTT CCTTGATACC 1140 

ATGGAAGTTC CCCATATGCC TATCGTTCTA ACAGGAGCCA TGCGTACtCC AATGAGCTCG 1200

GTAGTGATGG TGTTTATAAT TACCTAAGTG CTTTACGAGT GGCCAGCGAT GACAGGGCTG 1260

CTGACAAAGG AGTTTTGGTC GTTATGAACG ATGAAATCCA CGCTGCCAAG TATGTCACCA 1320 

AAACACATAC GACTAATGTC AGCACCTTCC AGACTCCAAC ACATGGCCCC CTTGGTCTCA 1380 

TCATGAAACA GGAAATCCTC TACTTCAAAA CAGCTGAACC TCGTGTTCGC TTTGACCTTG 1440 

ATCACATACA AGGTTTAGTC CCTATCATCT CGGCTTATGC TGGTATGACA GATGAGCTGA 1500 

TTGATATGCT GGATTTAGAA CACTTGGACG GTTTGATTAT CCAAGCCTTC GGAGCTGGTA 1560 

ATATTCCCAA AGAAACGGCT CAAAAATTAG AAAGCCTTCT GCAAAAAGGA ATTCCAGTCG 1620 

CTCTGGTATC ACGATGCTTT AACGGTATTG CCGAGCCTGT TTATGCATAC CAGGGTGGGG 1680 

GCGTACAGTT GCAAAAAGGA GGCGTTTTCT TTGTTAAAGA ACTCAACGCC CAAAAAGCTC 1740 

GCTTGAAACT CCTCATCGCC CTCAATGCCG GACTAACAGG ACAGGCTTTG AAAGACTATA 1800

TGGAAGGCTA ATACTCTTCG AAAATCTCTG CAAACCACGT CACGTCGCCT TACCCTATGT 1860

ATGGtACTGA CTTCGTCAGT TTCATCTACA ACCTCAAAAA CATGTTTTGA GCTGACTTCG 1920 

TCAGTTCTAT CTACAACCTC AAAAACATGT TTTGAGCTGA CTTCGTCAGT TCTATCTACA 1980 

ACCTCAAAAA CATGTTTTGA GCTGACTTCG TCAGTTCTAT CTACAACCTC AAAAACATGT 2040 

TTTGAGCTGA CTTCGTCAGT TCTATCTACA ACCTCAAAAA CATGTTTTGA GCTGACTTCG 2100 

TCAGTTCTAT CTACAACCTC AAAAACATGT TTTGAGCTGA CTTCGTCAGk TCTATCTACA 2160 

ACCTCAAAAA CATGTTTTGA GCTGACTTCG TTAGTTTCAT CTACAACCTC AAAAACATGT 2220 

TTTGAGCTGA C 2231 

(2) INFORMATION FOR SEQ ID NO: 266: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1310 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 266: 
GAGTCAAAGG CTCCGAGGTT GACTTTTTAC AAGGGGACAG GTGAATATTA TCTAGACCTG 60

TCAGAAATTC TCTTCTTTGA AACAGAAGGG AGCAAGATCT ACGCTCATAA CCAGAAGGAA 120 

GCTTATGAGG TTCGCCTCAA GCTCTATGAG TTGGAGTCTA TCTTGCCTCG CTATTTTAAT 180 

CGAGTTTCCA AGTCAACGAT CGCAAACATC CGTCAGATTT ACTCAGTGGA CAAGTCCTTT 240 

TCAGGAACGG GCACCATTTC CTTTTATCAG ACGCACAAGG AGGTTCATGT CTCACGGCAT 300 

TACCAATCCC TCCTAAAAGA AAATCTAAGA AACATGAGGT AAAAAACATG AAAAAGAAAG 360 

CATTTGGTAT TGTTTTATTG GTTTTAGCAG CTTGGATCTT GCTGCAAGGG AATTTTGGAA 420 

TTCCTTCTTT GGATGGTAAA ATATGGCCTT TACTAGGTAT TGTTTTTTTT GCTTATAAGT 480 

CCATTGAGTC CATCCTTAGA CGTCATCTCA CTTCGGCAGT TTTTACAGGT TTACTGGCGC 540

TCATCATTGC AAATTACGCT TATGACTTGT TACCAGTTAC CAATCATTCT CTTATTTGGG 600 

CTAGCATCTT GGTGGTACTT GGTGTTGGTT ATCTGACGCA TTCAAGTAAG TTCTGGAATG 660 

AAAAAAAATG GTGGTACAAT GGGAAAAAAA CAGTCGTCAC GGATAAGGAA GTCGCTTTTG 720 

GTAGCGGGAC CTTCTATAAG CAAGATCAAG ATCTCGTAGA TGACCAAGTG GAAGTCGCTT 780 

TTGGGGATGC TAAAATCTAC TATGATAATG CAGAGATGCT AGGTGATTTT GCAACTTTAA 840 

ATATTGAAGT GGCCTTCGGG AATGCAACCG TCTATGTTCC ACAACACTGG CGTGTAGATT 900 

TGAAAGTAGA AACCTCCTTT GGTGCAGCTA AGGCTGACGC TCCTGTAGCC CCAACCAGCA 960 

AAACCTTGAT TATCCGTGGA GATGTGGCTT TTGGGAAGTT GGAAATTGTC TACGTTAAAT 1020

AAAAAAATCT TCACTTCAAC CATCAAAATA GACGTACTAA GAGTAGGAAA TTGATGCCTT 1080 

GCTCTGATTT CAGTTCTATG GTTGTTAGAC TTTAAAAAAT GAAATGCTGC CTTTAAAAGT 1140 

TGTATATTTT TCGATATTTT GGCTTTTACG TTTGATGTAT CTATGTACTA CAGCGTAGAT 1200 

GATGTAGTGT CAAATGCTTT TAAAAAACGG ATGATATTGG ACAGTTTTTT TGCCTTTAAT 1260

TGCTCAGGAA CCATGAAAGT CAGTACCTGG GTTTATGACA AGGGAGAATG 1310 



(2) INFORMATION FOR SEQ ID NO: 267: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5922 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 267: 
ACTCTGATTT GATTGGAACG ACAGTCGGTG CCATTGCAGT TACTTCAAAC GTAACGACTT 60 
ATGTTGAGTC TGCTGCTGGT ATCGGTGCAG GTGGACGTAC TGGTTTGACA GCCTTGGTTG 120 
TAGCTATCTG TTTTGCGATT TCAAGCTTCT TTAGCCCACT TCTAGCGATC GTACCAACAG 180

CGGCTACAGC TCCAATCTTG ATTATCGTTG GGATTATGAT GCTTGGTAGC TTGAAAAATA 240 

TCCATTGGGA TGATATGTCT GAAGCAGTTC CTGCCTTCTT CACATCTATC TTTATGGGAT 300 

TCAGCTACTC TATCACTCAA GGGATTGCAG TTGGTTTCTT GACTTACACT TTGACTAAGC 360

TTGTTAAAGG TCAAGTTAAA GATGTTCATG TCATGATTTG GATTTTGGAT GCCTTGTTTA 420 

TCCTTAACTA CATCAGCATG GCCTTATAAT AGAATGACCC AGGGGGATTT CCCCCCTTTT 480 

TTAATACAaG GAGATAGGTG ATGAAAGAGA AAAATATGTG GAAAGAATTG TTGAATCGTG 540 

CAGGCTGGAT TTTGGTCTTT TTACTTGCCG TCCTTTTATA TCAGGTTCCC CTAGTGGTTA 600 

CCTCTATTTT GACTTTAAAA GAAGTAGCCC TGCTACAGTC AGGGCTGATA GTTGCTGGCC 660 

TTTCAATTGT GGTTCTGGCT CTATTTATTA TGGGAGCTCG TAAAACCAAG TTAGCTAGTT 720 

TTAATTTTTC TTTTTTTAGA GCTAAAGATT TGGCACGTTT GGGCTTGAGT TATCTAGTTA 780 

TTGTCGGGTC AAATATACTT GGTTCCATTT TATTGCAACT GTCAAATGAG ACGACAACAG 840

CTAACCAGTC TCAGATTAAT GATATGGTTC AAAATAGTTC GTTGATTTCC AGTTTCTTCT 900 

TGCTAGCCTT GCTTGCTCCG ATTTGTGAGG AAATCTTGTG TCGTGGGATT GTTCCTAAAA 960 

AGATTTTCCG AGGCAAGGAG AACTTGGGAT TTGTAGTCGG TACGATTGTG TTTGCTTTAT 1020 

TGCATCAACC AAGTAATTTA CCTTCTTTAT TGATTTATGG AGGTATGTCG ACACTTCTAT 1080

CTTGGACAGC CTACAAGACC CAACGTTTGG AAATGTCGAT CTTGCTTCAC ATGATTGTTA 1140 

ATGGGATTGC TTTCTGTTTG TTGGCTCTTG TGGTGATTAT GAGTCGGACA TTAGGAATTT 1200 

CTGTTTAAAA GTTTTTATGT AGGAACCGAC CTCTTTCTAC CAGGGAAAGA TGAATGCAAT 1260

CGTGTCCATC TTTTTCTTTT TATGGTAAAA TAGAAAAATA ATATGATGAA AATCCTTGAG 1320

GGAGTGACCG ATATGTCAAG TAAAGCCAAT CATGCAAAGA CAGTTATTTG CGGAATTATC 1380 

AATGTAACCC CAGACTCCTT TTCGGACGGT GGTCAATTTT TTGCTCTTGA GCAGGCGCTC 1440 

CAGCAGGCTC GTAAATTGAT AGCAGAAGGA GCCAGTATGC TAGATATCGG CGGAGAATCG 1500 

ACTCGGCCGG GAAGTAGCTA TGTTGAGATA GAAGAGGAAA TCCAGCGTGT TGTTCCAGTG 1560 

ATCAAAGCGA TTCGCAAGGA AAGTGATGTC CTCATCTCTA TTGATACTTG GAAGAGTCAA 1620 

GTAGCAGAGG CTGCTTTGGC TGCTGGTGCC GATCTAGTCA ATGATATCAC TGGTCTTATG 1680 

GGTGATGAGA AAATGGCTTA TGTGGTAGCT GAAGCGAGAg CGAAAGTGGT CATCATGTTT 1740 

AACCCAGTTA TGGCTCGACC TCAGCATCCT AGTTCGCTTA TCTTCCCTCA TTTTGGTTTT 1800 

GGTCAAACCT TTACAGAAAA AGAGTTAGCT GACTTTGAAA CATTGCCAAT CGAAGACTTG 1860 
ATGGTGGCTT TCTTTGAACG AGCACTAGCG AGAGCGGCAG AAGCTGGTAT TGCACCAGAA 1920

AATATCCTGT TGGATCCAGG AATTGGCTTT GGTCTGACCA AGAAAGAAAA TCTGCTTCTT 1980 

TTACGGGACC TGGATAAACT ACATCAGAAG GGCTATCCAA TCTTTCTCGG AGTGTCGCGC 2040 

AAGCgATTTG TCATCAATAT CCTAGAGGAG AATGGTTTTG AAGTCAATCC TGAGACAGAG 2100 

CTTGGTTTCC GAAATCGGGA CACGGCTTCG GCTCATGTAA CTAGTATCGC TGCGAGACAG 2160 

GGTGTAGAAG TGGTGCGCGT GCATGACGTA GCTAGTCACA GGATGGCAGT TGAAATTGCC 2220 

TCTGCCATTC GTCTGGCTGA TGAAGCGGAA AATTTAGATT TAAAACAATA TAAATAAGAT 2280 

GAAAGAAATT GAAAACAATC AGTGGATTGC TAACTACCGG ACGGATCAAC CGCATTTTGG 2340 

CTTGGAACGA ATGGTGGAAC TGTTAGCTTT GCGTGGCAAT CCCCATCTCA AACTCAAGGT 2400 

CCTCCATATC GGAGGGACTA ACGGCAAGGG CTCGACTATT GCTTTTTTGA AAAAGATGCT 2460 

AGAAAAGCTA GGGTTGAGAG TTGGCGTGTT TAGCTCGCCC TATCTCATTC ATTACACAGA 2520 

CCAGATTAGC ATCAATGGGG AATCGATCTC AGAAGCGAGG CTAGAAGCTC TCATGGCAGA 2580

CTATCAGTCT TTGCTGGAGG GAGAAGCGGT CGCCAATTTA CAGGGCACAA CCGAGTTTGA 2640

GATTATCACA GCCCTGGCCT ATGACTACTT TGCCTCAGAG CAAGTAGATG TGGCCATCAT 2700 

GGAAGTTGGC ATGGGTGGAC TTTTGGATAG TACCAATGTC TGTCAGCCCA TTTTGACAGG 2760 

AATTACAACT ATTGGCTTGG ATCATGTGGC TCTACTTGGT GACACCTTGG AGGTCATAGC 2820 

AGAGCAGAAG GCAGGTATTA TCAAACAAGG GATGCCCTTG GTAACAGGGC GTATTGCTCC 2880 

AGAAGCCTTG GCTGTGATTG ACCGCATTGC GGAAGGGAAA GATGCGCCGA GACTTGCCTA 2940 

CGGGACAGAT TATCAGGTTC GTCATCAAGA AAGTGTGGTG ACAGGGGAAG TCTTTGACTA 3000 

TACAAGTGCT GTCAGACAAG GTCGCTTCCA GACTAGCCTG CTTGGTTTGT ACCAAATAGA 3060 

GAATGCTGGG ATGGCCATAG CTTTACTTGA TACTTTTTGT CAAGAAGATG GTCGAGAGCT 3120 

AGCAAGCAAT GATTTTCTTG GTCAAGCCTT GGAAGAAACA AGTTGGCCAG GGCGTTTGGA 3180 

AATCGTGTCA AGAGATCCCT TGATGATTTT GGATGGAGCC CACAATCCCC ATGCTATCAA 3240 

GGCCTTGTTG GTAACCTTGC AAGAACGTTT TGCGGATTAT CATAAGGAAA TCCTCTTCAC 3300 

TTGTATCAAA ACCAAGGCCT TGGAGGATAT GTTGGACTTG CTGGGAGCCA TGCCAGTTAC 3360

CGAGCTTACT CTAACACATT TTGCGGATAG TCGGGCGACG GATGAAAACG TGCTGAAAGA 3420 

GGCAGCTAAG TCTAGAAATC TCAGCTACCA AGATTGGCAT GATTTTCTAG AGCAGAATTT 3480 

GACAGATAAA AAAGAAGAGA AACAAACAGT TAGGATTGTC ACAGGTTCCT TGTATTTCTT 3540 

GAGCCAAGTG AGGGCCTATC TGATGGAGAG GAAGAACGAG AATGGATACA CAAAAGATTG 3600

AAGCGGCTGT AAAAATGATT ATCGAGGCTG TAGGAGAGGA CGCTAATCGC GAGGGCTTGC 3660

AGGAAACACC TGCTCGTGTA GCCCGTATGT ATCAAGAGAT TTTTTCAGGT CTTGGTCAAA 3720

CAGCAGAGGA ACATTTGTCA AAATCCTTTG AAATTATTGA CGATAATATG GTGGTAGAAA 3780 

AGGATATCTT TTTCCATACC ATGTGTGAAC ACCACTTCTT GCCATTTTAT GGTAGAGCGC 3840 

ACATTGCCTA CATTCCAGAT GGTCGTGTGG CAGGCTTGTC TAAGCTAGCC CGTACGGTTG 3900 

AAGTTTATTC GAAAAAACCA CAAATTCAAG AACGTTTGAA TATCGAAGTG GCCGATGCCT 3960 

TGATGGACTA TCTAGGTGCT AAAGGAGCCT TTGTTGTCAT TGAGGCGGAA CATATGTGTA 4020 

TGAGTATGCG TGGTGTTAGA AAACCAGGCA CTGCAACCTT GACGACAGTA GCTCGTGGTC 4080 

TATTTGAAAC AGATAAGGAT CTCCGTGACC AAGCTTATCG TTTAATGGGG CTATAAAAAG 4140 

AATCCGCTTC AAGCGGATTT TTCTAGAAAG GAATCATTAT GGATCAACTG CAGATTAAGG 4200 

ATTTGGAAAT GTTTGCCTAT CATGGTCTTT TTCCTAGTGA GAAAGAATTG GGGCAGAAAT 4260 

TTGTCGTTTC AGCCATCCTA TCCTATGATA TGACCAAGGC AGCTACAGAC TTGGATTTAA 4320 

CAGCCTCTGT CCATTACGGA GAATTGTGTC AGCAGTGGAC GACTTGGTTT CAGGAAACGA 4380

GTGAAGATTT GATTGAAACG GTAGCCTATA AACTGGTGGA ACGTACCTTT GAGTTTTATC 4440

CTCTTGTCCA AGAAATGAAG TTGGAACTGA AAAAACCTTG GGCACCGGTG CATTTGTCAC 4500 

TAGATACTTG CTCGGTAACC ATTCATCGCC GCAAGCAACG AGCCTTTATC GCCCTAGGAA 4560

GCAATATGGG AGATAAACAA GCAAACTTGA AGCAAGCCAT TGACA?J>.CTG CGAGCTCGTG 4620

GCATCCATAT TCTCAAAGAG TCCAGTGTCT TAGCGACGGA GCCTTGGGGT GGAGTGGAGC 4680 

AGGATAGCTT TGCCAATCAA GTGGTTGAGG TGGAAACCTG GCTACCAGCA CAAGACTTGT 4740 

TAGAAACCTT GTTAGCCATT GAGTCAGAGC TGGGACGGGT GAGAGAAGTG CATTGGGGAC 4800 

CTCGTTTGAT TGATTTGGAC TTGCTCTTTG TGGAGGACCA GATCCTTTAT ACAGACGACC 4860 

TCATATTGCC TCATCCTTAC ATAGCGGAAC GCCTTTTTGT CCTTGAGTCt TACAGGAAAT 4920 

TGCGCCTCAT TTTATCCATC CGATATTAAA ACAACCGATC CGCAACTTGT ATGATGCTTT 4980 

GAAAAAATAG AAAAACTCTA GTTTTCAGTT ACTTGCAACT GAAGGCTAGA GTTTTTATAC 5040 

TCTTCGAAAA TCTCTTCAAA CCACGTCAGC GTCGCCTTAC CGTACTCAAG TACAGCTTGC 5100 

GGCTAGCTTC CTAGTTTGCT CTTTGATTTT CATTGAGTAT TAAAATAGGT CATTTTCTTC 5160 

TGGGAGGAGG ATAGTTTCTC TACCGTCCAT GTCTAAAACC AGTACTCTTG GGGGATAACG 5220 

AGGGTCGAAA GGATGGTTAA AGTCAAAATC AATGGCTGTA GGGAGGTGTT GACTTGAAAA 5280 

GTGGAAGGTA ATCTTTCCTT GGTTATTAAG CAATTGAAAC TCGAGTTCTT CTTCCAATTC 5340 

AAAGACATTT TTTAAGAAAT GGTCGATGAT ATACCAAAAA GAGTCAATGA TGTCATCAGG 5400 




CAAGCTGGTA ACAATACCAA AACTAGCAGA TCGCATGTGG GTATTGGTAA AAGCCATATC 5460

TCTGTCCCCT TTCTTTTCCC TTATCATACA GCAAATAGGA TTAAAAATCA AGAAAAGGTG 5520 

ATTTTTTGAA AAGGATTTTA GTTACAGGGA GAAATAGGGA AAAAATTCCT AAAAATCTAC 5580 

CGAAGTTAAT AGGTAAATTC CCAAATTAAC TTGATTATAT AACTTTCAGT TACTTTGAGA 5640 

AGTTACCGAA AAATATTTTT CATATCTATT GACTTTTAGG GGTAAAATTT GGTATGATAG 5700 

TAGGCGGTAT TGTTTACCCC ATTTGAAAGG CCCCGGAACC TTCCAAATAC TTTTCGATGG 5760 

GAAGGAACAC CCATCACCGT AAACAAAAAT CGAACTATAT ATAGGAGAAA TCATGAACAA 5820 

AACAACATTT ATGGCTAAAC CAGGCCAAGT TGAACGTAAA TGGTACGTAG TTGACGCAAC 5880 

TGATGTACCA CTTGGACGTC TTTCTGCAGT AGTTGCTAGC GT 5922 



(2) INFORMATION FOR SEQ ID NO: 268: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1988 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 268:

TAACTATCTA CGATGAGCTG TTGTGATTCT CATTAGTTCC CCTTTCCCAA GAGGCATAGG 60 

GGTGCGCATA ATAGATGTGC TCCTCAGAAA ATATATCAAA CAAGCGATTG AATTCCGTTC 120

CATTATCTGC CGTGATGGAA AGAATCTTGT GTTGTTTTAA GATGAGTTTT AGAGCCTGAT 180 

TGACCACCTC AGCACTTTTA TTTGGAATCA ATCGGATGAT CTGATGTCTA CTCTTTCGAT 240 

CCGTCAAGAC AATCAAGCAG TAGTTTTTCG ATCTCGTAAG TAGAACCGTA TCAATCTCAT 300 

AATGCCCATT CTCCAAGCGA AGATTGATAG CTTCAGGCCG CTGTTCGATG GATTGACCAG 360

CAGGTTTAAA GTTGGTGCTA GCCTGTTTCT TAAGCGCTTT TCCTTTTCTA GGGTAAAGCA 420 

AATCCTGCTT GCTTAACCCC AATTTTCCAT GATGAATCCA ATAGTAAATG GTTGAAATTC 480

CCACGTTAAC CCCTTTAGCC ATAACCATCA TTTCAGGCGA AAATTTTTGG TTATGATAGT 540 

GGAGAATCTT TTCCTTTAGT TCCTTGGTCA AGCTTGATTT CTTGACCGAG CGCTTGCGAT 600 

TGTTTTCATA AGACTGTTGA GCGTAGTCGG CAGAATAAAC CTCTTTGAAG CGCCCTTTTC 660 

CAAGACATTG TCGGACTGTC CCACGCTTGA TTTCAGTGTG ATAGTTTGAG GAGCTTTTCC 720 

AAGTAGAGAG GCAATTTCTC TATTTGATTT TCCTTCTTTT TTCCATCTTT CGATTAAGCG 780 

ACGGCTATCG ATTGTCAAAT GTTTGGCTTT TGTAGTATAA TTGTCTTGCA TCTCTGTGCC 840 

TTTCTTGTGT TTGTGGTTGA ACAACAAGTA TAACACAGAG GTGCTTTCTT ATGCCTACAA 900 
GAGCTTTCAT TATTTCCATT TTCTTTTGGA TTTCACTCTA TTCTGAAAAA CTTGTGTATA 960

TTTACTGAAG CTAGCAAGTC TTACCTGTAA ATTTAATGAA AGCAACACAA AATCCGAGAG 1020 

GGGAATCTCG GATTAATAGA TAGAGAGTTT TTAGTTTAAA TAAATTGTTT AAAATATCAA 1080 

CAACATCACT TCTTTTCTTA ACCTGATAAG TCTTGATTCC TAATTTTGGG GCTACGATTA 1140 

TATTGTCCTC AATATCGTCT AGAAAGACAC AATTTCTAGG TTATAACTGG TATTTATCGA 1200 

TAGTTACTCA TATACATCAG TCCACCTCCA TACTTATGTG CGAGCCTCTC TTTGTATTAT 1260 

ACCTCCATAC TCACCTTACA GATTCTTTTG GTAATAATAT CTTTGCCTAA TGTAGAGACA 1320 

GTCTTGCAAA GAAAAAACTT CCTTGTAGCC ATGTTTCTGA TAAAAGTCCG GTGCCTGGAA 1380 

CTGGTAAGTA TTGACAAAGG CAAAACAACA ATTTCGATTC TTAGCTTCAC TTTCTGCCTG 1440

TTGCAATAGT TTTGAACCGA TTCCTTGCCC TCGCAGTTCC TCTTTTACAA ACAAATACTC 1500 

GATTTCTAGC CAATTTCCAA AAGTCTCTGC TATCAAACCT GCCAGGAGAT TGCCCTTTTC 1560 

ATCTTCGACA TAAAGATTAA GTGGCTCACT TTCAGCCTCT TCTCTTTTTG AACGGTTATA 1620 

AACACGAATC AGATTCCCTA TTTCTTGCGA TTTATGTGAT TCCTTATTTT CCAATCTAAA 1680 

GTATAGTGAA ATGAAATAAA ACATGCGCAA ATCGATTAAG GAATTTAATC TAATTTCTAA 1740 

CAATGTCTTA GAAATCAAAG TGTACTATTT TAACTTCAAT GCACTATACA TCTAATACTC 1800 

AATAAAAATC AAAGAGCAAA CTAGGAAACT AGCCGCACGT TGCTCAAAAC ACTGTTTTGA 1860 

GGTTGTAGAT AGAAcTGACG AAGTCAGCTC AAAACATAGT TTTGAGGTTG TAGATGAAAC 1920

TGACGAAGTC GGCTCAAAAC ATGGTTTTGA GGTTGTAGAT GAAACTGACG AAGTCAGCTC 1980 

AAAACAGG 1988 



(2) INFORMATION FOR SEQ ID NO: 269:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 709 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 269: 

CCGGATATTT GTTTTATGTA ATTTTCTTGC AAGTTTCTTC TTAGTAGCTT GTCAGTCAGG 60 

TTCTAATGGT TCTCAGTCTG CTGTGGATGC TATCAAACAA AAAGGGAAAT TAGTTGTGGC 120 

AACCAGTCCT GACTATGCAC CCTTTGAATT TCAATCATTG GTTGATGGAA AGAACCAGGT 180 

AGTCGGTGCA GACATCGACA TGGCTCAGGC TATCGCTGAT GAACTTGGGG TTAAGTTGGA 240

AATCTCAAGC ATGAGTTTTG ACAATGTTTT GACCAGTCTT CAAACTGGTA AGGCTGACCT 300

AGCAGTTGCA GGAATTAGTG CTACTGACGA GAGAAAAGAA GTCTTTGATT TTTCAATCCC 360 

ATACTATGAA AACAAGATTA GTTTCTTGGT TCGTAAGGCT GATGTGGAAA AATACAAGGA 420 

TTTAACTAGC CTAGAAAGTG CTAATATTGC AGCCCAAAAA GGGACTGTTC CAGAATCAAT 480

GGTCAAGGAA CAATTGCCAA AAGTTCAATT AACTTCCCTA ACTAATATGG GTGAAGCAGT 540 

CAATGAATTG CAGGCTGGAA AAATAGATGC TGTTCATATG GATGAGCCTG TTGCACTTAG 600 

TTATGCTGCT AAAAACGCTG GCTTAGCTGT CGCAACTGTC AGCTTGAAGA TGAAGGACGG 660 

CGACGCCAAT GCCGyTGCTC TTAGAAaATA GTGATGATTT GAAAGAAGT 709 



(2) INFORMATION FOR SEQ ID NO: 270:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1680 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 270: 



TATAAAATGT TAAGTTAAAT GATTTCAAAA TTCAGAAAGG GATTGCTTTA TGCAGTTCCT 60

TTTTATTTTA ACAGGAGTGA AACTATAGTG TTTCTAAATT GTGAATCAAT CAAAACTGAT 120

TGTGATGGGG CTATTCTAGC TTTAGAAACC TTCAAAAATT AAAATTTAAG GCAATCAATT 180

ACTTGGAAGA GTATGAAAGC ATTTAGTTTA TAGGAATTCT AGGTCTAGAA TTACATATAT 240

ATATTTATGA AGACGGGGTG TTCGATAGTT AGTATTGTTC TATTCTGAAA GATTTGAGCT 300

GTCAGTTGTA TAGAAAGTGT TCGAATTTTT TTAAGTGATT AAATTAGTTA ATTGTATGAG 360

GTGCTTTATG ATATAATGTT CTTAATGAAT TTTCAGAAAG GAAAACCTCA AATTGTTCTA 420

CAAATTTCTA CTCTTCGACC TCGACCACAC TCTTCTTGAT TTTGATGCTG CTGAGGATGT 480

GGCTTTGACC CAACTTCTAA AAGAAGAAGG AGTTGCGGAT ATTCAGGCTT ATAAAGATI'A 540

TTACGTTCCT ATGAACAAGG CTCTCTGGAA AGACTTGGAG CTGAAGAAAA TCAGTAAACA 600

AGAGCTGGTT AACACGCGCT TTTCTCGTTT ATTTGCTCAT TTTGGACAGG AAAAAGACGG 660

TAGTTTTCTT GCCCAGCGTT ACCAATTTTA CCTCGCCCAG CAGGGACAAA CACTATCGGG 720

CGCTCATGAT CTCTTGGACA GCCTCATTGA GCGTGATTAT AACTTGTATG CTGCGACAAA 780

TGGCATTACT GCCATTCAGA CAGGACGTTT GGCTCAATCT GGTCTAGCAC CTTATTTCAA 840

TCAAGTCTTT ATCTCAGAAC AGTTGCAAAC TCAAAAGCCG GATGCTCTTT TTTATGAAAA 900

GATTGGCCAG CAAATTGCTG GATTTAGTAA AGAAAAGACG CTGATGATTG GAGATTCTCT 960

AACCGCCGAC ATTCAAGGTG GCAATAATGC GGGGATTGAC ACTATCTGGT ATAATCCTCA 1020

TCACCTCGAA AATCACACAC AAGCCCAGCC GACTTACGAA GTCTATTCTT ACCAAGACTT 1080 

GCTGGATTGT TTAGATAAAA ATATTCTTGA AAAGATCACA TTTTAAAGGA GACGAGCTAA 1140 

TGACTACAAA AAAGCTAATA TTACTATTGA AGAGTACATT GAAATGTCTG AAGTTGATTT 1200 

TAATGAAGCT GTTAATTATG AATTTACATC TGACACTTGT CAATTAGCAA ATAGTATTTA 1260 

TCAATCTCTT TTTAAGTTTT TTGATAAGAA AAATTTCTCT GGCGATTTAA TTTTTACTTG 1320 

GAAATCTCCA TCATTAGTCA AAGAAGGGGA TTATATTGGG AGAAGGGATT CACAAGTAGA 1380

TAATCTTAGA GTAATAGGAA ATATATTTCC GAATTATCTT ACTAATCGAA AATATAGCCT 1440 

CAATATGAAT CGTAATGGCT GTATGGGAGA TTTTCCTCAT GACTTTTTTG ATATATACCT 1500 

AGATCATGTA GCAAAATATG CCTACGAACA AAAAGTTAAT AATATTAAAG AGTATTATCC 1560 

TTTAAAAAGA GCGATTTTAC ACCAAGAGAA TGCATTGTAT TTTCGATTTT TTTCTAATTT 1620 

TGACGACTTT TTAGAAAAAA ATTATTTAAA GACTATATGG CAAGTTTCTA AAGAAACTCC 1680 



(2) INFORMATION FOR SEQ ID NO: 271: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 598 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 271: 

AGCTCGGTAC GTAGTATnTG TGGTGCATAA ATGAGTGAAA AGAGGATAGA GAGGATGAGG 60 

CCGATAAGAA CACCGGTAGC TGCATCGTGA AATACTTGTT TTTTCATAGT TCTAATTTCT 120 

CCTTGATGGT TTTTAGATAA CGGCGTGAAG AGTAGGTGAA GCTTTCGTTT TTCAAGAAAA 180 

TTTCTACCAG ACCGTTTGGC GTGAgCTTGA GGTGAGAGAT GGAATCGATA TTGATGATTT 240 

CTGATTGGGA AATTTGGATA AAATTGGTTG GCAAGAGTTT AAGAACCTGA TAGAGTCGCA 300 

AATCAATGCT GTAGGTCTGA CTCGCGGTTT CTGCTAGAAC CTTCCGATTC TCGATATAGA 360

AGCGCTGAAT CTTGCCAATC TCAACTAGAT AGACCTGATC ATCGATTTTT CCTTTGATTT 420 

TTTCTCTTTG GTCCAGATTT TCTGCGAACT CGATGACTTT CTGGACTTTT TCGGTTTCTT 480 

GAGGTGCTTG GACAATCAGC TTTTCCTCCT CGTAAGTCTC ACTAATCTGT AGTTCTACTT 540 

TCATAGTTTT CTCTCCTTTT CAGTTATACA AGGTTGTGAT CACTTCCTGT ATATCCGG 598 
(2) INFORMATION FOR SEQ ID NO: 272: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1099 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 272:

CCAGCAAATC AATAACTGCA ATTGCTATAA AATGGATTCT ATAGAGTTTT TTCATGACAA 60

GACCTCCCTC TTTTATCTAA CTTCATTCTA CTCCAAAAGA ATGGGAGTTA CAACTAAAAT 120

GATAAAAATA GCAGAAGGGA GATTCTCTTA AGTTGGCTAG TATTCTTTAT TTGAGTTTCC 180

TTCTATTATC TAACTTCTTC ATCATTCCAG ACAAATAAAG CTCCGATTGC ATTGAGGATA 240

TAAAAGATGT ATTTACCGAT ATTGGCGAAG TTTCCTTGAA TACCAGCTTT TGTCAGCTGA 300

ACGAAATTGT AAATCAACCA AAAGCCCCAC TGAGTTGTTA GTTTTAATGC ATTCAAAGCA 360

TTGGCAATGA GGGACAGTGC AAAGGCAATA GTTGTTACGT AGGCAAGGAG ATTCATCTTG 420

CCCCCATATC CGATATAGTT GGTCACAAAG GCAAAGAGGA AGGCGATGAT GGAAATGATG 480

ATGGCCGCCA ATTTTACCTG TTTTTGGCTC ATTTGGTTGG GTCTGCCTTC TTGCGAAGCT 540

TCCCACTTCT TTATAGCAAA GGTATAAATG AGGAAGGTGA CGGGATAGGT AATGATGGCC 600

GCCTTATTTC CAAGGATATA ATCAATAGCA CCGGACAAAA TGGTATTAAC AATACCAAAG 660

TAATTTCCCC ATTTGCTTAA TTTCCCCGTG            ACAACATGGA AATCCCAaCg 720

TTGGTTACGG AAATCAATCC AAAGGGTACA AGAGCTGTCC ATGATCCCCA GTCTACAAAT 780

TTATCGAGGT GTGAGTTGAG GTAACCAGAT GCAATCGCAA TCCCAACGAC CAAAGCAACC 840

CCGAAGAGGT CAAACTATTT AGATGTAGCA AAAATTTTTA GTGATTTTTT CATAGGTTAA 900

ACTACCTTTC TTTTTTTCAA ATATTCTCCC ACCAAATGAA AGTAAAATAA AATGATAGAA 960

ATAAAACCCT GAAAATAAAG GTTCTATAAT ATTTGTAGTG GGTAAATCCA CTATAGATAT 1020

TATGGAGCCT ATTTTATTGT AGAAAAAAAG TCCCATATGA CCTATAATGA AAAGCGACAA 1080

AACAACTCAT TAGAAAGAT 1099


(2) INFORMATION FOR SEQ ID NO: 273: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2723 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 273:

CTGGGATTCA CGTGAAAAGG AAGCCCAGAG AGTAGCCAGG TGTACTGCTA GAACAGTGAG 60

TGAAATTGAA TATTACCATA GAGAGTCAAC CCAGATAGCT CAGGCTTTAG TTGAAAATCA 120 

AGCTCGTATC GAGGGAATCT ATAAATACTT TAGCCTTAGC ATGCCAGACT ATTTTTACTG 180 

GCAATTAGAG CGGAAAGCTT CGCCTTATAT ATCAGTCTCT CTGTATGAAA ATGTTGATGA 240 

CCTCTATGTT CGAAATGATT TTGTAACTGG GGTGGCCATT GCTTTTCAAG ATTACAAGGA 300

AGTCTATGTT TCTACTAAAG ACAAACGTAG GkJcAGAAAAA ATCAGGGCTG AGGATTTCAA 360 

ACCAGCAGGA AATAGTTTTG CCATTCCAGT GTCAGATCCA GTGTCAGATC AAGACTTAGG 420

AGTGATTTAC ATCTCCTTGG ATCCTGCTGT TTTATACCAT GCCATTGATA ATACTAGAGG 480 

TCATACTCCG ATGGCAGTAA CAGTGACCTC ACCTTTTGAT ACGGAGATTT TTCATATGGG 540 

TGAGACAGTT GATAAGGAGA GTGAAAATTG GCTAGTTGGC TTAACTTCTC ATGGATATCA 600 

GGTTCAGGTG GCAGTTCCTA AAAACTTTGT TTTACAAGGA ACAGTGACTA GCTCTGCTTT 660 

GATTGTGGGT TTGAGCCTTC TCTTTATTGT CATTCTTTAT CTGACTTTGA GGCAGACTTT 720 

TGCTAATTAC CAAAAGCAGG TAGTGGATTT AGTAGAATCC ATTCAAGTCA TTGCTCAAGG 780 

CGAAGAGGGG CGTCGGATTG ACATTTCCGA GAAAGATCAG GAATTACTCC TAATCGCGGA 840 

GACGACCAAT GATATGTTGG ATCGATTGGA AAAGAATATC CATGATATTT ACCAGTTAGA 900 

GCTTAGTCAA AAAGATGCCA ATATGCGAGC CTTGCAGGCG CAAATCAATC CTCATTTTAT 960 

GTATAATACG CTGGAGTTCT TGCGCATGTA TGCAGTTATG CAGAGTCAAG ATGAGTTGGC 1020 

AGATATCATT TATGAATTCA GTAGTCTCTT GCGTAACAAT ATTTCCGACG AAAGAGAGAC 1080 

CCTCCTCAAA CAGGAATTAG AATTTTGCCG TAAATACAGC TATCTCTGCA TGGTTCGCTA 1140 

TCCCAAGTCC ATTGCCTATG GTTTCAAGAT AGATCCAGAG TTAGAGAATA TGAAGATTCC 1200 

CAAGTTTACC TTGCAACCGC TGGTAGAAAA CTATTTCGCG CATGGTGTTG ACCACAGGCG 1260 

GACAGATAAT GTGATTAGCA TCAAGGCTCT TAAACAGGAT GGTTTTGTGG AAATTTTGGT 1320 

GGTCGATAAT GGTAGAGGAA TGTCGGCTGA AAAGTTGGCA AATATCCGAG AAAAATTAAG 1380 

TCAGAGATAT TTTGAACACC AAGCCAGCTA CAGTGATCAA AGGCAGTCTA TCGGGATTGT 1440 

CAATGTACAC GAGCGTTTTG TGCTCTATTT TGGAGACCGC TATGCCATTA CTATAGAGTC 1500 

TGCAGAGCAA GCCGGTGTTC AGTATCGTAT TACAATTCAA GATGAGTAGA AAGGGAGAAA 1560

ATGTATAAAG TATTATTAGT AGATGATGAG TACATGGTGA CAGAAGGTCT GAAGCGTTTG 1620 

ATTCCCTTTG ATAAGTGGGA TATGGAGGTC GTCGCAACAG CCAGTCATGC CGATGAAGCT 1680 

CTAGAATATG TTCAGGAAAA TCCTGTCGAT GTCATCATTT CCGATGTCAA TATGCCAGAC 1740

AAAACAGGGC TTGATATGAT TCGGGAGATG AAAGAGATCT TACCAGATGC TGCCTATATC 1800

CTGCTCTCAG GTTATCAGGA GTTTGATTAT GTAAAAAGAG CAATGAACCT TAGTGTGGTG 1860

GACTATTTGG TCAAGCCTGT TGATAAGGTA GAGCTGGGAA ATCTGCTGGA GAAGATTGCA 1920

GGTCAGCTCG GCGAGAGAGG GAAGAAAAGT CAGACTCTTA GTCAAGAATT AGACGAGGCT 1980

GGATTTGTTA GTTATTTAGG GGATAAGGAG AATTGGTGGA TAGGTCTATC CAAGGAAAAA 2040

CAAGGTTCCT TCACCATTCC CTACTATGTC TTGGGTCAAG ACTGGCAGAT TTTCATTTCT 2100

GGCCACCCCC TAGATGGTTT AGTCGTTACA CCTTTTGAAG CTCCTTATCA AGAACACTTT 2160

GAACGCTGGA AGCTGAATGC TGAGAAAACC CTCTTTTACG GTTCTGTAAA TCTGCAGCAG 2220

TCTGAGAGTC TCTTTGCCTA TTACGAACCG ATTTATAGGG TTATCATTCA GGGAAATCTC 2280

AATCAAATCG TAGAAGAGTT AAATCTCTTG GAGAAGGTAG TTCTTGAAAA TACACCTCGT 2340

GTTTCGATTA CTAAACAGCT TTTTATCCAG TTTGTCATGG ATGTTTTCCA TTTATTTGAA 2400

CATCTCAAAG CTGATGATAT GACGGACATT GTCAAAACCA TTCATGCTAT TCAATCCTTC 2460

GATGAATTGG TTTCTTATAT CAAGGAAACT CTGATCAGCT TTTTCGGTCA ATACCGTATG 2520

AATGAAAATG TGGTCAGTGT GCTGGAAGTC ATTGGTCGTG ATTACCAAAA AGAGCTTTCC 2580

CTCAAGGATA TCAGTAAGGC CCTCTTTATC AATCCTGTCT ATCTAGGGCA GTTGATTAAG 2640

CGTGAAACCG ATTCGACCTT TGCAGAGTTA CTAAACAAAC AACGTATTAA GGCTGCCCAG 2700

2723



(2) INFORMATION FOR SEQ ID NO: 274: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 836 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 274: 

CCGCAGTTTT TTTAAACCGT ATATAAGTAT AGCATAGTCA AAAAAAGAAT GCAAGATTTT 60 

TGCAAACTTT TTTAAAATTT TTCGTAATTT TTCTTTTAAA GTTCTACTGT CAGGACTTGA 120 

CCTTGCTTAA CAACCTGTTC TCCGGCGATA TAAACATCAT CTACATCACT AGATTTAACT 180 

GCATAAACCA GGTGAGACAG CATATTTTCC TGAGGTTGGA GATGAATTTT CCCTTGTGGT 240 

TGAATGACCA GAAAATCTGC TTGCTTGCCG ACTTCCAGAC TTCCTATCTG ATTTTCCATT 300 

CCAAGGACCT TAGCCCCTTC GATTGTCAGT ACCTTGAGAG CTGTTTCGAT TGGAAACTGG 360 

CTGGCATCCC CACTTTTCAT CTTCTGAAGA AGAGCTGCAG TCCTTCCTTC CTCAAACATA 420

TCTAGATTGT TATTGGAAGC AACCGAGTCA GTCGCAATTC CGACTGCTAC TCCCGCTTTT 480

TGGAGCTGGA TAATTGGAGC AATTCCTGAT GCCAGTTTGA GGTTACTGAT AGGATTGTGG 540 

GCGATAGCnA CTTGAGAAGA TGCCAAGCGT TCAATTTCTC TCTCGTTTAA TTCGACCCCG 600 

TGAGCAAATA CGGACGGATG ATCTAAATAA CCCAGTTCTT CAAGAAAAGC AAGGGGGCGT 660 

TTGCCGTATC GTTTGAGGAT AATTCCTGAC TCCTCCTTGG TCTCCGCCAC ATGGACATGG 720 

AGCGGAATAT TTAGCTCTTT TGCCATTTCC AAACTCGCTT CCAGCAAGTC TCTACTGCAG 780 

CTATACGGAG AATGAGGTGC TACCATAACC TTGAAATTTG GATTTTTATA TTTTAA 836 
(2) INFORMATION FOR SEQ ID NO: 275: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2335 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 275:

ATTTTATTTC ACTTTTTAGG TGGTCTGGGG CTATTCTTAT ATAGCnTCAA GACCATGGGA 60 

GACGGTTTAC AACAAGCTGC TGGAGATCGC CTGGGTTTTT ACATTGACAA ATATACTAGT 120 

AATCCTTTGT TTGGAGTTCT GGTTGGTATT GGGATGACTG CTCTAATTCA GTCTAGTTCT 180 

GGTGTAACAG TTATCACAGT CGGCCTGGTC AGTGCCGGTC TCTTAACCTT ACGTCAGGCT 240 

ATCGGGATTG TCATGGGTGC TAATATTGGG ACAACTGTCA CATCCTTTCT CATCGGTTTT 300 

AAATTAGGTA ACTATGCCCT ACCTATGCTC TTTATCGGTG CCGTCTGTCT TTTTTTTACG 360 

AAAAATCGGA CAGTCAATAA TATCGGACGC ATCCTCTTTG GTGTCGGTGG TATCTTTTTT 420 

GCCCTCAATC TCATGAGCGG CGCAATGGCT CCACTCAAGG ATTTACAGGT CTTTAAGGAC 480 

TATATGATTG AGCTAAGTAA GAATCCTGTT TTGGGTGTCT TTGTCGGTAC TGGCTTGACC 540 

TTGCTAATTC AAGCTTCTTC GGCTACCATT GGGATTTTAC AAAACCTCTA CGCCGGCAAT 600 

CTAATTGATC TACAGGGAGC TTTGCCAGTT CTATTTGGTG ACAATATCGG GACAACCATT 660 

ACAGCCATCA TTGCCTCTTT AGGGGCTAAT ATTGCAGCTA AACGGGTAGC AGGAGCTCAT 720 

GTTGCCTTCA ACGTTATCGG AACAGTTGTC TGCGTTATTT TTCTAGTTCC TTTTACTGTC 780 

CTGATTCATT GGTTTGAAGC TACGCTAT^T CTAGCACCGG AAATGACCAT CGCCTTTGCT 840 

CACGGAACCT TTAATATTAC CAACACCATT GTCCAATTTC CATTTATCGG AGCTCTGGCT 900 

TACTTTGTAA CCAAGATTAT TCCTGGAGAG GACGAGGTTG TCAAATACGA ACCCTTATAT 960

CTTGATGAAC ATTTCATCAA ACAGGCCCCA TCTATCGCTC TAGGAAATGC TAAGAAAGAG 1020

CTCTTGCACT TAGGAAACTA CGCTGCTAAA GCCTTTGACC TTTCCTATAA GTACATCATT 1080 

GACTTGGATG AAAAAGTTGC TGAAAAAGGG CATAAAACCG AAGAAGCAAT TAACACCATC 1140 

GATGAGCAAT TAACACGTTA TCTCATTGCC CTTTCAAGCG AAGCTCTCAG CCAAAAAGAA 1200 

AGTGAAGTGC TTACCAATAT CCTTGATTCC TCCCGTGATT TGGAACGGAT TGGAGACCAC 1260

ACGGAGGCTC TACTCAATCT GACTGACTAT CTTCAACGGA AAAATGTTGA ATTTTCTGAT 1320 

GCCGCCTTGA AAGAATTAGA GGAAGTTTAC CGCCAAACTA GTGACTTTAT CAAAGATGCT 1380 

CTGGATAGTG TGGAAAACAA TGATATTGAA AAAGCACGCA GTCTTGTAGA ACGTCATGAA 1440 

GCAATCAATA AGATAGAACG TGTTCTCAGA AAAACCCACA TCAAACGCCT CAACAAAGGC 1500 

GAATGTTCAA CACAAGCTGG GGTCAACTTT ATCGACATCA TCTCACACTA CACTCGTGTA 1560 

TCAGACCACG CTATGAACCT TGCTGAAAAG GTTTTTGCAG AACAAATCTA AGAACCAAGA 1620 

AGCTATCCAT CATAATTGGA TGGCTTTTTA CTTTTTCCTA AGCAAGACTA GGATGAATGA 1680 

AACTGAAAGA GTATTCTGCA GATATATAGT CCCCAATTAT TCACCCCAAA TCTAAAAACC 1740 

ATCCAGAATC CTTGCCTTAG CTTAGATCCT GGATGGTTTC TTTTTTCACC CAATGGGTGT 1800 

TTTTTACTAG ACAAAAAAGA GTTTCCCCTT TATGGTATAA GTGTAGAAAA AAACACAAAA 1860 

AGAAAGGAAA CTCACATGAA CAGTTTACCA AATCATCACT TCCAAAACAA GTCTTTTTAC 1920 

CAACTATCTT TCGATGCAGG TCATTTAACC CAGTATGGTG GTCxTAtCi-I' TTri'CAGGAA 1980 

CTTTTTTCCC AGTTGAAACT AAAAGAGCGG ATTTCTAAGT ATTTAGTAAC GAATGACCAA 2040 

CGCCGCTACT GTCGTTATTC GGATTCAGAT ATCCTTGTCC AGTTCCTCTT TCAACTGTTA 2100 

ACAGGTTATG GAACGGACTA TGCTTGTAAA GAATTGTCAG CTGATGCCTA CTTTCCAAAA 2160 

TTGTTGGAAG GAGGGCAGCT TGCTTCACAG CCAACCTTAT CCCGTTTTCT TTCCAGAACT 2220 

GACGAGGAAA CAGTCCATAG TTTGCGATGC CTCAACCTTG AATgGkCGAA TTCTTTTTAc 2280 

AGTTTCACCA GCTAAACCAA CTCATTGTAG ATATCGATTC TACCCATTTC ACAAC 2335
(2) INFORMATION FOR SEQ ID NO: 276: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 752 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 276: 

CGGATTCACT GTTGTTGACT AATCAATAAC ACAGTAGAAA ATCTCACAGC AGTCTATTAG 60

TTGCTTTTCA TACTAGGCAA GTGACTGAGG CTTGTACTTG GGTACAGCAA GGGAGCTTAA 120

GGCCGTAGAA GAGAAAAATA GTAGACTGAA AACCCGCAAG ACTTCATCAT TTCGAGAAGT 180 

GACGTGGGAG ATGAAAATCG ATTGAACCAC TTACAAGGAG AATAGAAAAT GGCTAAAAAA 240 

AGCAAACAAC TTCGTGCTGC TCTTGAGAAA ATCGACAGCA CAAAAGCATA CAGTGTAGAA 300 

GAAGCTGTAG CACTTGCAAA AGAAACTAAC TTTGCAAAAT TTGATGCAAC TGTAGAAGTT 360 

GCTTACAACT TGAACATCGA CGTTAAAAAA GCTGACCAAC AAATCCGTGG AGCAATGGTA 420 

TTGCCAAACG GTACTGGTAA AACTTCACGT GTTCTTGTTT TCGCACGTGG TGCAAAAGCT 480

GAAGAAGCAA AAGCTGCTGG TGCAGACTTT GTTGGTGAAG ATGACCTTGT TGCTAAAATC 540 

AACGACGGTT GGTTGGACTT CGACGTAGtT ATCGCTACAC CTGATATGAT GGCTCTTGTT 600 

GGACGTCTTG GACGTGTCCT TGGACCACGT AACTTGATGC CAAACCCTAA AACTGGTACT 660 

GTAACAATGG ATGTTGGCAA AGCGGTTGAA GAGTCTAAAG GTGGTAAAAT CACTTACCGT 720 

GCTGACCGTG CAGGTAACGT TCAAGCAATC AT 752 
(2) INFORMATION FOR SEQ ID NO: 277: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2643 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 277: 

GTCAACATTG ATTTCAAGGC TGTTTGCTTT CTATCTCCCC TTTTTCATAA TGTATAATAA 60 

AATGAAATAA TAACAGGACG AATTGATCGG GACAGTCAAA TCGATTTCTA ACAATGTTTT 120 

AGAAGTAGAG GTGTACTATT CTAGTTTCAA TCTACTATAT TTTCGTACAG GTGCTTCAAC 180 

CATTTGAACG ATTTCAAATC CTTCTTTTTG GTAAAGATTC TGAGCTCTTT GATTTGCCTC 240 

GAAGACATTT AGAGAAATAC TGTCTATATC TCTATTTTCA AATGCTAAAC TAACAAATTT 300 

CCTTAAAGCC TTGCTACCTA AGCCTTGCTC CTGTTTCTGG GGGTTGATAA AAAATCTCCC 360 

GATATGAAGA TTGCTGTCTT CTAGCCTGAT TTTCTGGATA AATCCCACAA ACTCTTGTTC 420

ATCAAAGATT GAAAAGACTC CTTCCAAGGC TTGAAGTGTC AGTAGAAAAG GAATCCTTGG 480 

TCCCATCCAT TGTTCTTGAA AGGATTTGCC TAGGGAGTTG GACCACTGGC ATACAAATTG 540 

AGCGTTTTCT GTGCTCACCT TTTCTTCAAA ACGAATTGTC ATCTTTTCCT CACCACCTTA 600 

TCTATGTTTC TCCATTATAC TATTTCTCCC ATTTTTTACG AATAGATAAG TATGATTGAT 660

TTTTATTTTT TTCTCGTCGG GAGCATTCTA GCTTCCTTTC TTGGTTTGGT CATTGACCGT 720

TTTCCAGAGC AATCCATTAT CAGTTCAGCC AGTCACTGCG ATTCCTGTCA GACTCCCTTG 780 

CGTCCCTTAG ATTTGATTCC GATTCTCTCA CAGGTCTTCA ATCGCTTTCG CTGTCGCTAC 840 

TGCAAAGTTC GCTATCCTGT CTGGTATGCC CTCTTTGAAT TAAGCTTAGG ACTCCTCTTT 900 

CTGCTTTACT CTTGGGGATG GCTCTCCTTG GGGCAAGTCG TCCTAATCAC CGCTGGTTTG 960 

ACCTTGGGTA TCTACGACTT TCACCATCAG GAATATCCCT TACTGGTCTG GATGACTTTC 1020 

CAGCTAATCC TAATAGCTTC CTCTGGCTGG AATCTGGTCA TGGTCTCCTT CCTCATACTT 1080 

GGAATTTTGG CTCATTTTAT CGATATCCGC ATGGGTGCAG GGGATTTCCT CTTTTTAGCT 1140 

TCTTGTGCTC TCGTCTTTAG CGTAACGGAG TTACTGATCT TGATTCAGTT CGCTTCTGCG 1200 

ACGGGTATCC TGGCCTTTCT CCTGCAAAAG AAAAAGGAAA GACTTCCTTT CGTGCCTTTC 1260 

CTCTTACTTG CTACTTGTTT GATTATTTTT GGTAAGCTAC TGCTTGTCTG ATAAAATCCA 1320

ATTTCTGCCA TATATCCTTC ATGAAATTAT TTCACAGTTA AATTATAAAT TATTTCTTTT 1380

GTACAAAGGG ATGATGTTAT CAAATCGATC TGTTCTTCTA TCTTCTTGAT ACTGATCAAA 1440 

AAATTTCATT TCGACTGAAA ATATTTCGCT TATAAACTGT AAACGAATAC TTTGTTTAGA 1500 

CATTATAGTC GCTAGACTGA CTAGATGATT ACTCAAAACG ACGTCCAGAA TACTCTTTAC 1560 

TTTGCTTGGT TTTTTAACAA AAATTTGATC ATCCAAGGGT TCAATCATTT TGTAACCTTT 1620 

TTGCGCAATT TGACGATAAA AGTAAGAA.TG TTGCTTTGGA GTC7iATAATC CTAACT^AAA 1680 

AGCTCGATAC TCTAAAGCCT GTATCGAAAC ATTCAAATCC GACTTCAATA AAATATAACT 1740 

ATCAGGATTG CTGACACGCT TGCCAACCCT CTCTTCAAAT TTGACTAAAA ACTCTTCTTT 1800 

TGGCAATAAA AAACATGATG CAAAATAATT TGCTTCTTGC TCCAAACGAT CGCCATCTTC 1860

ATTCATATCT TTATATTTAT GTAAAAGAAT ATGTCCTAGC TCATGAGCTA AGTCAAAATT 1920 

TCGACGTACA GATGATTTAT TCGTTCCTAA CACAATATAA GGTCTTCCCA ATTTTGACCA 1980 

TGCGCTATAA GCATCAGCTT GGCCATTAAT TAATCGTTCC ACGATATAGA TGCCTGAACG 2040

TTCTAATTTA TAAAGCAAAT CATGATTATC TTTTGAAATA CCTAATTTTT CCCTGGCATA 2100

AAGAGCCAAT TCCTCAATGG ATTCTCCCTT ATGATAAGAT TCACTCACTA CATTACTTAG 2160 

GTCATGAATT ATAATATTAG GTATAATTAC AAAACTTTCA AAATAATCAA TCAAACTATC 2220 

TACCTTATGT AAATACATAG TTTGAATATC TATTGTTTTC CGTGTTGCTA GGTCTGCATT 2280 

TCTAAAGGCA ATTACAGAAG AATCAAATCG AATGCTCTCT TCTTCCTGTT CAAAATAAGT 2340 

TAAATCAACA TGAAATTGGT TGGCCAAATG CATTTTGGTT GATAATTTAG GTTTCGTTTC 2400 

GTTGGACTCA AACTGCCAAA TGGCTTGTTC CGTTAAATTA ATTCTCTGAG CTAATTCTGC 2460

TCTACTTAAA CCATTTAACA GCCGTAATTC TTTCAATACC CGACCATTAA ACATTTACAT 2520

ACTCCTTACT ACTTTTGACC TTCTTGTTTT TCTATTCTTG GAATAATTTC AAAATCTTCT 2580

GTTTCCGATA ATTCTGAAAA ATTAGGAATA TCTTGATATT TAGCTTCTTC GAAATGGTAC 2640

GGG 2643 



(2) INFORMATION FOR SEQ ID NO: 278: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 582 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 278: 

TGACCAGTGG CAAAATGGCT ATCCAAATGC AGATGTTATT ATCGATGATA TCATCTCAGG 60 

GCAAGCCTAC GTAGCCTTGG AAGAGGGAGA ACTGCTAGCC TATGCTGCTG TGACCAAGAG 120 

TCCAGAGGAG GCCTATGAAG CTATTTATGA GGGAAACTGG CAAGCTGGAG AGTCAGAGTA 180 

TCTAGTCTTT CACCGTATTG CTGTGGCAGC AGATGTGCAG GGAAaAGGAG TTGCTCAAAC 240 

CTTCTTAGAG GGCTTGATTG AAGGTTTTGA TTATCTTGAT TTTCGCTCAG ATACGCATGC 300 

TGAAAACAAG GTTATGCAAC ATATTTTTGA AAAACTTGGT TTTAAACAAG TCGGTAAGAT 360 

GCCAGTAGAT GGCGAACGCT TGGCCTATCA AAAATTAAAG AAATAATGCA AAAGAAGTAT 420

GTAAAAATCC TCTACTCCTC ACCAATTGGT ATTCTATCAC TTGTAGCTGA TGACCATTAT 480 

TTGTATGGAA TTTGGGTTCA GGAGCAGAAG CATTTTGAGA GGGGACTAGG AGATGAAACG 540 

ATAGAAGAAG TTGTwAGTCA TCCTATTTTA GACCCAGTTA TT 582 



(2) INFORMATION FOR SEQ ID NO: 279: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 554 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 279: 

CCCAAGCTAC TAAGAGACTA AAACTTGCTA dAGAAGCAAG AGAAAGTGTG AATCTTTTTA 60 

ATTTCATGAT GAATTTCCTT TCTGCTACCA ATTTAGAGAA ATTTTCTCTA ACCAGCAATT 120 

CCCCTAGTAT AACAAGTTCA AAAAATGGAG TCAATTTATC TGCTCACGGT CCAGCAGGTA 180

GCCCCGTACT TCTGAGATAA AATAGAGAGA CCCTGTAACG AACAGCAAGT CTTGAGCGTC 240

TGCCCTTTCT TCAAAATCGC TGATAAATTC TCGGTAAGAA GAAACTATAT CGTAACCTGT 300

CACATCCCTT TCGTCCAAAG CCCCCTGATA GTCAAAGCCG GTCACCTTGA GTTCCACCTG 360

AGGCAATTTT TCAGTCAGAT AACCCAACAT CCCTTGATAA TCCTTACGTT TCAAGGATCC 420

AAAGAGGATT TGAGGTCGAT AGCCTTCCTG CTCTTTTTCT TTGATAAACT CAGCCAAGCG 480

AGTCAAGGCA GGGAGGTTAT GAGCACCATC CAAATAAATC TGTGGGCGAA TACGCTCCAA 540

GCGAsCAGCC CAAT 554

(2) INFORMATION FOR SEQ ID NO: 280:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 766 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 280: 

CCGGTTTTTC AAATGAATTT CTTGGTTGTG GCTAAAAAAT ATGCTACACT ATCAATATGA 60

AAATTTTAAT CCCAACAGCA AAAGAAATGA ACACAGACTT CCCAAGTATC GAGGCAATTC 120

CTTTAAAACC AGAAAGTCAG GCCGTGCTTG ATGCCTTGGC TCTCTATTCT GCCAGTCAAT 180

TGGAGAGTTT CTAC?J^.GGTA TCAGCTGAGA AAGCGGCGGA AGAA'i-rrCAA AATATCCAAG 240

CTTTGAAAAG GCAAACTGCT CAACACTATC CAGCCTTGAA ACTTTTTGAT GGGCTTATGT 300

ACCGCAACAT TAAGAGAGAT AAGCTGACCG AGGCGGAACA AGATTATCTT GAAAATCATG 360

TTTTCATTAC CTCGGCTTTG TACGGTGTTG TTCCAGTCTT GTCACCCATG GCTCCTCACC 420

GTTTGGATTT TTTGATGAAA TTAAAAGTCG CTGGTAAGAC TTTGAAGAGC CATTGGAAGG 480

CAGCCTATGA TGAAACTCTG AAGAAGGAAG AAGTGATTTT CTCTCTCTTG TCATCAGAGT 540

TTGAGACTGT ATTTTCTAAG GAAATCAGAG CAAAGATGGT GACCTTCAAA TTCATGGAGG 600

ATAGAGGCGG TCAGCTGAAG ATTCACTCAA CTATCTCCAA GAAAGCGCGC GGGGCCTTTC 660

TAACAGCTTT AATAGAAAAT CAAGTACAAA CTGTGGGGGa AGCACGTCGC TTGAACTTTG 720

CTGGATTTGT TTACCGAGAA GATTTGTCAC AACCACAGGG GGATGG 766

(2) INFORMATION FOR SEQ ID NO: 281: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 901 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 281:

CCGGCCACGG TTCCATCCAA CTTCACAGGT GTGCACTTGA TTGTGTATGT AATTGTCACT 60 

AACGGTAGAA TTTCACCTAT CCCTCCTATC TGCTCGCAGT ACCCGCAGAC TTTCTGAAAG 120 

AAGAAGATAA CCTACTTATC CGTTGCTATG ATTATACTAA AGTTTCTACT TTTTTGCAAA 180 

TAGATTTTTA AATTTTTGGC TAATTGTCTG AATCAGGGTC GGAAGTTTGA CGACCTTGTC 240 

ATTGCCTAGT TTTTCGCGTG CAATTTTGAG AATGGCACCT GAGTCTTTTG AAGCAAAGAG 300 

GAATTTTCCT TTGTCTGTAA AGACTTCGAA GTGGCGGCTG ATTTTGCGTC CAGTGACATT 360 

GGCTCCAATC TGATTGATAT GGCTCCAAGG AATCTGGATA AATTGTTCGA CATTGACATC 420 

TGGGTAAAAT TCCAAAGCCT GATCTCCGAC AAGGAATTTC CCAACTTTCC CAGCGATAGA 480 

GAGGTAGGAA GTGCCTGTCG TACTGAGGAG TACTGTTTTG TTAAGTGATT GGGCCATGCT 540 

TAGTCTTCCT TACTTTCTCC AAAAAAGGCA TTGTAGAGGG CTTTAATTGC TGCTTTCTCT 600 

TGGTCTTTAT TGACAACAAA CATAATAGAA ACTTCACTAG AACCTTGAGA CATCATCTGG 660 

ATGTTGATTT TGTTTTCAGA TAGAGCGCGT GTCGCAGTAG CAGTCACTCC GATATGGCTC 720 

TTCATTTTTT CACCAACAAT CATAATGATA GAAAGGTCGT GTTCGATTTC TGCATGATCT 780 

ACTTTAGCCT TTTGAACCAA CTGACGCAGG ATTTCTTCTT CCTTGATGGG AGTTAGTTGG 840 

CGAGAACGGA GAATGATAGA AAGAwCGTCG ATACCTGTTG GCATATGTTC CCAACCGATG 900 



T 901 
(2) INFORMATION FOR SEQ ID NO: 282: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1765 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 282: 

CCCTGTTACG TGGATAATAG GGTAAGACTG CTCAGGATTT CCTAACAAAT CCACCGCTTG 60 

CTGCATTCGA CCCAAACCTG ATCGAAAATT CAAACCAATC CGACTATGGA GCCATTCTTC 120 

TACTTCAAAC ATACACATCT CCTTGACAAA AGTCCAATCA ATTATCGCAT TAAAGTATGG 180 

TTACTAATAA AAACAAGGCC AGGATTTTCG TCCCGACCTC TTACCTGGTT AGCTAATAAC 240 

TAGCTACTAT GAATGTGAAT ATGGGCTAAA AACATCCACT GGACGTTCCA ACTCTTCCCC 300

ATTTCTGGGA GTTGGGGTAA AAATGTTCAC TGGACGTTCC AACTCTTCCC CATTTCTGGG 360

AGTTGGGCTG ATACAGTCTC CCAGACTGTA TCACTCCTCC ATAAAGCTGT TGAAGACTTC 420

TTCAATCATG TTCCATTCGT CTTCTGAGTC TTCTGGGATT GGTTGCAATT CGCCTTCTGT 480

TCCATCTTCG TTTTCGATGA ATGAGTAAGC TTGGATTTCA ACTTGTCCGT CTTCGTCTTC 540

TTCTGCGTTA ACTGGTACTA GAAGAACATA GTTTTTACCA AATTCTTCTT TTCCATCAAT 600

TGTCAAAAGG ATTTCAAACA AGGTTTCATT TCCTTGCTCA TCTACTAGTG TGATTAGTTC 660

ACGTTCTTCG TGGTCGTGGT TATGATCGTG TGACATAGCC TCGCCTTTAT ATTAAAATTT 720

TCTATCTAAA TAATTTTGTA AAATCAGCTG AGCTGCTAAC TTATCAATGA CTTTCTTGCG 780

CTTATTGCGA CTGATATCTG CTTGTTCAAT CAACATGCGC TCAGCAGCCA CTGTTGTCAA 840

GCGTTCATCC TGATAGTCTA CTGGTAAACC AAAAAACTCT TCTAGCTTTG CTCCGTAGCT 900

TGACTAGCTT CTACGCGCGG TCCACTTGTA TTGTTCATGT TTTTAGGCAA GCCCACTACA 960

AATCGTTCCA CCTTGTAAGT ATCAACCAAT TCCTTAACGC GGTCAAAACC AAATTGGCCT 1020

TGTTCTTCAT TTATCTGGAT GATTTCAAGC CCTTGAGCTG TAAAACCAAG CGGATCGCTA 1080

ATCGCCACCC CTACCGTTTT TGAACCGACG TCCAATCCCA TAATTCTCAT AGGTTATAGA 1140

TCGACTCCTT GTCCTTTGAG GTAGTAGCGA ACCAATTCCT CAACGATTTC ATCACGCTCA 1200

TACTTACGGA TTTGATTTCG TGCATTATTA TAACGAGGAA CGTAGGCAGG GTCTCCACTC 1260

AATACGTAAC CTACGATTTG GTTAATTGGG TTCxAsGOCT" TATCGTTCAA L: AT AA 1320

ACATCTGTCA AAGTTTCGCT AATTTCTTTT TTATTGGAAT CGTCCAATTT AAAACGTACT 1380

GTTTCTTCAG TAAATCCCAT TCTAACACCC TCTTTCCTTA GAATAGTACC ATTATAGCAT 1440

AATTCCTTAC CTTCTACAAT TCAGGCAGTC TATTTATTTG GATTTTCTAT TGTTCTGTCG 1500

CGCCATTTGC CAATCTATCT GAAATATATT TGCTTGGTTC ATTTTTCAAA AGATTTTCCA 1560

AACCAATATT CTTCAGATGT TCCAACTGGG AAGCCTTCTT GACATCCAGA ACTTGAAAAT 1620

CAAAACTAGT CGTTGTTTGA AGTTCCGTTG CGCTCAATAG TTTTGTTTCA AGTTTGAAAC 1680

CTGCCAATTT ACGAGCTTCA ATGATAGACT TATCCTTCTC CTCCGCTTCA AGAAGAGCTT 1740

TTTGAGTTTC CTCCACTCCA TGTTG 1765


(2) INFORMATION FOR SEQ ID NO: 283: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1346 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 283:

CTTATCCATT CACTTTCTTG TCTGTTATTC TATAAATCTT ACTCCTAAGT ATACCACATT 60 

TGCCCCTAGA TGTGAACGAG AGAAACGCTC TAGACATTGC CAAGAAGGAA AAAAAAGGGT 120 

ACAATGTAAC AAAATCAAGG GAGGTCTGGA ATGAAGAAAC AAAGCAAGTA CAAAGAGGTC 180

GTTTCCTATC TGAAAAATGG TATCGAGTCT GGACGATTTC CGACGGGTAG TCGCCTGCCT 240 

TCTATCCGTC AACTGAGCCT TGACTTTCAC TGCAGCAAGG ACACCATTCA ACGAGCCCTG 300

CTGGAATTAC GGCACGAACA ATACCTCTAT GCCAAGCCTC AGAGTGGCTA CTATGTATTA 360 

GAACAAGGGC AACATCAAGA CCTAGAAATC GAGGTTACCG ACGAACATGC CAGTGCCTAT 420 

GACGATTTCC GACTCTGTGT CAATGAAACC TTGATTGGCC GAGAAAACTA CCTCTTCAAC 480 

TACTATGACA ATCAAGAAGG ATTAGAAGAC CTAAGACAGT CCATTCACAA ACTCCTCTTT 540 

GAGCAAGCTC TCTACTGCAA GGCTAACCAA CTAGTACTGA CTTCTGGAAC CCAACAAGCC 600 

TTGTTTATCC TCTCTCAAAT ATCCTTTCCT AGACAAGCCA AGGAAATCTT GGTGGAACAG 660 

CCAACCTACC ATCGGATGAA TCGCCTCTTG ATTGCACAGG GGCTGGACTA TCAAACGATT 720 

GAACGAGGCA TTGATGGGAT TGACTTGGAG GAGCTGGAAG GCCACTTCAA AACAGGAAAA 780 

ATTAAGTTTT TCTACACCAT TCCCCGATTT CACTATCCCC TGGGACATTC CTATTCTGAG 840 

CAAGACAAAC GATCTATTCT TAACTTAGCT GCCAAGTATG ATGTCTATAT CGTAGAGGAC 900 

GATTATCTGG GTGATTTGGA CTCCAAGAAG GGCCAAACCT TCCACTATCT TGATACAGAG 960 

GAGCGTGTCA TTTATATCAA GTCCTTCTCG ACCAGCCTTT TTCCTGCCCT TCGTATTACA 1020 

GCACTCATTC TTCCAAATGC TATCAAAGAA GCATTTGTGG CCTACAAAAA TATCCTAGAC 1080 

TACGACAGCA ACCTCATTAT GCAAAAGGCC CTGTCACTCT ATATTGACAG TCAATTGTTT 1140 

GAAAAAAATC GTTTGGCTCG CTTGACCAAT CATGAATCTT ACCAAAAACA AATCGAGGAA 1200

AGGATAACTA AAACACCTTG TCCCCTTCCT CATTATTCCC TACACGATGG yTTATTGCTA 1260

GACCTGAGAC AGTATCCTAA AATCGCCAGT CTCAAACACA GTCAACTGGG cTTGGACTTC 1320

TTTGAAGAGG CCTATTTAAG CACCTG 1346
(2) INFORMATION FOR SEQ ID NO: 284: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 900 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 284:

CTATATTCAG AATATGCCAA AAATTCGGAA TGGTATAAAT TTGCGGAGGG TTCATTTGAC 60 

ATATTTAGAA AACTCCCCCA AAGAATTAAT TTTAAGAAAG ATTTTTCTAG AATTTTGGCC 120 

CCCTTTATTA TTAATTTGCT TAAATTAATC AATAATTATC TAGAGAATAA AGAATACGAG 180 

TGGATTGACA AGAATGGAAA TATTTTTTCC TCTCTAGTAT TTTATTTAGA AGATTTAATC 240 

TATCCTTGGA TTGTTAAACC TTTGGTTTTA GAGATAAATT CATTGCGTGA AAAAGGTTTA 300 

CTTGAAGGGG AATCGGAGCA GCAACGGTAC AAATATTTTA TAACATTGTT TGACAAGGAA 360

GAGAATATAT TAAATTTTTA TAACAAATAT CCCGTTTTAC TGAGGCAAAT ATCGGAGTCT 420 

TGTCTTCGGT TCTATACTTA TTTTATAGAA ATTTTATCAA ATTTAGAAAA TG/CTTTTAGT 480 

GTGCTAGAAG AAGAATTAGG GCTAAGGGGG AAATTAAATG ATATAAAATT TGGAAAGGGT 540 

GATACACACA GCCAAGGAAA AACTGTTTTG ATACTCTTCT TTGATGACGC GAAAATTGTT 600 

TACAAGCCTA AAAATTTAAT AATCAATAAC TCACTAAATA CTATTGCTGA GTATATCCGA 660 

AAGGTTGATG AAAAAATTAG GATAAGAATA CCTCGAACTA TTGCTTATTC GGATCACAGC 720 

TATGAAGAAT TTATTGATTA TCTACCTCTA GAGCAAAAGA AAAATTTACC TGAATATTAT 780 

TATAATTTTG GTGTGCTTTT AGCATTTATA TATTTATTTA ATGGGAGTGA TATACATTTT 840 

GAAAATTTAA TTTCCTATGG AGATATGCCT GTAATAATAG ACTTTGAAAC AATGTTACGG 900 



(2) INFORMATION FOR SEQ ID NO: 285:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 862 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 285:

TTATTTAGCA GAGGCAGTTT TAAATGTGAA GGATTTGGTC AGTCAAACAG TTTTTTATCA 60 

GCAGATTATT GGTTTAGAAA TCCTATCTCA AACGGATACA GAGGTCGTTC TGGGACTTGG 120 

AGGAAAAGCC TTGGTACACT TGATTCAAGC ACAAGAGGGT GGAGAAGTAA GGGAACATTA 180 

TGGTCTTTAC CATCTGGCTA TTCTTTTGCC GACACGAAAG GCTTTGGCGG ATGTCTTGAA 240 

GCACCTGACG GATTTACAGA TTCCTCTTGT TGGCGGTGCA GATCACGGTT ACAGTGAGGC 300 

CCTTTACTTA GAGGACTTGG AGGGAAATGG CATTGAACTC TATCGAGATA AGCCAGTTTC 360 

CACATGGGAT ATTCGAGAAG ATGGACGTAT TATCGGGGTG ACTGAAGTCC TTGCGGCTCA 420 

GGATATCTAT GAGTTGGGGG AAAGAGTAGA GCCTTTTATC CTAGCAGAGG GTACGAGAAT 480

GGGGCATATT CATCTTTCTG TCAAGGATAG TCGAAAGTCC AGACAGTTTT ATCAAACGGT 540

GTTAGGGCTC GAGGATAAAT TCAGTGTGCC TAGTGCTAGT TGGATCGCAG CTGGGGACTA 600 

CCATCATCAT TTAGCAGTCA ACGAATGGGG AGGAAAAGGT CTGGATCCGC GTAAACAAGT 660 

CCTACCAGGT TTAGCCTACT ATGTCATCGA AGTCGCACAT AAAGAAGAAC TGTTAACGAT 720 

TGCCCAACGA GCACAAGAAG TTGACGCACC AATCAAATGG ATGACATCGA TCCAATTGGA 780 

AATCACAGAC TCAGATGGCA TCGTGACCCG TATTCGTTTA GCTAGATAGA TGGTATGTGA 840 

TGAAGGTAGA GCATCAATTG TA 862 



(2) INFORMATION FOR SEQ ID NO: 286: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 650 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 286: 

TCGTTTACAA GATCGCTAAA ATGCATCTCA TGATCGCGAC CACGAATTCC AAGATAGCAC 60 

GCGCTACCTC AATCATAGAT AGTTCACTTT TTTCTTGCCC AGCAAATACT TCTAATTCCA 120 

AAGCGTTTCT CCTCATTTAT ACTACTATCG CCAGAGCGAA CAGACTCTGA CCTCATTTTA 180 

TCATTTACTC TTTATTTTAC GATAATTTTG CGGAATAGTC AAAGGTTAAG GGGGAGAAAG 240 

TGGCAGGATT AGACTAATTC CAATATAAAA CTCATTCCTT TTTCTGTTGC TCCATTTTCC 300 

ACAAATCCAA GCGACTTGAA ACACCTCCTA GAAGCATGAT TGTAGGTGTA GATTTTCTTG 360 

ACTCTCAATT CTTTCCATCC TTTTACTCGA GCCAATTCAA TCAAAGCACT TAGAATCTTT 420

TTTCCAAGTC CTCGATGTTG GTAAGCGGAA TTCCCAATCA CAATGGGGAG ATTATCCTGA 480

GATAGTGTAA TATCCCCAAT TGGAAACCAT TCTCCCTTCT CCTTGACTTC AATCCAAAAA 540

AGCTCACCAT GCCGATyCAr ATAGGAATAC ATGGCTTCCA AGGTCGcTtG ACTGTAAGGA 600 

AGCTTCACCC CATCTACGAG GtAAcCAAGT TCACATCCGT GATACCAAGC 650 



(2) INFORMATION FOR SEQ ID NO: 287: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1119 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 287:

GATAGCAATC CGCTTCAGAA ACTTCTCGCT TACCTCTAAC TCCGATCGCT AGTTTGGGAG 60 

AAGATACTTC CATTCTCATA CTATCTGTTG GCTTTGCAGG CTGTAAAAAC AACTTTTCTC 120 

TTGCTACTTC CTGAAAATCT GAATCTTGCA GTTCTTTGCT TTCAAAATAG TCCTGTACTC 180 

GCTCCACATC AAAATTCCCA GCTAAAGACA GAGACATGTT TACAGGTTTG TAAAACTTTG 240 

TAAAATTTTC TTGCAAATTA GTTAGATTGA TTTGGGAAAT GGACTCCTCA CTTCCAACTA 300 

TATCAGTTGC TAAAGGTGTA CCAGGATACA AATTCGCTAA AGTTGAAAAG AATAAACACG 360

AATCTGGATC ATCTTGGTAC ATTTCTCGTT CTTGCTGAAT AATATCCTGC TCTGTCAGAA 420 

TGGAAGCTTC AGTAAAGTGT GCTGATGTTA CCAATTCATC AAGTAAATCT AAATTTTCTA 480 

AAAAATAATC CGTTGCTGAA AAAAGATAGT TTGTTTTTGT AAAGCTTGTA AAGGCATTAC 540 

TATCTGCACC TAGACTCGTA AAAGCCGACA TCAAATCACT AGAATCTTCT CTCTCAAATA 600 

ATTTATGTTC AAGAAAATGA GCAATTCCTC CAGGATATTG TTTTACATCT CCGTCAACTT 660 

CTGTGACAAA CGTATCTACC GAACCAAACT GTACAGTGAC ACTCCCGTAA ACCTCTTTAA 720 

ATTCCTTTTT AGGCAAAAGA GCAACTGTCA ATCCGTTGGC CAAACGAGTT CGATAAACCA 780 

TTTCTTTTAC AGCTGGATAG TATTTTTCTT CAAAAACAAC CTTTGTCATT CTATTCCTTC 840 

CATAAAGTAA ATCGCTTGTA GTTTCACATT ATTAGCTACT CTACAAATAG CATCTTTGTC 900 

AATTTGTTCA AGCTTTGCAA TCCAAC'TT"rr AAAUTCTGCT GAAGATTTTC CAAATAAGGC 960 

ATTTTGATAA GCACGTTCAA TCAATGAAGA ATGATTATCT TGAGAAAGTA ACAACGACCA 1020 

ACGAATCATT TCCTTGGTCT GATTTAACTC AAACTCTGTA AAAAAACCTT TTTTTAAATC 1080 

AAGCCGTTGA TTATTCATCA ATTTACGAGC CTGGTTACG 1119 



(2) INFORMATION FOR SEQ ID NO: 288: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 540 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 288:

ACGCCCTCGC GGGGACATGA CGAATTCCCC GTTCATCACG AAGGCCGCCG AGGAGTGGGG 60 

GGTGCCGTCC AAGTCAAAAG CGGCCCCACA TCGATTCAGT TCCCCGACGA ACAGCCCTTT 120 

CCCCCAGCGT TCCTGGCTTT GCAACCGTTT CACAACAGCC TCGTAAAGTA GGCCGGACAA 180 

GGCAGACGGA CTCCAAAGGA GTTCTTCCAT CTGCAAGTGC GCCTGCGTTA TGTGATCCCG 240

GTCTTTTGCA TGTGTGTGGC ATGAATGCTG TTCCCAATCC CACTCCAGAA CATTCTCCTC 300

AAAAGTGCGC AACGTCGCCC TGAATGAATC CTGCCTTGTA GTCGTGACCA TTCCTATGAA 360

GGGTCGCAGA GGATTTTCCC CGAGTGCAAG CGCATCCTCC GGCTCAAATC GGGTGCATTT 420

CACAGTCCCG CTCAACGCTA GCCCGATCCC TTTTTGGCAT GGTGACTCAA GCGTCCTTTC 480

AAACAAAAGC TCCTCATCCG CTCCAACCGG CCCGACGTAG ACGCGTAGAC CGAAGTCGTC 540

(2) INFORMATION FOR SEQ ID NO: 289:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1949 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 289: 

AAAGAATTCG ACCAATTCAA GGTTGAGGCA TCGCAAACTA TGGACTGTTC CCCCGTCAGT 60 

TCTGGACAGA AAACGGGATA AGGTTGGCTG TGAAGCAAGC TGCCCTCCTA CCAACAATTT 120 

TGGAAAGTAG GCATCAGCTG ACAATTCTTT ACAAGCATAG TCCGTTCCAT AACCTGTTAA 180

CAGTTGAAAG AGGAACTGGA CAAGGATATC TGAATCCGAA TAACGACAGT AGCGGCGTTG 240 

GTCATTCGTT ACTAAATACT TAGAAATCCG CTCTTTTAGT TTCAACTGGG AAAAAAGTTC 300 

CTGAAAAAAG ATAAGACCAC CATACTGGGT TAAATGACCT CCATCGAAAG ATAGTTGGTA 360 

AAAAGACTTG TTTTGGAAGT GATGATTTGG TAAACTGTTC ATGTGAGTTT CCTTTCTTTT 420 

TGTGTTTTTT TCTACACTTA TACCATAAAG GGGAAACTCT TTTTTGTCTA GTAAAAAACA 480 

CCCATTGGGT GAAAAAAGAA ACCATCCAGG ATCTAAGCTA AGGCAAGGAT TCTGGATGGT 540

TTTTAGATTT GGGGTGAATA ATTGGGGTTT TACAATATCA ACTCCCATGA TAGTCATGAG 600

ATGACTCTTC ACGAATTGAC GTGATGACTG TCCTTCCTTT TGCATAATTA CCTCCGAAAC 660

ACAAAAAAAG GGGTAGACAA TCTAGTGTCT ACCCCCGAAA GTTTATTAAA ACAAAAATCC 720

TGCCAAAGAA TTTTTGGCAG GAAACCAAAT CAATTTATCA GTTTCTATCA ATCGCTTATC 780

GCTCTCAAAG ACTGGTAAAT AGGGATTCCG CAATCAAATT GCGATACTCT ATTATTTAAG 840

AGTAACTGAA GCTCCAGCTT CTTCCAATTT AGCTTTGATT TCTTCAGCTT CTGCAGTTGC 900 

AACGCCTTCT TTAACAAGTG CTGGTGCACC GTCAACAAGT TCTTTAGCTT CTTTAAGACC 960 

AAGACCAGTG ATTTCACGTA CAACTTTGAT AACGCCAACT TTTTTGTCGC CTGCAGATGT 1020 

CAATTCAACG TCGAATGAAT CTTTAGCAGC ACCAGCATCA GCTGCATCAG CTGCAGCAAC 1080

AGCTACAGGA GCAGCTGCAG TTACACCAAA TTCTTCTTCG ATAGCTTTTA CAAGGTCGTT 1140

CAATTCAAGG ATTGAAGCTT CTTTAATTTC AGCAATAATG TTTTCAATGT TCAATGCCAT 1200 

TGTTATTTCC TCCAAATAAG TTTTAAATTT TATAATAGTT TTTTTCGTAG CTAGksTACG 1260

CTGTGTAGCT TAAGATTAAG CCGCGTCTTC TTTGCTTTCT GCAACCGCTT TGACTGCAAG 1320 

AGCAACGTTG CGCACTGGCG CTTGAAGTAC AGAAAGGAGC ATAGAAAGAA GTCCTTCGCG 1380 

GTTTGGAAGA GTTGCAAGTG CAAGAATCTC TTCTTTAGAT GCGACAGCGC CTTCGATTGC 1440 

ACCACCTTTA ATTTCAAGTG CTTCAGCGTT TTTAGAAAAG TCGTTCAAGA TTTTCGCTGG 1500 

TGCGATAACA TCTTCATTAG AAAATGCTAC TGCAGATGGT CCAACAAATA CAGATGCAAG 1560 

ATCTTCAAGA CCAGCTTTTT CAGCTGCACG ACGCAAGATT GAGTTTTTAA TAACTTTATA 1620 

CTCAACTTCG CTTCCACGAA GCTCACGACG AAGAACTGTA TCTTGCTCAA CTGTCAAACC 1680 

ACGAGCGTCT ACAACGACGA TAGATGCAGC AGCTTTCATT TTTTCAGCTA tACGTCAACT 1740 

AGTTCCGCTT TTTTAGCAAT AATTGCTTCA CTCATTAGTG TGTTCACCTC CGTAATTATT 1800

TTGCTTGGGG AATTTTTCAA AAAGAAAAAC GCGCCCAATC CTAGACACGA AAGTACAATA 1860 

CGCTTCTTTT TACATGATAC GTTTTGTCCT CGGTAGGATA TTTATGAGTC GAGCTCCCCT 1920 

ACTGTCTTAG GCAGTTTTTT TAGATACGG 1949
(2) INFORMATION FOR SEQ ID NO: 290:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1023 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 290:

GGACTGTTTG ATCTTATACA GTAGCTGCTT GATCCAAGCT TTCACCGATA GCGGCTAGGC 60 

GCTCGATAAC TTCAGCTTGT GTCAATTCAT TTTTTGAAAC ATAGCGGTTA CGTGGGTG.^ 120

CACGGCACTC GTGTGAGCAT CCACGAAGGT ACTTGTCTTC ATTTTCTTCT GATGTCAAGA 180 

TACGACGGTT ACAGAATGGA TTTCCACAGT TGACATAACG TTCACATGGT GTTCCATCAA 240 

ACCAGTCTTT CCCTACGATA GTTGGGTTGA CATGGTTGAC ATCAACGGCA ATACGCTCGT 300 

CAAAGACGTA CATTTTCCCA TCCCAAAGCT CACCTTGAAC TTCTGGGTCT TTACCGTAAG 360 

TTGCGATTCC TCCGTGCAAT TGGCCGACAT CTTTGTAGCC TTCACGGACC ATCCAGCCTG 420 

AGAATTTCTC ACAGCGAACG CCACCTGTAC AGTAAACCAC GACACGCTTG TCCATGAATT 480 

TTTCCTTGTT ATCACGGACC CATTGTGGTA ACTCACGGAA GTTGCGAATA TCTGGGCGAA 540 



WO 98/18931

PCT/US97/19588

1319



TAGCTCCACG GAAATGTCCT AGGTCGTACT CATAATCGTT ACGTGTGTCA AGGACAACGG 600

TATCTTTATC AAGAAGCGCT TCTTTGAACT CTTTTGGAGA CAAGTAAGCA CCTGTTGTTT 660

CAAGTGGGTT GATGTCATTG TCAAAGTCGT TGTCTTCCAA ACCAAGGTGG ACAATTTCTT 720

TCTTGTAGCG AACAAACATC TTCTTGAAGG CTTGTTCATT TTCTTCGTCA ATCTTGAACC 780

AGAGTTCTTC CATTCCTGGA AGGCTGTGAA CGTAgTCCAT GTATTTTTGA GTTGTTTCAT 840

AGTCACCTGA AACTGTTCCG TTAATTCCCT CGTCAGCGAC TAGGATACGG CCTTTAAGGn 900

CGATTGATTT ACAGAAAGCC AAGTGGTCTG CAGCAAATTG CTCTGCATTT TCAATTGGAG 960

TATAAAGGTA GTAAAGTAAG ACACGAATAT CTTTTGkCaw AAGATTTGTA TCTCTTTATC 1020

TAT 1023



(2) INFORMATION FOR SEQ ID NO: 291: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3831 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 291: 

ACTATGAACA AGACCCAGAA AAAGTAGCCT TATTTCTTAA GAATTTTAAT AGTTTAAAGC 60 

ACCTAGCACC TGTTTAGATT GACGAAACAG GATTCGATAC TTATTTTTAT CGAGAATATG 120 

GTCGCTCATT AAAAGGTCAA TTAATAAGAG GCAAAGTATC TGGAAGAAGA TATCAGAGGA 180 

TTTCTTTGGT TGCAGGTCTA ACAAATGGTG AATTAATCGC TCCAATGACT TACGAAGAGA 240 

CGATGACGAG CGACTTTTTT GAAGCTTGGT TTCAGAATTT TCTCTTACCA ACATTAAACA 300 

CACCATCGGT TATTATTATG GATAATGTAA GATTCCATAG AATGGGGAAG CTAGAACTTT 360 

TATGCGAAGA GTTTGGGCAT AAACTTTTAC CTCTTCCTCC CTACTCGCCT GAGTACAATC 420 

CTATTGAGAA AACATGGGCT CATATCAAAA AGCACCTCAA AAAGGTATTA CCAAGTTGCA 480 

ATACCTTTTA CGAGGCTTTT TTATCCTGCT CTTGTTTCAA TTGACTATAT TAGAGGCGAG 540 

ACATTTTTCG GTTCTTTGTC AACTGTAGTG GGTTGAAGAA AGCGAAGATC TAGAAAGGAC 600 

AAATTTCGTC CTTTCTTTTT TGAAGTTTTC AAAGTTCCTA AAACCAAAGG CATTGTGCTT 660

GATAAGTTTG ATGAGATTAT TGGTGGCTTC CAGTTTGGCG TTGGAATAAG GTAATTGAAG 720 

GGCGTTGACG ATTTTCTCTT TATCTTTGAG GAAGGTTTTA AACAT^GTCT GAAACAGAGG 780 

TGGAAAAGCA AGAGCTGATA GAGATTATAG TGGTGTTTAA AGTCTTCGGA ATAGCTCAAA 840

AGTTTATCTA
TTATCACTCA 
TATTCATGGG 
GTTTATCAAA 
CCGGACTGTT 
GTGAAAATCA 
GTCGGTCATA 
AATTTGATGA 
GCACTTAAAA 
CAGAACCATA 
AAGGCGGTCT 
TGTCAACTTT 
AGTGAATGTG 
TATGTTACAA 
TGGTCCTACA 
TTATTTAGTG 
TGCTTATCGT 
TAATTTTAGT 
AGTAATTTTT 
CTTGATAAAA 
CCTGCCATGT 
GTTGTTGTAG 
GCGAATGCAG 
ATTGACGGTA 
ATATCTGTCA 
TATCTTCATG 
ATTTTGCCTT 
ATTGGTACGA 
ATTCGTATTG 
GCTGAAAAGA 



GAATTTCTTT 
GTTTCTGACT 
ATTTCGGATG 
GTCCTGAGCA 
TCAACsTCCT 
TTGTTCTTGC 
GAAGTCTTTT 
TTCTTCTTGA 
CGGCCTTTTC 
ATACCTATAT 
TTTTAGAACT 
TCCTATTTTT 
TAAAATTTTT 
AAAATTTATG 
TTTGGTGATA 
AGATATGGCA 
TTGTATCTTG 
AAATTTCCGA 
CTAAATCATT 
AGGCGATTTT 
ATGTGAGAAA 
GTACCCAATG 
AAGTGATTGC 
AGAAATTTAA 
TTGGGAATGG 
AGGAAGGTGT 
ATCATATCGA 
CAATTAAGGG 
CAGATCTTTT 
ATCGTCTTTT 



ATTAGTCAAG 
ATCTTGTTGA 
ATGGCTTGTG 
ATAAAGCTCA 
AGGACATAAT 
GAATGACAGT 
TAATTAGCTT 
CGATAGAAGT 
TAAGAAGAAT 
AAAAATATTA 
TTAATTGTTT 
ATCTTGTTGA 
TGTTAGAATA 
AGCAGATGGC 
ATTTTGACTG 
TTGGTTGTCG 
AAAAATTGGT 
ACTAATTTAC 
TTTTAATAGT 
TTATTATAAT 
GGAAGAGCCT 
GGGTGATGAA 
ACGTTACCAA 
GTTGCACTTG 
TATGGTTGTA 
AACAACTGAT 
GTTGGATCGC 
AATTGGTCCA 
AGATAAAGAT 
TGAAAAATTG 



TGCATACGAA 



ATGAGCTTCC 
TTCTGCTCTC 
TCTCCATCTC 
CTCAGGAAGA 
TGAAGTTGAA 
CTGAGCAATC 
CTCAGCGAGC 
TCTAGTTTGA 
TAGTTCTAAT 
GAAATTTAGG 
GGCTGGTATT 
AGTTTATAAA 
TAATTTCTAT 
GGAACATGTT 
TAAGGATTTT 



TCTTTTATGG 
TGGAAATAGC 
AAATTGTAAG 
GATGGCTCAG 
GGTAAAGGGA 
GGTGGTGATA 
ATTCCATCTG 
AATCCTAAAT 
AACTTGCGTA 
TTGCAAGAAG 
GCTTATATGG 
ATTTTCCGTG 
TATGACAGTA 



AAGTAGGGCG 
AGTAGCGCTT 
AAGAACAGTT 
CCGATTGAAA 
CGCGAAAAAT 
ATAGACAACT 
TTTTGGTTGA 
TCCATTTTTG 
ATTTTTTTAT 
AGGATTTACC 
TAGCAAATTT 
TTAACAATTC 
AAAGAAAAGG 
GATAGTATTG 
CATTTTAAAT 
ATTGTTTACC 
GGTTTTATTt 
AAAGATGATA 
AAATCTTTCT 
ATATAATTGC 
ACAAGATTAT 
AGATTACAGA 
ATGCTGGTCA 
GGATTTTCTT 
CTCTTGTAAA 
TTTCTGATCG 
AAGCTAAGGG 
ACAAGGCTGC 
AGCGTTTAGA 
AAGCGATTGT 



ataaaatcgt 
gatagccttg 

ATGATATTGA 
CAGTCACTCC 
CATGCTCAAA 

gatgatcaat 
tgatacaagg 
agcaatgata 
actagaaaat 
caaaagtttt 
gtttctattt 
aggaattgat 
agtatttgat 
aagaagagta 
ttttaattta 
attatcgtgt 
cttgttgagg 
gtaaatagct 
attgtttctt 
aggtgagagt 
gacttcagtt 
cttcctttca 
cacgattgtg 
ccctgaaaaa 
agagttgagc 
tgcgcatgtt 
cgacaataag 
tcgtgttgga 
acgtaacctt 
tttcgatgat 



900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640

ATTTTTGAAG AATATTACGA ATATGGTCAA CAAATCAAGA AATACGTGAT AGATACATCT 2700

GTTATCTTGA ATGATGCGCT TGATAATGGC AAACGTGTGC TTTTTGAAGG TGCACAAGGT 2760 

GTTATGCTAG ATATCGACCA AGGTACTTAT CCATTTGTTA CGTCATCAAA CCCTGTAGCT 2820 

GGTGGTGTGA CAATTGGTTC TGGTGTCGGT CCAAGCAAGA TTGACAAGGT TGTAGGTGTA 2880 

TGTAAAGCTT ATACGAGTCG TGTAGGAGAT GGTCCTTTCC CAACTGAGTT GTTTGATGAA 2940 

GTGGGAGAAC GTATCCGTGA AGTGGGTCAT GAATATGGTA CAACAACTGG TCGTCCACGT 3000 

CGTGTAGGTT GGTTTGACTC AGTTGTGATG CGTCATAGCC GTCGTGTTTC TGGTATTACT 3060 

AACCTTTCTT TGAACTCTAT TGATGTTTTG AGCGGTTTGG ATACTGTGAA AATCTGTGTG 3120 

GCCTATGATC TTGACGGTCA ACGTATTGAC TACTATCCAG CTAGTCTTGA ACAATTGAAA 3180 

CGTTGCAAGC CTATCTATGA AGAGTTGCCA GGTTGGTCAG AAGATATTAC CGGAGTTCGC 3240 

AATTTGGAAG ATCTTCCTGA GAATGCGCGT AACTATGTTC GTCGTGTGAG TGAATTGGTT 3300 

GGCGTTCGTA TTTCTACTTT CTCAGTAGGT CCTGGTCGTG AACAAACAAA TATTTTAGAA 3360 

AGTGTTTGGT CCTAAGAGAT TTTTAAGATT TGTTTAAGAT AGGTCGGGTA TACTATAGAC 3420 

GGTTACAAGA AGACCTCCTA ACTTGTTGTA ACAAATATCC TAAACTTTTC TTTTTCATAA 3480 

TAATCTCCCT ATAGAGTCAC CGCATTCGGT GGCTTTTTTT GTGTTGGGAT TCATGATATA 3540 

ATAATAAAAT CGATAAGTAG GAAAAGAGAA AAGAGATGTA TTATACGCTT GAAGAAAAAG 3600 

AAGTCTTTAT GAGGGAGGCT TTGAGAGAGG CTGAGATTGC TCTTGAACAC GATGAAATTC 3660

CAATTGGTTG TGTGATTGTC AAAGATGGGG AAATCATTGG TCGTGGGCAT AATGCGCGTG 3720 

AGGAATTACA GCGAGCGGTT ATGCATGCGG AAATTATGGC TATAGAGGAT GCGAACTTGA 3780 

GTGAGGAGAG TGCGCTTGCT GGATTGCACA CTTTTTGTGA CCATTGAACC G 3831 
(2) INFORMATION FOR SEQ ID NO: 292:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1441 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 292:

CCGCTGTTCC AACCGCAACA TACCATAGTC CGTACGGGAT TCGAACCCGT GTTACCGCCG 60 

TGAAAAGGCG GATGACTTAA CCCCTTGACC AACGGACCTG AGTTGTTATT TTCAACTCTT 120 

ACTATTATAC AGTCTTTTCA AACTTTGTCA ACTACTTTTT CTAATTTTTG TTTATTTTTT 180

CAACTTATAG TAAAAAAAGC CAGAATTATA CTGACTCTTC TATCGCTCAT TAAACTTAGA 240

AGCACGTTCT TTTCCCCACC AATAAGGGAT TAGTTCTGCG ACTTTAACTG TTTTTCTTAT 300 

ATTATAGTCC ATCATGAATT CTGCATCTTT ATTTTCAGCA TTAAGCTCTA AAAGGAATTC 360 

TCTACAAGCA CCGCAAGGCA TGGCTGAACT TCCACCATAA GGTGGrTTGT CTCGAAAGGC 420 

TAATACTTTC TTAACCTTAG TTTGTCCTGA AAATTGGTAC ATATTGAAGA GGGCCGCCCG 480 

TTCTGCGCAG AGATGGAAAA CACCACAGGT TCCCTCCATA CAGAATCCTG TAAATATTTG 540 

TCCATCTCCT GCTTCTACTG CAGCTACAAC ATGATTGGCA TAAACAAAGT CTGATACTTC 600 

ATGTGGATTG TATAGTTTCT GTGCTTCTTC GTACATCTTT TCCCAGATGT CCATTATTGT 660 

ATCCTCTTTA TTTAGAGATT TCTTTTAGCA TGTTTTCGAT ATGCTGAATT GATTTTTCAC 720 

GTCCAAGCAA GAAAATTGTA TCTGGTAATT CTGGCCCATG CATTTCGCCT GAAACTGCGA 780 

TACGAATAGG CATGAAAAGA TTTTTCCCTT TAATACCTGT TTCTTTTTGG ACTGCTTTAA 840

TTTGTGGGAA GATATTTTCT GTCACAAATT CATCATCTGT CATCGCTTCA AGTTTTGCTT 900 

TGAATGCTTC AAGAACTGTT GGAACTGTTT CACCCGTCAT GACTTCGCGC TCTGCTTCTG 960 

TCAATTCTGG GAAATCTGAG AAGAAAAGAT CTGTCAATGG GATAATCTCA TCTACTGATT 1020 

TCATTTGTGG TTTATAGAGC TCAACTAATT TTTCAGCCTT GTCAGTCAAA CGGCCTGCTT 1080 

CCTCTAAGAA TGGTTTTGCC ATTTCAAAGA TGGTTTCAAG GTCTGCATTC TTGATATAAT 1140 

CATTGCTCAT CCAGTrTAGT TTTTTCTGAT C.\AAGGCTGC TGGTGACTTG CTGaggcgGT 1200 

TTTCATCAAA AAGTTTAATG AATTCTTCAC GAGAGAAAAT CTCATCCCCA CCACCTGGGT 1260 

TCCAACCAAG AAGAGCAATA AAGTTAAAGA CTGCTTCTGG AAGGTAACCT TTCTTTCGGT 1320 

AATCTTCGAT AAATTGAAGT GTATTAGTAT CACGTTTAGA TAACTTCTTA CCAGTTTCAG 1380 

AGTTGATAAT CAAGTGTCAT GTGACCGAAC TCTGGAGCTT CCTCAACCTA AGAGCGGGTA 1440 
T 1441

(2) INFORMATION FOR SEQ ID NO: 293: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4398 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 293: 
CGGCTTATGT AGTGGCAATC TTTCTACGTA AGCGAAACGA GGGGAGATTA GAGGCGCTAG 60

AAGAAAAAAA AGAAGAACTA TACAATCTTC CAGTAAATGA TGAAGTAGAA GCTGTAAAAA 120

ATATGCACTT GATTGGACAA AGTCAAGTGG CTTTCCGTGA ATGGAATCAA AAATGGGTCG 180

ATTTATCTCT CAACTCTTTT GCCGATATTG AAAATAATCT CTTTGAAGCA GAAGGCTATA 240 

ACCATTCATT TCGTTTTCTC AAGGCCAGTC ATCAAATTGA CCAAATTGAG AGTCAAATTA 300 

CTTTGATTGA AGAAGATATT GCGGCAATTC GCAATGCTTT GGCAGACTTA GAGAAGCAAG 360 

AATCTAAAAA TAGTGGTCGT GTTCTTCATG CTTTGGATTT ATTTGAGGAA CTTCAGCATA 420 

GAGTTGCTGA AAATTCAGAA CAGTATGGTC AAGCCTTGGA TGAAATTGAA AAACAATTAG 480

AAAATATCCA ATCTGAATTT TCACAATTTG TAACCTTGAA TTCATCGGGT GACCCTGTGG 540 

AAGCCGCAGT GATTTTGGAT AATACAGAAA ATCACATTTT GGCCTTAAGT CATATTGTGG 600 

ATCGTGTTCC AGCCTTGGTT ACGACGCTTT CTACAGAATT GCCAGATCAA TTACAGGATT 660 

TGGAAGCCGG TTATCGTAAA CTAATTGATG CTAATTATCA TTTTGTTGAA ACGGATATTG 720

AAGCGCGTTT CCACTTGCTT TATGAAGCAT TCAAGAAAAA CCAAGAGAAT ATTCGTCAGT 780

TGGAATTGGA TAATGCCGAA TATGAGAATG GACAGGCACA AGAGGAAATC AATGCCTTGT 840

ATGATATTTT TACTCGAGAA ATTGCTGCTC AGAAAGTAGT GGAAAATCTA CTTGCAACTC 900

TTCCAACTTA TCTTCAACAT ATGAAAGAGA ATAATACTTT ATTGGGAGAA GATATTGCAC 960

GTTTGAACAA GACCTATTTA CTTCCTGAGA CAGCTGCAAG CCATGTTCGT CGTATTCAGA 1020

CAGAATTAGA GAGTTTTGAG GCAGCTATTG TTGAGGTAAC TTCAAATCAA GAAGAACCAA 1080

CCCAAGCTTA TTCAGTTCTT GAAGAAAATC TTGAGGATTT ACAAACTCAA CTAAAAGATA 1140 

TTGAAGATGA GCAAATTTCA GTTAGTGAGC GCCTGACACA AATTGAGAAA GATGATATTA 1200 

ATGCACGTCA AAAGGCCAAT GTTTATGTCA ATCGTCTCCA TACTATCAAG CGATACATGG 1260 

AAAAACGCAA TCTGCCAGGT ATTCCACAAA CTTTCTTGAA GTTATTCTTT ACGGCAAGCA 1320 

ATAATACCGA GGATTTAATG GTTGAGTTAG AACAAAAAAT GATTAACATT GAATCTGTTA 1380 

CCCGAGTTCT TGAAATTGCA ACGAATGATA TGGAAGCTTT AGAAACGGAA ACTTATAATA 1440 

TTGTACAATA TGCAACTTTG ACAGAGCAAC TCTTGCAATA TTCTAACCGC TATCGCTCAT 1500 

TTGATGAACG CATTCAAGAA GCATTTAACG AaGCTTTAGA TATTTTTGAA AAAGAATTTG 1560 

ATTATCACGC TTCATTTGAC AAGATTTCTC AAGCATTGGA AGTGGCAGAG CCTGGTGTAA 1620 

CCAATCGCTT TGTTACCTCA TATGAGAAAA CACGTGAAAC GATTCGTTTT TAATAAAAGA 1680 

AAAAGATTTT ATTGTGTGAG GAGCAGAATC AAATCTTTTT CTATAGTTGT GGGGAGATTT 1740 

ACTTCATTTT CTCCTGAGAT TGAGTTTTTG CCCAGCCGAT TTATCCACTA CCTCAAAACA 1800 

GTGTTTTATA CTCTTCGAAA ATCTTTTCAA ATCACGTCAG CGTCGCCTTA CCGTACTCAA 1860

GTACAGCCTG AGGCTAGCTT CTTAGTTTGC TTTTTGATTT TCATTTAGTA TTAAAGTGAT 1920
TTCGCCAGTC TTATCTGCAG CTTCAAATCT GTACTTTGAG TAACTTGGTA ACCGTCCAAT 1980 
AACGAAGTCT ATTGAAAAAT CTCCAGACTA GAGAACTCAC GGATAGTTCC TAATCTGGAG 2040 
ATTTCTTATT TGCACTTTTC TTGTACAACT TTAGTCCACG GTAAATAGAC CTCTAAAACC 2100 
TCTTTGTTTA CGAGAGTTTC CTCGTTTGGA AGACATTCTA GAAGATAGGA TAGATATTTC 2160 
TCGCTATTTA TACTAGACTA AAATCAAAAA GCATTATATA ATAGTGATAT GAAATCAACT 2220 
AAAGAAGAAA TCCAAACCAT CAAAACACTT TTAAAAGACT CTCGTACAGC TAAATATCAT 2280 
AAACGCCTTC AAATCGTTCT ATAGTAAAAT GAAATAAGAA CAGTACAAAT CGATCAGGAC 2340 
AGTCAAATTG ATTTCTAACA ATGTTTTAGA AGTAGAGGTG TACTATTCTA GTTTCAATCT 2400 
ATTATATTTC GTCTGATGGG CAAATCTTAT AAAGAGATTA TAGAACTTTT ATAGTAGATT 2460

GAAATAAGAT GTGAACAACT CTATCAGGAA AGTCAAATTA ATTTATAGAA ATATTTTAGC 2520 

AGCCAAGGTG TACTGTTATA GATTCAATAC ACTATAGACT GTAATCAAAC AACGATTTGG 2580 

CGAAATGTAA AAAAATATGA GGAGTTCGGA CTCGACTCTC TCCTTCAAGA AACACGTGGT 2640

GGTCGTAACC ATGCATATAT GACAGTTGAG GAAAAGAAAG TCTTTCTTGC CCGCCATTTG 2700 

AAGGCTGCAG AGGCAGGAGA ATTTGTTACA ATTGATGCCT TATTTCAGGC TTATAAAAAG 2760

GAGTTAGGTC GTTCCTACAC ACGTGATGCC TTCTATCAAC TGTTGAAGTG CCATGGTTGG 2820 

CGAAATATTA TGCCACCTCC AGAACATCCx AAGAAAGCAG ACwrCAAAC CATTGTCGCG 2880 

TCTAAAAATA AAATCTCAAT TCAAGAAGAA AAGAAAGCGC TTTAAAACCA GTAGACGTTT 2940 

TCGTAAGGTT CGCTTGATGT ACCAAGATGA GGCTGGTTTC GGTAGAATCA GTAAACTGGG 3000 

ATCTTGTTGG GCTCCAATAG GAGTAGGTCC ACATATCCAT AGTCACTATA TACGAGAATT 3060 

TCGCTATTGT TATGGAGCTG TTGATGCCCA TACAGGCGAA TCATTTTTCT TAATAGCTGG 3120 

TAGATGTAAT ACTGAGTGGA TGAACGCCTT TTTAGAAGAG CTTTCACAAG CTTATCCAGA 3180 

TGATTATCTT TTACTCGTTA TGGACAATGC TATATGGCAT AAATCAAGTA CCTTAAAGAT 3240 

TCCGACTAAT ATTGGTTTTA CCTTTATTCC TCCATACACA CCAGAGATGA ACCCCATTGA 3300 

ACAAGTGTGG AAAGAGATTC GTAAACGTGG ATTTAAGAAT AAAGCCTTTC AAACTTTGGA 3360 

AGATGTCATG AATCAACTCC AAGATGTTAT ACAAGGATTG GAGAAGGAGG TGATAAAGTC 3420 

CATCGTTAAT CGGAGATGGA CTAGAATGCT TTTTGAAAAC AGATGAGTAT AAAAAGAAAG 3480 

TCCTCATTTC AATAGAAATC ACGACTTTCT GATGGATTTA TAGTAAAATG AAATAAGAAC 3540

AGGACAAATC GATCAGGACA GTCAAATCGA TTTCTAACAA TGTTTTAGAA GCAGAGGTGT 3600

ACTATTCTAG TTTCAATCTA CTATATTTTT GGAGTGATAG AAAAGCCCTT CATAAGCTAG 3660

TCTACTTGTT CAGGTGCGAG AGCTTTGACA TCTTTTTCTG TACTTAGCCA AGTCAGTTTT 3720

CCGTTCTCAA AGCGTTTATA TAGTAGCCAA AATCCTTGAC CATCCCAGTA AAGGGCTTTA 3780 

AAGCGGTCTT TACGTCCACC ACAAAAGAGA AAGACTTGAC CGGAGAAAGA ATCCAATTCA 3840

AAGTGGGTTT TAACTACATA GGCTAATGAG TCTATTCCCT GCCTCATATC TGTCTTGCCA 3900

CAAACAAGGT GAACTTGACC TAAATCACTT AGTTGAATTA TCATAGTACA ATACCTTTCC 3960

TCCGATAATT ATTTTTTATC TAGTATACTG GAAGTTGGGG AATTAGGATA GATACCTTGT 4020 

TATGACGCGC TTACGTAACT TGTAACTAGC TGCCTAGTTT GATCTTTGCT TCTTCATTGA 4080 

TTAGCAGTAG ATTTCAAAAT GATAAAAACG CATAGTATCA GGTATTGAAA TGTACTGCCC 4140 

CAAAAGTTAG ACAGAAAAAA TCTAACTTTT GGGGTGTTTT TGTTATGAAA TTAAGTTATG 4200 

ATGATAAAGT TCAGATCTAT GAACTTAGAA AACAAGGATA TAGCTTAGAG AAGCTTTCAA 4260 

ATAAATTTGG GATAAATAAT TCTAATCTTA GGTATATGAT TAAATTGATT GATCGTTACG 4320 

GAATAGAGTT CGTCAAAAAA GGAAAAAATC GTTACTATTT TCCTGATTTA AAACAAGAAA 4380

TGATTAATAA AGTCTTAC 4398

(2) INFORMATION FOR SEQ ID NO: 294: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 718 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 294: 

AGATTTTTAG ACTTTGTCTT TAATCGTTTC TTTTTAGGGA TGATTGCGAC ACCTTCTTTT 60 

GGCTATTAAC TTTAGCAGGA GGGATTATCC TTGGTCTAGC GCCGGCTAGT GCCACCTTGA 120 

TGAGCTTATA TGCAGAACAT GGTTATAGCT TTCGGGAATA CAGTTTGAAG GAGGCTTGGT 180 

CTCTTTACAA GCAAAATTTT GTCTCAAGCA ACCTGATTTT CTATAGCTTT TTAGGTGTGG 240 

GTCTAGTTTT GACCTATGGT TTGTATCTCT TGGTGCAATT GCCTCATCAG ACCATTGTTC 300 

ATTTGATTGC GACCCTTTTG AATGTCCTAG TAGTTGCCCT GATCTTTTTG GCTTATACAG 360

TATCTTTT^ ATTACAAGTT TATTTTGCCT TGTCCTATCG AAATAGTCTC AAATTATCCT 420 

TGATTGGCAT CTTTATGAGT CTAGCAGCTG TGGCTAAGGT TCTCCTTGGG ACTGTGCTAC 480

TTGTAGCAAT TGGTTATTAT ATGCCTGCCC TGCTATTTTT TGTAGGAATT GGGATGTGGC 540 

ATTTCTTTAT CAGTGATATG TTGGAACCTG TCTATGAAAT CATCCATGAA AAATTGGCGT 600

CAAAATAGAA TGAAGCAGTT TTGGCTACAT ACGCTTCTAA GAACCTATAG TTCAGTGATG 660
ATCATTATCA TTGCGAGTTT TGCAATCTTA CTCTCTTACG CTGTCTGGGA TTCACGTG 718 
(2) INFORMATION FOR SEQ ID NO: 295: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 718 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 295: 

TCGGTACCAA AATTCTGGAT TTATACTAGC AAAGATCCAA GAGCAAATTA TTTAACAGAT 60 

TTAGGTCTAG TTTTCCCTGA ATCATTAAAA GAATTTGAGA GTGAAGATAG TTTTGCAAAG 120

GAAATTTCTG CAGAAGAAGC AAATAAGATA AATGATGCTG ATGTAATCAT AACTTATGGT 180

GATGATAAAA CTCTTGAAGC TTTACAAAAA GATCCTCTTT TAGGTAAAAT AAATGCAATT 240

AAAAATGGTG CCGTTGCTGT AATTCCAGAT AATACACCGT TAGCAGCCTC ATGCACTCCA 300

ACACCACTTT CAATAAACTA TACTATTGAA GAATACCTAA ATCTTTTAGG AAATGCATGC 360

AAAAATGCGA AATAAAAAAC AAATAAACCT AGGCATAATT TTTATAATCT GCCTAGGTCT 420

TCTTATTACA ATATTTTTGT CATTAAAGCT TGGAACAAAA GAAATTAATA TCAGAGATTT 480

TTTAGCAGCT TTTGGAATGG GTAATACAAA TGATGATTTT ATTAAATCAA TTATATATAA 540

rAGAATACCT AGAACTATTT TTGCAATTTT AGCAGGTTCT AGTCTTGCCA TAAGCGGTGT 600 

ATTGATGCAA TCAGTTACTA GAAACCCAAT AGCTGATCCA GGTATACTCG GTATAAACAC 660 

AGGAGCAAGT CTTAGTGTAG TAATTGGTCC TTCtTTTTAG GGAATTCATC AAGCATAA 718 



(2) INFORMATION FOR SEQ ID NO: 296: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1436 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 296: 

GAACTAATCA TTTTTACAGG ATGAGATTTA CAGCAGAGAG TTTGAAGGCT TTATCAAAGG 60 

TTTTTCTTGG CATAATGACT TTTCCTCGTT TCCACTTAAT TTTGTGTCTA CTTTATTATA 120 

CCAAGTCCAC sCTTAAGTTA GATAATAAAT CTAACTTAAG GAAGCTAGAA GGATGAGAAT 180 

CCAGGTGGTC AAGAGTCCCA AACTTAAGCT GATGGGGACA CCCAGAATAA TTTGCTTTTT 240

GAAGGCAAGG CCACGTTCCT CTATATTGGG AAGTGAGAGT TGAATGAGAG AACCAGCTGA 300

TGAAAAGGGT GAGATATTAG TAGATAGAGC GCCAATAACG GTGGCTGTTG TGAGTAAGTG 360 

AATATCAATC TGAGGATTTT GAGGACTGAT GATAGCAATG ATGGGAAAGA GGGCTGGAGC 420 

TACAACGGAT AGGGTGGAAC TAAAGAGTGA CATCACTCCG GCTATCACAC AAAAGAACAG 480 

AGGTAACCAG AAATGAGGAA TGGTTGTTGT CATGAGGTGC CCTATCAGTG TGACTAAACC 540 

TGACTTGACC GCTAGAGACA TTAGTAAGCT CATGCCGCAG AGCATGATAA TTGTAGCCCA 600 

GGGAACCTTA GCTAAAATGG CTTCTTGCTT CCCTAATTTG AGCCTTAAGG CGAGGCAGAC 660 

CATGAGTATT GAGACAAAGC CAATATCAAA TGTTTTTTGA TAAGTAGCTA TCCAGGCGAT 720 

GTTTGGGAAA ATGAGATGCA ACAAGGGAAA AAGCCAAACC AAAACCATGC TGCTGATCAT 780 

GAGCAAGGTG GTTTGTCTTT GAACCTTGCT GAGGAGTGGT GGTTGGTCAA TAGTCAAGGA 840 

TGAGTTTGTT CTTCCCTTAC TATAGTGACT GTAACAGGAT AATAAAAGCA AGACGATGAG 900 

TGGGTAGATA ATGCTGACGA TAAAGATATG ATTGCCAAGT GAAAAAGCTT GCTCTTCCCA 960 

TCCCATTTGC TTAAACAGGC CTTGAAAGAC AATGCCTGAG CTACTGGTTA TCAAATTAGC 1020 

CCCTCCTGAA GCTCCCCAAT TGACGGCTTG AGCTCCAATC AAAGGGTGTT TGTCCGCTTT 1080 

TTGACAGAGG GTAATCGCTA GAGGACAGCA AACGGCCATA GTAGTGAAAA ATCCAGCACC 1140 

TAAAGCAGAC AAAAGGGTTG CCATCAGGTA TAAAATCATG TAGAGGGCGT TAGGGTGGGT 1200 

GCGTGTGCGG TAGAGAATGT GTTGAGCCAA AACATCAAGA GTACCGTTAG TTGTTGCAAC 1260 

GTTATAAAAG AGAGAGACGC TAAAAATGGT AAAAAAGAGT GAGGTTGGCC AAAAATGAAG 1320 

AAGTTCTTTG GGGCTTAATC CCATGAGAGT GGTTGCGATG AGGTAAGAAA AAGCAATAGC 1380 

CAGCAGGCCA ATATTGATTT TGGTGCGGTA ACCAATTCCA ATGGCTAGAG CAATGG 1436

(2) INFORMATION FOR SEQ ID NO: 297:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1596 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 297: 

CCATTTGGGA AAGAACGTAA GAGTTTGCAG GGTGAGATTC CAGAAGAATT TTCAATGTCA 60 

GCCGTTGACA TGTCTATGAT TGACCACATT CCAGATATGA TTGAAAATGG TGTGGACAGT 120

CTAAAAATCG AAGGACGTAT GAAGTCTATT CACTACGTAT CAACAGTAAC CAACTGCTAC 180

AAGGCGGCTG TGGATGCCTA TCTTGAAAGT CCTGAAAAGT TTGAAGCTAT CAAACAAGAC 240
TTGGTGGACG AGATGTGGAA GGTTGCCCAA CGTGAACTGG CTACAGGATT TTACTATGGT 300
ACACCATCTG AAAATGAGCA GTTGTTTGGT GCTCGCCGTA AAATTCCTGA GTACAAGTTT 360
GTCGCTGAAG TGGTTTCTTA TGATGATGCG GCACAAACAG CAACAATTCG TCAACGAAAT 420
GTCATTAACG AAGGGGACCA AGTTGAGTTT TATGGTCCAG GTTTCCGTCA TTTTGAAACC 480
TATATTGAAG ATTTGCATGA TGCCAAAGGC AATAAAATCG ACCGCGCTCC AAATCCAATG 540
GAACTATTGA CTATTAAGGT GCCTCAACCC GTTCAATCAG GAGATATGGT TCGTGCATTA 600
AAAGAAGGAC TCATCAATCT TTATAAGGAA GATGGAACCA GCGTCACAGT TCGAGCTTAA 660
GAAAGGAAAA GGAAATGATA GAGGCACAGG GTTTCTTAGT GGATAAGCAA ACAAGATGCA 720
TTCATTACCA TAGCAAGCTG GATATTATTG CTTTACAATG CTATGATTGT AAAAAGTATT 780
ATGCTTGTTA TCGGTGTCAT GATTCATTAG AACATCACCC TTTTGAGCCX5 TATCCCTTAT 840
CTTTGATACA GGATAAGCCT ATTTTATGTG GTGTTTGTCT AAAACTACTA ACATATAAGC 900
AATATAAAGA AAGCTTAAGT TGCCCCTTTT GTTTTTCTCG CTTTAATCCA GGTTGCCAAA 960
ATCATAAGGA ACGCTATTTT AAATAGCAAA TCATCTAGTT TTGAAGTAGG AGAAAACTCA 1020
ATTTCAAGAG AAAATGAAGT AAATCTTCCC ACAATAAAAC GCATAATATC AAGATTGTTC 1080
AATACCTGAT ACTATGCGTT TTTAAGATTT TAAAGACTTT TTTCCTTTAT CTGGTATTTT 1140
GACTACTTGT TAAAACtGGG UTAATTTTCG ACTGTTTAAT AGTTATTATG CAAAGTCTAA 1200
AAGGTTAGAA TTGTCAAAAC AATCCGTCTA GAGTATGCGT GATGCCAACC GTGGTGGATG 1260
TTCTCAGTCA TGCCGTTGGA AGTACGACCT TTACGATATG CCATTTGGGA AAGAACGTAA 1320
GAGTTTGCAG GGTGAGATTC CAGAAGAATT TTCAATGTCA GCCGTTGATA TGTCTATGAT 1380
TGACCATATC TCAGATATGA TTGAAAATGG TGTGGACAGT CTAAAAATCG AAGGACGTAT 1440
GGAGTCTATT CACTATGTAT CAACAGTAAC CAACTGCTAC AAGGCGGCTG TGGATGCCTA 1500
TCTTGAAAGT CCTGAAAAGT TTGAAGCTAT CAAACAAGAC TTGGTGGACG AGATGTGGAA 1560
GGTTGCCCAA CGTGAACTGG CTACAGGATT TTACTATGGT ACACCATCTG AAAATGAGCA 1620
GTTGTTTGGT GCTCGTCGTA AAATCCCTGA GTACAAGTTT GTCGCTGAAG TGGTTTCTTA 1680
TGATGATGCG GCGGTA 1696

(2) INFORMATION FOR SEQ ID NO: 298:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1022 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 298:
CCGAGTTTAT TATGGTTTCT TCGGAATTTA TCTCAAAGAT TGAATTTGCT TGCAATAAGA 60 
AAGAAAGTCT TTATAGTCAA AGCAAATTTA AGTATGCGAT TCGTTCGATG TTCGCAGGTG 120 
CATTTTTAAC CTTCAGTACT GCTGCAGGTG CAGTTGGGGC TGACTTGATT AATAAAATTG 180 
CACCAGGTAG TGGACGCTTC CTCTTTCCAT TCGTTTTTGC TTGGGGCTTG GCCTACATTG 240 
TTTTTTTGAA TGCCGAGTTG GTCACTTCAA ACATGATGTT CTTGACTGCT GGTAGTTTCT 300 
TAAAAAAAAT CTCTTGGAGA AAAACAGCTG AGATTTTACT ATACTGTACC TTGTTCAACC 360
TTATCGGAGC CTTGATAGCA GGGTGGGGCT TTGCTCATTC GGCAGCCTAT GCGAATCTGA 420
CACACGATAG TTTCATCTCA GGTGTTGTTG AGATGAAGTT AGGCCGCTCA AATGAATTGG 480
TCTTGCTTGA GGCGATTTTG GCAAATATTT TTGTAAATAT TGCGATTCTG TCATTTATTT 540
TGGTCAAAGA TGGTGGTGCC AAACTTTGGC TTGTGTTGTC AGCTATTTAC ATGTTTGTAT 600
TCTTAACAAA CGAGCACATT GCGGCGAACT TTGCTTCTTT CGCGATTGTG AAATTCAGTG 660
TTGCTGCGGA TTCAATTGCC AACTTCGGTG TTGGAAATAT GCTTCGCCAC TGGGGTGTGA 720
CTTTCATCGG AAACTTTATC GGAGGAGGCC TCTTGATGGG TCTTCCATAT GCCTTCCTCA 780
ATAAAAACGA AGATACTTAT GTAGATTAAG AAAATGAGCA CGATTGAGTC GTGCTTTTTT 840
CATTTTCAAA ATAAGGTAAT AGCTATTTCT TATATCAAAA TATAGAAAAC TGATATTTGT 900
ArACTATAAC TCAAGGTGCT ACAATATCCT TAATAAAATA ATATGGAGGT CACCTTATGA 960
CTTGTGATTT TAAATnTGAA ACTCTACAAC TACATGCTGG TCAAGTTGTG GCTCCAGCTA 1020
CT 1022

(2) INFORMATION FOR SEQ ID NO: 299:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 663 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 299:

CCTTAAGTAA TCTCTGATAA TATTTTCTTT ATTAGCATAG GGGAATATCG ATATAATGGC 60

TTCATTATGA GTGGCAGGAA TATCCAATAT GGCAACTTTT CCAATAGATA ATTTAAAACT 120

CATTAATAAA GTTCCTTTAG GTGAAATGTC TATTTTCTTT GATTTTAATG CTAATTTAGA 180

AATAGATTCT CTCGCATTAG TTACATAACC AGATATAGGC ATATCTGATA TAGATACCCA 240

AGGTATTTCA GTTCCCCAAA AAGTAGCTTC ACTGCGTGGA GGAGTTTTTC CTATTCTGAA 300

GTTAACTAGG CTAGCAAATT TAATATATCT CCATGCTTCT GGGATTTCAT ATATAGGATA 360

AGAGGTTGTT TCGTCTTTGT TCCCATAATA AGAGTTATCA TCTCCTTGGG AAACAATAGA 420

AATGTCCAAA TCTTTCTTTT TAATCTTGCC TTCTTCAAAG AGTTTTTGTT TTTCTGCTCG 480

TATTTTTTCA AGTAAAACTT CGACTGATTC ATCATTTGGG TCTTGTTCAA CTAATTTTCC 540

TTGCATAGCA TATTGAAGAA TAGATTTTTT TAGTTTATCT GGAAATTCTT TATCTAGCTG 600

TTCTAGTCTA TTATAACTTT CAGCATATTC ATCTACTTTT TCTAAAGCTG ATTCGATTGC 660

TTC 663

(2) INFORMATION FOR SEQ ID NO: 300: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 881 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 300: 
CGTCGCTGAA CATGTCAACA GCAAATTAAA CTAAACAAAC TAAAATTATG TGATACTTCA 60
f^Tl'VZi. 71 T'TT'tTI/"' mmm -k r-^ •» - . — _ - 
' ^ ATTATCAGAA GAAAGTTGAG AAAAATGGCA QPJ^J^PJ^iZK? 120
ATCCTATGAC CCTTGAGGAA AAGGAAAAAC TTGAAAAAGA ATTAGAAGAA TTGAAATTGG 180
TTCGTCGACC AGAAGTGGTA GAACGCATTA AGATTGCCCG TTCATACGGT GACCTTTCAG 240
AAAACAGTGA GTACGAAGCA GCTAAGGATG AACAAGCCTT TGTCGAAGGA CAAATCTCTA 300
GCTTAGAAAC AAAAATCCGC TATGCTGAAA TCGTCAATAG CGACGCAGTT GCCCAGGACG 360
AAGTAGCGAT TGGTAAAACA GTCACCATCC AAGAAATTGG TGAGGACGAA GAAGAAGTTT 420
ATATTATCGT AGGTTCAGCT GGTGCAGATG CCTTTGTAGG TAAGGTTTCA AATGAAAGCC 480
CAATTGGGCA GGCCTTGATT GGCAAGAAAA CAGGTGATAC AGCAACCATT GAAACGCCTG 540
TTGGTAGCTA TGATGTAAAA ATCTTGAAGG TTGAAAAAAC AGCCTAAAAA CAGAAAAAGG 600
AGTGGGGAGG CGATGTGCTT CACTCACTCC TTTTTCCATT TTGCTACTCT TCGAAAATCT 660
CTTCAAACCA CGTCAGCGTC GCCTTGCCGT ATGTATGGTT ACTGACTTTG TCAGTTTCAT 720
CTACAACCTC AAAACAGTGT TTTGAGCTAA CTTCGTCAGT TTCATCTACA ACCTCAAAAC 780
TATGTTTTGA GCTGACTTCG TCAGTTTCAT CTACAACCTC AAAACCATGT TTTGAGCCGA 840
CTTCGTCAGT TTCATCTACA ACCTCAAAAC TATGTTTTGA G 881

(2) INFORMATION FOR SEQ ID NO: 301:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 949 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 301:

CCTTTTTTAA TACAAGTTAT TTTGATTTAA CCGGCTTGTC TTGAGCTGTC TGCAAAGCTG 60
TGGCAATCGT ATCTGCATAC AATTTTGCTC CTGCTTCGAT AGTGCTACTC TCACTCCCGA 120
AATGAACCTG GTCTGTTCCA GCCCAAATTT CTGGATGCTC TTTCGCAACT TGATTCCAAT 180
CTGCTATCGT AATGTAAGGT GTCTTCTCTG CCAATTCTCT CATATAGGCA GCAGCCTTCT 240
CAACGATGGC ATAGGTCTCT TTTGTCTTAT CTCCCTCATA AGGAGTCACC AAAATCATAT 300
GGTGTCCCTT AGGAAGATTT TTCACGATAC TGTCCCAGTC ATCCTTGTAA TTCTCAGGAT 360
TATTTACCCC AGTCGCAATG ACCACCGTCT TAGGTAAAAA TTTATTCTGG CTATTATTTA 420
GCATGATTTC ATTTGCGGTC TTGGTTGTTA CGCTGACCTG CGCGTTAATC TGTGCTCCAG 480
GAAGAGCTGT CTGTAGTGCT GTATTTGCCC TTAAAGCCAC TGAGTCACCA ATTAACATAG 540
TGCCATCAGC AATTCCCAAA CTGTTTGCAT CTGCCCGTTC TGCCATCACC TTGGTCTGGC 600
CAATATTTGT TGCAGCTTGC TTCAAGCCAT TGACAGTCAA GTCTGTCTCA AACGCTCCCA 660
CTTGTGGTGC CAACAAGGTC ACCGTGCAGA CAATGATGGT CAAGATTCCT GTACCTGCTG 720
CAAGAATTGC GTGAATATAA GGCAGGGGAC GAAsGGTTTG GACAATAGGT GTGTTCTTGC 780
CTGCAATCCA AGGTTCCAAT ACATAAAATG ACAGACTGGC AAAGCCATAA GAACAAATCA 840
GAGTCAGTAA TACAGCAAGA AGATTTGATG TCAACTGTGA GAAAATGATA TAGAAAGGCC 900
AATGGAAAAG ATAAACCGCA TAGCTAGTAT CCGCTAAAAA GCTGATAAT 949

(2) INFORMATION FOR SEQ ID NO: 302:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 622 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 302:
AAGATATATT TTTTACACAG AAGTATGCAA AAGTAAAGAG TGCAAAAAAT GGAATTAAAG 60

CGAAAATAAA AGCCGTGTAC AGGCGACCAA ACCAACGTAC ACGGCTAAGG AAAAATAACA 120

AAACTCAAGC AAAGGCAAGG CGCGTGGTTT TGTTAGGTAT TTAGCAAGGG GACAAACCCC 180

TTTGTAAATA ATCTCCTCTT ATTTTATCAA AATTAGAGGA AAATGACAAC TTAATTTATA 240

AAAAGGAAAA ATGGAGGATA TAAATGGAAA TTCTGTCTAA AGAAATACAG TTACAGGGCT 300

TACAACTTCT TAAACAGACT CTTGAAACT? TAGTTGAGCT AGAAAAACAA CGATCTAGTA 360

AGTTAGATTT AATTTCTCGT AAAGAATTAA TGGATCTGCT AGGTATAAGT GCTACAACCC 420

TTGATAACTG GGAGGATCTT GGTCTTAAAC GATATCAGAC TCCGATGGAT GGAGCTAAGA 480

AAGTATTCTA TCGTCCGTCA GATGTGTATT TATTTTTAGC AATAAAATAG GAGTTATGAA 540

ATGAAAATTG TTACTTTCAA ACCAACTAAA CAAATAGACG ATGGGTTTTA ACTGCCAGGT 600

ATTGACATTC TATTTGTCTC AG 622


(2) INFORMATION FOR SEQ ID NO: 303:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1929 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 303:

CGCTAACTTG CAAACAAAAG AAGAACCCAA ACTCCACAArt tcctitacgc agaaactcaa 60

TCTCATCTAC TTACCTTGCT GACTTGGTAG AGTATGTTGC AGACAAAGAC TTCTCAGTAA 120

ACGTAATTTC TAAATCAGGT ACAACAACTG AACCAGCGAT tgctttccgt GTCTTTAAAG 180

AACTCTTGGT TAAGAAATAC GGTCAAGAAG AAGCTAACAA ACGTATCTAT GCAACAACTG 240

ACCGCCAAAA GGGTGCTGTT AAGGTTGAAG CAGACGCTAA CGGTTGGGGA ACATTTGTTG 300

TTCCAGATGA TATCGGTGGA CGCTTCTCAG TATTGACAGC CGTTGGTTTG CTTTCAATCG 360

CAGCATCAGG AGCTGACATA AAAGCTCTTA TGGAAGGTGC GAATGCAGCT CGCAAAGACT 420

ACACTTCAGA CAAAATCTCT GAAAACGAAG CTTACCAATA CGCAGCTGTT CGTAACATCC 480

TTTATCGTAA AGGCTATGCA ACTGAGATCT TGGTAAACTA TGAGCCATCA CTTCAATACT 540

TCTCAGAATG GTGGAAACAA TTGGCTGGTG AATCAGAAGG AAAAGACCAA AAAGGTATCT 600

ACCCAACTTC AGCCAACTTC TCAACTGACT TGCACTCACT TGGTCAATTT ATCCAAGAAG 660

GAACTCGTAT CATGTTTGAA ACAGTTGTCC GTGTTGACAA ACCTCGTAAA AACGTGCTTA 720

TTCCTACTTT GGAAGAAGAC CTTGACGGAC TTGGTTACCT TCAAGGAAAA GACGTTGACT 780

TTGTAAACAA AAAAGCAACT GACGGTGTTC TTCTTGCCCA CACAGATGGT GATGTACCTUV 840

ACATGTATGT GACTCTTCCA GAGCAAGACG CTTTCACTCT TGGTTACACT ATCTACTTCT 900

TCGAATTGGC AATTGCCCTT TCAGGTTACT TGAATGCTAT CAACCCATTT GACCAACCAG 960 

GTGTTGAAGC TTATAAACGT AACATGTTTG CCCTTCTTGG AAAACCAGGA TTTGAAGAAT 1020 

TGAGCAAAGA ACTTAACGCA CGTCTATAAT AGAAGAAAAG AGTGGTTTGC CCACTCTTTT 1080 

TACTCTCTTT ATCCATAGAA ATTGGACTCA GCCAAGACTT GTGATATAAT ATAGAAAGCA 1140 

AAAAGGCAGA CGCCTAGATA ATAGGAGAAA CTATGTCAAA AGATATCCGC GTACGTTACG 1200 

CACCAAGTCC AACAGGACTA CTACACATCG GAAATGCTCG TACAGCATTG TTTAATTACT 1260 

TGTATGCGCG CCATCATGGT GGAACATTTC TCATCCGTAT CGAAGATACT GACCGTAAAC 1320 

GCCATGTCGA GGATGGTGAA CGTTCACAAC TTGAAAACCT TCGCTGGTTA GGCATGGATT 1380 

GGGATGAAAG TCCAGAATCA CATGAGAATT ATCGCCAGTC TGAGCGTTTG GACTTGTATC 1440 

AAAAATATAT TGACCAACTA TTAGCTGAAG GAAAAGCCTA TAAATCTTAC GTTACAGAAG 1500 

AAGAGTTGGC AGCTGAACGC GAACGCCAAG AAGTAGCTGG CGAAACACCA CGCTACATCA 1560 

ATGAATACCT TGGTATGAGT GAAGAAGAAA AAGCAGCTTA CATCGCAGAA CGTGAAGCAG 1620 

CAGGGATCAT CCCAACTGTT CGTTTGGCTG TCAATGAGTC AGGTATCTAC AAGTGGCATG 1680 

ATATGGTCAA AGGCGATATC GAATTTGAAG GTGGCAATAT CGGTGGTGAC TGGGTTATCC 1740 

AAAAGAAAGA CGGTTACCCA ACTTACAACT TTGCCGTTGT TATCGATGAC CACGATATGC 1800

AAATCTCTCA TGTTATCCGT GGAGATGACC ATATTGCTAA TACACCAAAA CAGCTTATGG 1860 

TCTATGAAGC TCTTGGTTGG GAAGCTCCAG AGTTCGGTCA CATGACCTTG ATTATCCACT 1920 
CTGAAACTG 1929

(2) INFORMATION FOR SEQ ID NO: 304:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 708 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 304:

AAATTTAAGA AAAAGGAGAC ACATCATGTC TAAAAAAGTA TTATTTATCG TCGGATCACT 60

ACGTCAAGGT TCTTTCAACC ACCAAATGGC GCTCGAAGCT GAGAAAGCAC TTGCTGGTAA 120

AGCGGAAGTT AGCTACCTTG ATTATTCAGC CCTTCCTCTC TTCAGCCAAG ATTTGGAAGT 180

TCCAACACAT CCAGCTGTAG CTGCTGCTCG TGAAGCAGTT CTCGTTGCGG ATGCTATCTG 240

GATTTTCTCT CCAGTCTACA ACTTCTCTAT CCCTGGTACA GTGAAAAACT TGCTTGACTG 300

GCTATCTCGT GCCCTTGACT TGTCTGATAC ACGTGGCGTT TCTGCCCTTC AAGACAAGTT 360 

TGTCACAGTA TCATCTGTAG CCAATGCAGG GCACGATCAA CTTTTCGCTA TCTACAAAGA 420 

CCTCTTGCCA TTTATCCGTA CACAAGGCGT TGGTGATTTC ACTGCTGCAC GTGTTAATGA 480 

CTCTGCCTGG GCAsACGGAA AATTGGTTCT TGAAGAAACA GTCCTAAACT CACTTGAAAA 540 

ACAAGCTCAA GACTTGGTCG AAGCTATCAA GTAACTAACA CTCAATAAAA ATCAAAAAGC 600 

AAACTAkGAA GCTArCCGCA AGCTACTCaA gCACTGCTTT GAGGTTGTAG ATAGAACTGA 660 

CGAGTGTiinA ACATATATAC GGTAAGGCGA CACTGACGTG GCTTGAAn 708 
(2) INFORMATION FOR SEQ ID NO: 305: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 781 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 305: 

CTTCTTTTCT TGGAAATAGG TGTATAATAC GTTTATTAAA TTTTTGAGGA GTTGTCTATG 60 

AAGAAAAGTT TTATCCATCA ACAAGAAGaA ATTTCCTTTG TCAAAAACAC TTTTACCCAG 120 

TATTTGAAAG ATAAGCTAGA AGTTGTCGAA GTTCAAGGTC CTATCTTGAG TAAGGTCGGT 180 

GACGGAATGC AGGACAACCT GTCTGGTGTG GAAAATCCAG TATCGGTCAA GGTTCTCCAA 240 

ATCCCTGATG CTACTTATGA AGTGGTGCAC TCACTTGCTA AATGGAAACG CCACACCTTG 300 

GCTCGTTTTG GCTTTGGTGA AGGAGAGGGT CTCTTTGTCC ACATGAAAGC CCTTCGTCCA 360

GATGAGGATT CCTTGGATGC AACCCACTCT GTTTATGTTG ACCAGTGGGA CTGGGAGAAG 420 

GTTATCCCAA ATGGTAAGCG TAACATCGTT TATCTAAAAG AAACAGTTGA GAAGATTTAT 480 

AAGGCTATTC GCCTGACTGA GCTAGCTGTT GAAGCCCGCT ATGACATCGA GTCTATCTTG 540 

CCAAAACAAA TTACCTTTAT CCATACAGAA GAATTGGTAG AACGCTACCC AGACTTGACA 600 

CCGAAAGAAC GTGAAAATGC GATTTGTAAA GAATTTGGAG CCGTCTTTTT GATTGGTATC 660 

GGTGGCGAGT TGCCAGATGG TAAACCGCAC GATGGACGTG CACCAGACTA TGATGACTGG 720 

ACAAGCGAGT CTGAGAATGG CTACAAGGGT CTAAATGGTG ATATTCTTGT CTGGAATGAG 780 

T 781 
(2) INFORMATION FOR SEQ ID NO: 306: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 846 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 306: 



CCCGCATCTT GTAGGGTTTT AACGGGCACG ATTTTCATAT CCGTCTTGAT TGTTTTAGCC 60 

GCTTCTAGGG CTGTTTGGTA GTTGTTTTTC GCGTCCGGAT GCGCCTTTTG TTCTTCTTCG 120 

CTAACAGGGT TATCAGGAGC AAAGAAAATA GCAGCACCTG CCCTAGCCGA AGCTACAACC 180 

TTCTTATCAA TACCTCCAAT GTCTCCCACA TTACCATCGC GGTCAATGGT ACCTGTACCG 240 

GCAACAATAC GACCATTACG AAGATCTGGG TGAGCTATTT GAGTATAGAT AGCTAGACTA 300 

AACATGAGAC CAGCACTTGG ACCGCCAATA CCAGCTGTTG AAAAGCTAAT TGGGACATTG 360 

CTGATTACCT CTGTACGGTC AATCAAGCCG ATTCCAATTC CATTTTTGCC ATTTTCCAAG 420 

GTGATGATTT TTCCTTCTGC AGACTTGGTT TGCCCATCCT CTTCATAGGT GACCTTGACG 480 

GAATCCCCTA ATTTTTGAGA ACTGACGTAA TCAATCAAGT CTTTGGAACT ATCAAAGGTC 540 

TGATCATTGA CTGCTGTGAC TGTATCAGAG ATATTGAGAA TCCCTTTAAA GGTTGAATTA 600 

TCCGTCACAT TCAAAACATA AACTCCAAAG TACTTGAGTT CGATATCCTT ACCAGCTGTT 660 

TTTAGTCCTT GATACTTGGC CATATTTTCC GATGTTTGCA TGTAGaATtG A-ri-GATTCGC 720 

ATAAATTCAA CATCGGAAGA ACCACCTGTA GTCTCCTGAG CACTACGAAT ATCTGTAAAA 780 

GGTGTCAACC AAGCATAAAT CATATGAGCT AAAGTGGCAT GTTGAACACC AACCGTAACG 840 

AATTGT 846 



(2) INFORMATION FOR SEQ ID NO: 307: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 829 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 307: 

GCGATCTGCT TGGGCTTTTC CTATTACCTT ATCTAATAAA TAGGTACGCA GACTCATAAC 60 

CATATAAAGT CCACCCCCCA TGGCACCGAC AAGAGCTACA TAAAAGAAGC TCCACAAACG 120 

TCCACTTGGT TGGAAGAAAA ATCCTAACAG CCACTGGATG GTTCCTATTA ACAGAAACAT 180 

GACTAGGGTC AGCAAACTGA TTAAAATGGT TCGCTTCAAA ATCACCTTGC GCTTGACACC 240 

AGTTACTTTA CAAATATCCC GATACATCAA GACGTTAGGA ATGATGAGAG CAATGGTTGT 300 

TGAAATCAAA GGACCATAAC TGTGGAAGAG GGCGATGGTA GGTAGTTGCA AGACTAGCTT 360 

GGCAATAGAA CCATAGATAA AATAGAGAAC GGCCTTGCGG TTGCGGAACA TGGCCTGAAG 420 

CATTGGAGAC AAGACCATGT ACAAGCCTAA AATAATAGAC TGCAAAACTG CAAAGACAAA 480 

TAAGCCCAGA GCCAAACTAT CTGGCTTACC ATAGAAGACC GTATAAAGAG GTTCTCCTAC 540 

CATAACCACT CCAACCGTTG CTGGTAGCAA GAACATAAAG AGTAGGGTGA GACTGTCCTG 600 

AACGAGACGA GAAGCTGCTT TCAAGTCCCC CTTGACATAG TTTTCCGTCA AAAGTGGCAA 660 

ACCAACACTC CCAATCGAAA CCCCTACAGA AATCAAAATC ATCGTGATTT TATTAGGATT 720 

GGCTGAGAAA TAAGAAAACA TGACAACCAA GTCCTCATTG CTGTAGTTGG TAAACCAGCT 780 

CATACTATTG ATAAAGGTCA GCTGAGTCCA AATCTGGAAG AGCTGGATG 829 



(2) INFORMATION FOR SEQ ID NO: 308: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 464 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 308: 

CGAACATCTT GCTGGCTGAT TCGTCTGCCG CCATCGCAGC CCCGAACACA TTGCGACCCA 60 

TGGCAAGCGG GCTCAATCCG CACATGGGAT CCGTGCCAAA GCCCCGCGTG TGCATCATTT 120 

GCTCATCTAG TAACGTATGA GGTTTGCCTT CGCTGTCGAT AAACCGATAT TCAATCGCAC 180 

CACTGCTCGT TCTCCGCGGA GGGGAAACCG ACTGCGGTAG GATGAACTCC AGAGAAGAGA 240 

GATCACGACC TACCAGGTGC GGCTCGTTGA AGCTGTTGCC GCTTAGCAGC AGGCTCGICCA 300 

CCACGCATTC CCAGAACTCA ACGGGGGTTT GATCGGCGTT CGGTTGCTGA CTAATAACTC 360 

GGTGCACGGG ATGCGAAGTG GCCACTTCTG GCACACCGTT CTTGTCTTCG TAGAGAGCAA 420 

TTGGGAGGGT GGCCAGCGTT TCGGCGATGA GGCGCACGCA GGCC 464 



(2) INFORMATION FOR SEQ ID NO: 309: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 982 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 309: 

CCGTCTATAA TGGTAATAGA TTTTATTTGG AGGTTTTTAT GTCATTTCTA TCAAAAAATG 60 

GAGCAGGTAT CTTGGCCTGC CTTCTCATTT CCATCCTATC TTGGTACTTA GGAGGATTCT 120 

TCCCTGTGGT TGGCGCGCCC GTTTTTGCCA TTTTCATAGG CATGCTCCTA CATCCCTTTC 180 

TCTCGTCCTA TAAACAACTG GATGCTGGTT TGACCTTTAG TTCCAAGAAG TTGCTCCAAT 240 

ATGCCGTTGT CTTGCTTGGT TTTGGTCTCA ATATCTCGCA GGTCTTCGCA GTTGGCCAAT 300 

CTTCACTCCC TGTCATCCTG TCCACTATCT CAATAGCTCT GATTATTGCC TACCTCTTCC 360 

AGCGTTTCTT TGCCCTGGAT ACAAAACTGG CTACCTTGGT TGGAGTAGGT TCTTCTATCT 420 

GTGGGGGTTC TGCCATTGCA GCGACAGgCC CGTTATTGAT GCTAAGGAAA AGGAAGTAGC 480 

CCAAGCCATT TCCGTTATCT TTTTCTTCAA TGTCTTGGCT GCGCTCATCT TTCCAACCCT 540 

CGGCACCTGG CTTCATCTAT CCAATGAAGG CTTCGCCCTC TTTGCAGGGA CTGCGGTCAA 600 

CGACACTTCC TCTGTAACGG CTGCCGCCAG CGCTTGGGAC AGTCTTTACC AAAGCAATAC 660 

CCTCGAGTCT GCAACCATTG TTAAACTCAC ACGTACTTTG GCCATTATCC CTATCACGCT 720 

CTTTCTATCC TACTGGCAAA GTCGCCAACA AGAAAACAAG CAAAGCCTGC AACTGAAAAA 780 

AGTCTTCCCA CTTTTTATCC TTTACTTTAT CCTTGCCTCT CTCCTCACTA CACTACTCAC 840 

CTCTCTAGGT GTGTCCAGTA GTTTCTTTAC TCCTCTCAAA GAACTCTCTA AATTCCTTAT 900 


TGTCATGGAC ATGAGTGCTA TCGGTCTCAA TCAAATCCAG 960 

TGGAAAATCC ATTCATCATG GA 982 



(2) INFORMATION FOR SEQ ID NO: 310: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1939 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 310: 
CTAGCTGCCA ATATGATTGG GGTGCAGAAG CGCGTGATTA TCTTTAATCT TGGCTTGGTT 60 

CCTGTGGTCA TGTTTAACCC AGTGCTTCTG TCCTTTGAAG GATCCTATGA GGCAGAAGAA 120 

GGCTGTTTGT CCTTGGTAGG TGTGAGATCA ACTAAGCGTT ATGAAACCAT AAGGCTTGCC 180 

TATCGTGACA GCAAGTGGCA GGAACAGACC ATTACCTTGA CAGGCTTCCC AGCTCAGATT 240 

TGCCAGCATG AGCTGGATCA CTTGGAAGGA CGAATCATTT AGGAGGAAAG CAAATGAAAC 300 

GAATAGTCTT TGAACTTATT TTTATCGCAA CGACCTGGTA TATCTTTTTA CCGCCCCTTA 360 

ACCTGACCAG CTGGGAATTT CTCTTCTTCC TCTGTGGGCA TTTGTTAGTT GTGGCAATAT 420 

TATTTGGCTT TGGCAAGGGG ATAAACCTTG TCAAAACGGT TCATGTGCGC CACGGTAAGG 480 

CGGAAGCTGC CTTAAATCTT GAGGGTTTCA AAATCT^TCG GTTAGGGAAA ATTCTGTTAG 540 

CTTCGATTGG AGGAATTCTT CTCTTGGCAG CTTTGGTTTc CTTGGTAACT TCCAGCATGT 600 

TTCAG6CT7^ AAATTATGCC AATGTAGTCA CGGTTACGGA AAAAGACTTT ACTGAATTTC 660 

CTAAGAGTGA CACCAGTAAG GTTCCTATCC TAGATAGAAG TACTGCTGAA AAAATTGGAG 720 

ACCGCTACTT GGGTTCCCTA ACCGATAAGG TGTCGCAATA CGTAGCGGCA GATACCTATA 780 

CCCAATTGAC AATTGATGGG AAACCTTATC GGGTCACACC ACTAGAATAT GCAGACCCTA 840 

TCAAATGGTT TAACAATCAA GCCAAGGGAA TCGGTGAGTA TATTAAGGTG GACATGGTAA 900 

CTGGAAATGC GGATTTGGTG GACTTGAAGA CACCAATCAA GTATTCAGAC TCGGAGTATT 960 

TTAACCGTGA TGTCAAACGT CACCTGCGCT TGAAGTACCC GACCAAAATC TTTAAAACTC 1020 

CATCTTTTGA GGTGGACGAT GAGGGCAATC CTTTCTATGT AGCAACGGTT TACCAAAAGC 1080 

AATTTGGACT TGCTGTTCCT CGTCCTGCTT CAGTCATTAT CTTGGATGCT ACAAATGGAG 1140 

AAACCAAGGA ATACAGCTTA TCAGATGTTC CAGAATGGGT GGACAGGATC TATCCAGCAG 1200 

AGGAAACCAT TGAGCAAATC AACTACAACG GCAAGTACAA GGACGGTTTC TTGAATGCCA 1260 

TGATTTCCAA GAAAAACGTG ACCCAGACTA CCAATGGCTA TAATTACTTG TCTATCGGTA 1320 

ATGACATCTA TCTCTACACA GGTGTGACGT CGGCTAATGC GGATGAGAGT AATCTTGGTT 1380 

TCATCCTTGA AAATATGCGA ACAGGAGAAA TCACTAAGTA TAGCTTGGCT TCTGCGACAG 1440 

AAGAATCAGC CCGTGAATCA GCAGAAGGTG CTGTTCAGGA GAAATCCTAC AAAGCAACCT 1500 

TCCCAATCCT CATCAACCTC AATGACAAGC CTCTCTACAT CATGGGCTTG AAGGACAATG 1560 

CTGGCTTGGT CAAAGAGTAC GCCCTGGTAG ACGCAGTCGA GTACCAAAAT GTTATCGTTG 1620 

CTACTACAGT GGAAGAGATG CTCAGCAAGT ATGCCAATAA AAACGACCTT GAAATTGACA 1680 

ATGCAACGAC AGAAAGCATC AATGGAGTAG TAGCAGACCT CAAATCAGCT GTTATCAAt;G 1740 

GAGACACTGT CTACTTCTTT AAAGTTGATG GCAACATCTA CAAGGTCAAG GCTTCAGTAT 1800 

CCGATGACCT TCCTTACCTT GAAAATGGTA AAACCTTCGA AGGTCAAGTA GGAAAAGACA 1860 

ATTATCTCAA GACCTTTAAG CTACGGTAAA AATAGGTTTT TTTCAGAAAG TATATGTTAT 1920 

AATAAGGTAA ATTAAGCCG 1939 


(2) INFORMATION FOR SEQ ID NO: 311: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 907 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 311: 



CCTGCTAATA GAGAGAAAGA CTAGGAGTAG AAGTAAGCCA ATTAAATAAT GAGAAAGTTT 60 

CATACCCCGT CCTTTCATGT AGATTTGGTA TCGAAAGATA TCTGCGGATA TAAATGTAAC 120 

ATTATTTTTC TAATCTGTCA ATAAAATTTC TGACAATTTA ATAAATACAA CAAGGAGAGA 180 

GCAACAAGAC TTTCTCCTTT GTTATCCTAT TCTAAAATGT TTTTACCTTA ATCTGATAAA 240 

ATAATATCTT CGAGGGAGTA GCTAGCCGTC CAATCAAGAT ATTGTTTAGC TTTTGAAGCA 300 

TCTGCTAGGA CACTGGCTGG GTCACTAGCA CGTCGAGCAA CAATCTCGTG TGGGATTTTT 360 

TAATTTAGTA ATTCTTCAGC AGTTTTAAAG ATTTCTTTGA TAGTATAGCC TTTTTTAGTT 420 

CCTAAGTTAA AGATTTGAGA AGAACTGTCT TCTTGAAATA GGTAGTTCAT TCCTTTAACA 480 

TGAGCCTATG CAAGGTCCAA GACATAAATG TAATCTCGAA TACATGAACC GTCACGTGTA 540 

TCGTAGTCAT CTCCAAATAT TTTTAAGCTA TCATTTTGTC CCAATGCGGT CTTGTTGATA 600 

TTTGGAATGA TGTGAGTTGG ATTTTTCACA CGCAGACCGT TTGAAGCATC CATTTCAGCC 660 

CCAGCAACAT TAAAGTAACG GAAAATAACA TATTTCCAGT CGTAGCGATT GGCCATCCAG 720 

TAAATCATTC GTTCGCCCAT CAGTTTTGTC TCTGCATAAG GGTTGACAGG GTCGAGCnGG 780 

GTATCTTCAG TCACCGGCTT GTCAATACAG TTATTTCCAT AGAGAGAAGC AGTCGAAGAG 840 

AACATGATTT TTTGAATGCC AACTTCAGAT AAGACTTTGA GAACTTGGTT CATACCAGCA 900 

ACGTTGG 907 

(2) INFORMATION FOR SEQ ID NO: 312: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2170 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 312: 

CCACATAAAG GTAAATATCT TTTGTACTAT CTTGGGCATC CAAGAAAAGC AATTGGGCAA 60 

TAACAGAGTT AGCCATATTG TCTTCAACCG GACCTGTCAG CATAATGATG CGGTCTTTGA 120 

GAAGACGTGA GTAAATATCG TAAGAACGTT CTCCACGGCT TGTTTGTTCA ATAACTACAG 180 

GAATCATTCA TTTCTCCTTT TGAGTTTTAA TTTTGTTGGT CAAATGACTG AAGATAAGAC 240 

TATTATAATA TCTTGGTCAA AAAAGGTCAA ATTTTTGCTC TGCTTTCATT AGACAGAAAC 300 

AAAAACCCAA CCTCCTTTCG TGACTGGAAA TACTTTTCCA AGTCATTCTT CTTTTCGATC 360 

TTATTTTGTA CCGAACAAGC GGTCTCCAGC ATCTCCAAGA CCTGGAACGA TATAACCGTG 420 

TTCGTTCAAA CGTTCATCCA AGGCTGCTGT AAAGATTTCT ACATCTGGAT GAGCTTCTTG 480 

AAGGGCTTTT ACACCCTCTG GAGCAGATAC AAGGCAGACA AATTTGATAT TTGATGCGCC 540 

ACGTTTTTTA AGAGAATCAA CAGCCAAGAT TGCTGAGCCA CCTGTTGCCA ACATTGGGTC 600 

TACTACAAAA ATTTGACGTT GGTCAATGTC CTCAGGCAAT TTCACCAAGT ATTCAACTGG 660 

TTGAAGTGTT TCTTCATCAC GGTACATACC GATGTGGCCA ACTTTAGCAG CTGGAACCAA 720 

GTTCAAGAGA CCATCAACCA TCCCGATACC TGCACGCAAG ATTGGGACGA TGGCCAATTT 780 

CTTACCTGCC AATTGTTTTT GAACTGTTTT TGTAATTGGT GTTTCGATTT CCACATCTTC 840 

TAGTGGAAGA TCACGAAGTA CTTCATACCC CATCAACATT GCAATCTCAT CTACTAGCTC 900 

ACGAAAAGCT TTTGTAGAAG TATCTGTACG ACGCAAGATT GACAATTTGT GTTGAATCAG 960 

TGGGTGATTA ATAACTTCAA TTTTTCCCAT TTTTGGAATT CCTTCTTTCA ATTTATTCTT 1020 

CTTATTATAC CAAAAAACGG TTTAAAAATC TTTCTAAACC ATTTATTTTT GATAATTTTT 1080 

ACATTAGATC AGCCTCTTTA AGAGCTGTCT GTACTGTCTC AAGTGGTAAA TGGGTCAATT 1140 

CTGTCCCTTT TTCTTGATAA AGGTATTGGG CGTAGTCGTC CATTCGGTAC TGGTTGATAT 1200 

AAACCACGCG CTTGCAGCCG ACCTGAAGCA ATTGTTTTGT ACAGTTGAGA CAAGGAAAAT 1260 

GGGTTACATA GGCTGTAAAG CCTTTGGGAA CACCACGCTC AGCACCTTGA AGGATAGCAT 1320 

TGACCTCAGC GTGAAGGGTG CGAACGCAGT GGCCTTCAAT GACCAAACAT TCGTGATCAA 1380 

TACAATGCTC AGTCCCTGAC ACCGAACCAT TGTAACCAGT GGAAATAACC TTATTATCTT 1440 

TTACCAGAAT CGCGCCCACT TTAGCACGTT TACAAGTGGA ACGATTCGCA ATTAGTAGAG 1500 

CTTGGGCTGC AAAATACTCA TCCCAGGCCA GTCTTTTTTC AGTCATCTCT TTTCTCCTTT 1560 

TTCTCTATTT TTTAAAAAAT GGTAAACCTA AATCTGCAAT CTTTTCAGCT GGTACCTTCA 1620 

TGCCATCCTT GATCCATTTT AGAAGGACAG AGACGATGGC TGAGCTCCAG AAGGAATGAA 1680 

GATAAGAGCT GACACCTTTT GATTTCCCAT GGTATTTTTC TAGAAATTCC TGCATGGCTT 1740 

GGACAAAGAT TTTTTCCAGA TGGTAATCCA AGGCCAATTG AATTACTCTA GCTTCCTTTC 1800 

TGGCCTCCCG GAAAAGGTGA ACCCAAACCA AATAAAGGTC TGTCTTTAAA TCGTAATGAT 1860 

GCAGCTGTTC CATAATATTG TGGACAGTTC GTTTAAAGAC GCTCTCTAAA ATTTCCTCTT 1920 

TGGAGTCATA ATTGCGATAA AAGGCCGCAC GCGAAACACC TGCACGTTTG ACCAATTCAG 1980 

AAATACTAAT CTTGGTCAGT TCCTTTTTTT CCAAGAGTTG CAAGAGGGCT GTTTCAATGG 2040 

CTTCTCTGGT TAATAAATTG GATTCTTGGT TTGATTTTCT GAGATTTTCA AGAGACTTTT 2100 

CAGAGATTCT ACGTTCAGAC ATAACATTTT CTTTCTACTT GTCACAACAG ACGGATGATG 2160 

CTTTTGTTTC 2170 



(2) INFORMATION FOR SEQ ID NO: 313: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 539 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 313: 

ATCTGCACGA ATCAGGGCTT TCTAAGTGAC TATTTCCACC GAAATATTAT TTATATCAGG 60 

AGGACATTCA TATGTCACGT TATACAGGAC CATCTTGGAA ACAAGCTCGT CGTCTTGGCC 120 

TTTCACTTAC AGGTACAGGT AAAGAATTGG CACGTCGTAA CTACGTACCA GGACAACACG 180 

GACCAAACAA CCGTTCTAAA TTGTCAGAAT ACGGTTTGCA ATTGGCTGAA AAACAAAAAC 240 

TTCGTTTCAC TTACGGTGTA GGTGAAAAAC AATTCCGTAA CTTGTTCGTA CAAGCTACAA 300 

AAATCAAAGG CGGAATCCTA GGTTTCAACT TTATGCTTCT TTTGGAACGT CGTTTGGATA 360 

ACGTTGTTTA CCGTCTTGGT CTCGCGACTA CTCGTCCTCA AGCTCGTCAA TTCGTAAACC 420 

ACGGTCACAT CCTTGTTGAC GGGAAACGCG TTGATATCCC ATCATrCCGC GTAACTCCAG 480 

GTCAAGTGAT CTCAGTTCGT GAAArATCAT TGAAAGTTCC AGCAATCCTT GAAGCAGTA 539 



(2) INFORMATION FOR SEQ ID NO: 314: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 667 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 314: 

CCGGTTTTGC TCCTTCTCTA CGGCTACGAC GTGATGTATC TCTGATGATA TCCACTGTTT 60 

CTGTAGCAGG CGTAGGTGTT TCTGGACCTG CTTGTTCTGC TTTTTTCTCT GCCGTCGTAT 120 

AGGAAACAGC TACCCTTGTT GGGGTTTCAT TGTATTCTCT TTCAAGTTTC TTAGGTCTAA 180 

CAGGACCTGG ACCTGGTCTT GATCCACTTT CTTCCGCTGG AGAAGAAGGT ACATCTTGAC 240 

TTGGATGACT TGGAACACCA GGAGTTTCTC TTTGAATCTC ATCTGCTGGA GAAGCTGGTA 300 



